When any device's service check enters a hard critical state, we send an ICMP ping to determine whether the device itself is up or down.
When "Parents" are configured, we do NOT ping them to check whether they are up; we read their state from the database instead.
With this logic there is a potential 3-minute gap in which a race condition can occur: the parent's state in the database may not yet reflect that it is down, so we send separate alerts for the parent and for each of its children.
I propose changing the logic to actively ping the parents as well, so that we determine whether they are up directly rather than relying on a potentially stale database status.
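
A rough sketch of the proposed behavior, in Python, may help illustrate the idea. Everything named here is hypothetical: `Device`, `db_state`, `device_to_alert`, and the `ping` callable are stand-ins for illustration and are not Netreo's actual data model or API. The `ping` argument is assumed to return True when an address answers an ICMP echo; a fake is used below so the example runs without network access.

```python
from dataclasses import dataclass
from typing import Callable, Optional


@dataclass
class Device:
    # Hypothetical model, not the product's real schema.
    name: str
    address: str
    parent: Optional["Device"] = None
    db_state: str = "UP"  # last state stored in the database; may be stale


def device_to_alert(device: Device, ping: Callable[[str], bool]) -> Optional[Device]:
    """Decide which device a host-down alert should be raised for.

    Called after a service check on `device` goes hard critical.
    Returns None if the device answers ping (only the service alert applies).
    Otherwise pings each parent in turn, ignoring db_state, and returns the
    topmost device in the chain that is unreachable.
    """
    if ping(device.address):
        return None
    culprit = device
    node = device.parent
    # Actively ping each parent instead of trusting its stored state.
    while node is not None and not ping(node.address):
        culprit = node
        node = node.parent
    return culprit


# Example: core -> switch-1 -> server-1, with switch-1 actually down
# while the database still says it is UP.
core = Device("core", "10.0.0.1")
switch = Device("switch-1", "10.0.0.2", parent=core, db_state="UP")
server = Device("server-1", "10.0.0.3", parent=switch)

unreachable = {"10.0.0.2", "10.0.0.3"}


def fake_ping(address: str) -> bool:
    # Stand-in for a real ICMP ping.
    return address not in unreachable


result = device_to_alert(server, fake_ping)
print(result.name)  # prints "switch-1"
```

In this sketch the child's alert is attributed to the unreachable parent even though the stored state is stale, which is the case the race condition above gets wrong.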
Netreo Product Area: Netreo (IM)