Last night we had a power outage that affected our entire network. At present I am monitoring 314 hosts. This morning, after the power came back, Zabbix reports 6 hosts as not responding to ICMP ping. Yet it then tells me that 5 of them are up and one is down. All 6 units are actually up and running with no problems. How do I get rid of these non-problems?
Hosts that are up and down at the same time
-
What template do you use for these hosts? What do your item and the corresponding trigger look like? Are you sure that these hosts respond to ping correctly? Zabbix uses fping for ICMP checks, so you can try checking manually.
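For the manual check, something like the following could be run from the Zabbix server itself. It is only a rough sketch: it assumes `fping` is installed and on the PATH, and that it prints its usual "<host> is alive" / "<host> is unreachable" lines. The function names and the example addresses are made up for illustration.

```python
import subprocess


def parse_fping_output(text):
    """Map each host in fping's default output to True (alive) or False."""
    results = {}
    for line in text.splitlines():
        line = line.strip()
        if line.endswith(" is alive"):
            results[line[: -len(" is alive")]] = True
        elif line.endswith(" is unreachable"):
            results[line[: -len(" is unreachable")]] = False
    return results


def check_hosts(hosts):
    """Run fping against the given hosts and return {host: reachable}.

    fping exits non-zero when any host is unreachable, so the exit code
    is deliberately ignored and only the printed lines are parsed.
    """
    proc = subprocess.run(["fping", *hosts], capture_output=True, text=True)
    return parse_fping_output(proc.stdout)


# Hypothetical usage (placeholder addresses, replace with the 6 hosts):
# print(check_hosts(["192.0.2.10", "192.0.2.11"]))
```

If every host comes back as alive here while Zabbix still shows a problem, the cause is more likely the item or trigger configuration than the network.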
First, I am using the default ICMP ping that is built into Zabbix, via a Ubiquiti template and a Mikrotik template. I know the templates are good, as almost all of the 330 hosts I have are either Mikrotik or Ubiquiti devices. As for the pings, yes, they are responding. I have pinged them using both ping and fping from several Windows machines and several Linux machines, including the one that Zabbix is running on. I can also ping them from other Mikrotik and Ubiquiti devices, and I can access all the devices using any number of methods. They all respond normally. Thanks for your help.
-
In the standard "ICMP Ping" template, a host is treated as unavailable only after 3 consecutive failed icmpping values, and the item is polled once a minute by default. So your trigger will fire only after 3 minutes. In the same way, your recovery expression is the same as the trigger expression, so the trigger returns to the OK state only after 3 successful pings, which also takes 3 minutes.
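To make the timing concrete, here is a small simulation of the behaviour described above. It is a simplified model, not Zabbix's actual evaluation code: it assumes one icmpping sample per minute (1 = reply, 0 = no reply), a change to PROBLEM after 3 consecutive failures, and a return to OK after 3 consecutive successes. The function name is made up, and the real behaviour depends on the trigger expression in your template, so check that first.

```python
def simulate_trigger(samples, window=3):
    """Return the trigger state after each sample (one sample per minute).

    Assumed model: the state changes only when the last `window` samples
    all agree; anything mixed leaves the previous state in place.
    """
    state = "OK"
    states = []
    for i in range(len(samples)):
        recent = samples[max(0, i - window + 1): i + 1]
        if len(recent) == window:
            if all(v == 0 for v in recent):
                state = "PROBLEM"
            elif all(v == 1 for v in recent):
                state = "OK"
        states.append(state)
    return states


# One good ping, three lost pings during the outage, then three good pings.
history = [1, 0, 0, 0, 1, 1, 1]
for minute, state in enumerate(simulate_trigger(history), start=1):
    print(minute, state)
```

In this model the host is already answering pings again at minute 5, but the trigger stays in PROBLEM until minute 7. That is one way a host can look "up" and "down" at the same time for a few minutes after an outage.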