I'm using Zabbix 1.8.1. There are several dozens of monitored hosts. 2 days ago a sudden flapping started ("Host unreachable") alerts. I consistently get about 10 emails per minute.
I tried the following:
- increased number of pollers
- changed most checks to Active
- changed "Server Template_Linux is unreachable" trigger expression to
It doesn't help. Furthermore, when I check "Events" for a problem host, it's empty. When I check "Graphs > Ping to the server (TCP)), I see that min=max=avg=1 for much more than 300 seconds. Yet still I get alerts about the host being unreachable.
What else can I do? How can I debug it?
Update: ok, that's just crazy. I disabled "Server is unreachable" trigger and I still get the emails. I made sure these are not some old ones by deleting all alerts from database and restarting Zabbix server.
I tried the following:
- increased number of pollers
- changed most checks to Active
- changed "Server Template_Linux is unreachable" trigger expression to
Code:
{Template_Linux:agent.ping.max(300)}=0
What else can I do? How can I debug it?
Update: ok, that's just crazy. I disabled "Server is unreachable" trigger and I still get the emails. I made sure these are not some old ones by deleting all alerts from database and restarting Zabbix server.
Comment