last night me (and my boss
) got woken up at 4 am because of a "zabbix agent nor responding" alert. We use {tmpl_agent_important:agent.ping.nodata(300)}=1 as alerting trigger (send sms if no data for 5 minutes, agent.ping is checked every 30 sec) .
It seems that the agent did not return any data for about 7 minutes. At this time, the monitored host was unter load, doing logrotates.
Has anyone had similar problems with Linux agents? (I've found similar threads, but all talking about windows only)
Could you give advice on how to work around this false positive? I don't just want to disable alerting between 03:45 and 04:15 because I fear I could miss real alerts. Any better options?
Environment: Zabbix 1.4.2, Server Centos 5, Agent Centos 4.5
) got woken up at 4 am because of a "zabbix agent nor responding" alert. We use {tmpl_agent_important:agent.ping.nodata(300)}=1 as alerting trigger (send sms if no data for 5 minutes, agent.ping is checked every 30 sec) .It seems that the agent did not return any data for about 7 minutes. At this time, the monitored host was unter load, doing logrotates.
Has anyone had similar problems with Linux agents? (I've found similar threads, but all talking about windows only)
Could you give advice on how to work around this false positive? I don't just want to disable alerting between 03:45 and 04:15 because I fear I could miss real alerts. Any better options?
Environment: Zabbix 1.4.2, Server Centos 5, Agent Centos 4.5
Comment