For the past two days, I have had this problem:
I will log into Zabbix and notice that it thinks things are down that are not actually down. When I look further into it, I can see that no checks have been run on any external hosts for the past 16 hours (or any number of hours).
I am still getting checks for the client running on the same box as the Zabbix server, but not for any external clients. The way to resolve it seems to be restarting zabbix-server.
When I look in the zabbix-server.log, I see that entries for any of the external hosts just all of a sudden stop at one point. After that I see only warnings and errors for the host machine itself.
What could cause zabbix to stop checking all but the local client (127.0.0.1)? Monitoring can only be useful if I can trust it will continue to monitor without my intervention, so I'm hoping to figure this one out.
I will log into Zabbix and notice that it thinks things are down that are not actually down. When I look further into it, I can see that no checks have been run on any external hosts for the past 16 hours (or any number of hours).
I am still getting checks for the client running on the same box as the Zabbix server, but not for any external clients. The way to resolve it seems to be restarting zabbix-server.
When I look in the zabbix-server.log, I see that entries for any of the external hosts just all of a sudden stop at one point. After that I see only warnings and errors for the host machine itself.
What could cause zabbix to stop checking all but the local client (127.0.0.1)? Monitoring can only be useful if I can trust it will continue to monitor without my intervention, so I'm hoping to figure this one out.
Comment