Hi, I'm running Zabbix 3.4, monitoring about 30 hosts. Since updating to 3.4, some (not all) Windows hosts trigger "unreachable for 5 minutes". That status isn't flapping, it stays in Problem mode for days. The alerts don't clear upon restarting the agent on the hosts, network connectivity between the hosts and the server seems fine,and there don't appear to be any relevant items in the agent or server logs. Zabbix server performance looks fine, with no items in the queue graph. The only thing that restores connectivity is restarting the zabbix-server service. Then it works fine for a day or two until hosts start failing again.
I've increased the "Timeout" setting on the server; increased the StartAgents on the host, and increased the BufferSend and BufferSize values. I've also tried removing the hosts and letting them re-register.
Any suggestions or logs I should attach? Assistance appreciated.
I've increased the "Timeout" setting on the server; increased the StartAgents on the host, and increased the BufferSend and BufferSize values. I've also tried removing the hosts and letting them re-register.
Any suggestions or logs I should attach? Assistance appreciated.
Comment