Hi guys,
I have a custom item set up, which calls a script on the host being monitored.
This has been working fine, and have tested with zabbix_agentd -t.
I am now seeing log entries on the proxies:-
1642:20170731:093704.763 Zabbix agent item "net.check.private.ips" on host "host1" failed: first network error, wait for 15 seconds
1643:20170731:093719.905 resuming Zabbix agent checks on host "host1": connection restored
1645:20170731:093723.923 Zabbix agent item "net.check.private.ips" on host "host1" failed: first network error, wait for 15 seconds
1643:20170731:093739.016 resuming Zabbix agent checks on host "host1": connection restored
All other checks on that host work absolutely fine, however Zabbix is reporting that due to a network failure that one check (out of about 30) is down. Zabbix also reports that it loses connectivity to the agent, then a few seconds later the "connection restored" message is displayed.
The only thing I have noticed so far is that the script is taking a little longer than usual, but I have extended the "Timeout=" value in the agent config to 10 seconds. Unfortunately this hasn't helped.
Has anyone seen this before?
I have a custom item set up, which calls a script on the host being monitored.
This has been working fine, and have tested with zabbix_agentd -t.
I am now seeing log entries on the proxies:-
1642:20170731:093704.763 Zabbix agent item "net.check.private.ips" on host "host1" failed: first network error, wait for 15 seconds
1643:20170731:093719.905 resuming Zabbix agent checks on host "host1": connection restored
1645:20170731:093723.923 Zabbix agent item "net.check.private.ips" on host "host1" failed: first network error, wait for 15 seconds
1643:20170731:093739.016 resuming Zabbix agent checks on host "host1": connection restored
All other checks on that host work absolutely fine, however Zabbix is reporting that due to a network failure that one check (out of about 30) is down. Zabbix also reports that it loses connectivity to the agent, then a few seconds later the "connection restored" message is displayed.
The only thing I have noticed so far is that the script is taking a little longer than usual, but I have extended the "Timeout=" value in the agent config to 10 seconds. Unfortunately this hasn't helped.
Has anyone seen this before?
Comment