Hi,
My current setup is:
1 Zabbix server 3.4
2 Zabbix proxies 3.4
Monitoring a total of 40 hosts.
Once in a while (at least twice a week) I get notifications about: Problem: Zabbix agent on {HOST.NAME} is unreachable for 5 minutes
But for all of my hosts! I get that for every host I am monitoring.
I checked my Zabbix servers and they were all ok. No downtime there.
The only log entry I have during the time that happens is:
... Which is not related at all.
What I have checked so far:
What could cause such an error like that?
Cheers!
EDIT 1:
I've noticed that the hosts that are monitored directly from the Zabbix Server did not get that problem. Only the hosts monitored by a Zabbix Proxy host.
I don't see any error message for those Zabbix Proxies. I will increase the log_level to a debug mode and leave like that.. so next time it happens I'll have more data.
EDIT 2:
I was able to find this only Log error:
That server is located in another country and it communicates with "172.30.1.118" (Zabbix Server) via a Site-to-Site VPN.
Maybe the VPN went down for a little bit?
Is there a way to increase the "timeout" option? so I don't get spammed with lots of "Zabbix agent on {HOST.NAME} is unreachable for 5 minutes" errors?
My current setup is:
1 Zabbix server 3.4
2 Zabbix proxies 3.4
Monitoring a total of 40 hosts.
Once in a while (at least twice a week) I get notifications about: Problem: Zabbix agent on {HOST.NAME} is unreachable for 5 minutes
But for all of my hosts! I get that for every host I am monitoring.
I checked my Zabbix servers and they were all ok. No downtime there.
The only log entry I have during the time that happens is:
Code:
failed to send email: wrong answer on RCPT TO "550 #5.1.0 Address rejected."
What I have checked so far:
- Zabbix server did not restart by itself
- Zabbix proxies did not restart by themselves
- Network was up during that time.
What could cause such an error like that?
Cheers!
EDIT 1:
I've noticed that the hosts that are monitored directly from the Zabbix Server did not get that problem. Only the hosts monitored by a Zabbix Proxy host.
I don't see any error message for those Zabbix Proxies. I will increase the log_level to a debug mode and leave like that.. so next time it happens I'll have more data.
EDIT 2:
I was able to find this only Log error:
Code:
1833:20180722:112927.573 cannot send proxy data to server at "172.30.1.118": ZBX_TCP_READ() timed out
Maybe the VPN went down for a little bit?
Is there a way to increase the "timeout" option? so I don't get spammed with lots of "Zabbix agent on {HOST.NAME} is unreachable for 5 minutes" errors?
Comment