First a bit about our environment:
Zabbix Server (RHEL7/CentOS7) hosted in AWS with RDS Aurora MySQL
Load Balanced Web Servers (RHEL7/CentOS7) hosted in RDS
18 Proxy Servers distributed globally (CentOS7) with MySQL as the DB
All Servers and Proxies are running version 3.4.7
Here's the unique piece: The proxy servers have both an eth0 network interface and an lo:1 loop back interface. The loop back interface is set up with an IP used in AnyCast so that any agent/device that talks to the proxies can use the same IP. This is working (and quite well too!). The proxies talk to Zabbix on the eth0 interface.
Some time around when the loop back interfaces were created and actived (i think), all of the web scenarios (there are 10) started to fail. Not just some of the time, ALL of the time. The error I am getting is: Step "<scenario name>" [1 of 3] failed: Timeout was reached: Connection timed out after 15001 milliseconds
I wouldn't be concerned if this was a one time thing on one or two of the checks, but its literally ALL of them. I checked the proxy logs and didn't find anything. Can someone please help?! I not really even sure where to start looking to troubleshoot this.
Zabbix Server (RHEL7/CentOS7) hosted in AWS with RDS Aurora MySQL
Load Balanced Web Servers (RHEL7/CentOS7) hosted in RDS
18 Proxy Servers distributed globally (CentOS7) with MySQL as the DB
All Servers and Proxies are running version 3.4.7
Here's the unique piece: The proxy servers have both an eth0 network interface and an lo:1 loop back interface. The loop back interface is set up with an IP used in AnyCast so that any agent/device that talks to the proxies can use the same IP. This is working (and quite well too!). The proxies talk to Zabbix on the eth0 interface.
Some time around when the loop back interfaces were created and actived (i think), all of the web scenarios (there are 10) started to fail. Not just some of the time, ALL of the time. The error I am getting is: Step "<scenario name>" [1 of 3] failed: Timeout was reached: Connection timed out after 15001 milliseconds
I wouldn't be concerned if this was a one time thing on one or two of the checks, but its literally ALL of them. I checked the proxy logs and didn't find anything. Can someone please help?! I not really even sure where to start looking to troubleshoot this.
Comment