SYSTEM DESCRIPTION
----------------------
Zabbix Server 1.8.11 is installed on Ubuntu Linux 12.04.
A Windows Server 2008 R2 server has the following agents:
1) Standard Windows Zabbix agent
2) Two "Jabcat" agents embedded in two Java applications
There are many servers on the network, and each of them has a similar setup (the Windows Zabbix agent and one or more Jabcat agents).
PROBLEM DESCRIPTION
----------------------
Everything has been working just fine for several months now. However, on one of the Windows servers, all of the Zabbix agents have suddenly begun behaving strangely.
In the Zabbix Server logs I see it complain about "network errors"; it waits 15 seconds and sometimes reconnects, but soon fails again. Sometimes it cannot connect for so long that the agents' triggers start going off. Every single Zabbix agent on this one Windows machine is failing in this way, and no other machine is having this issue.
TROUBLESHOOTING PERFORMED
-------------------------------
1) Ping from Zabbix Server -> Windows machine lost no packets while running for 15 minutes.
2) Ping from Windows machine -> Zabbix Server lost no packets while running for 15 minutes.
3) CPU / memory usage looks fine on both the Zabbix Server and the Windows machine.
4) Windows system logs show no errors in the period before this behavior began.
5) No errors in Zabbix Server's system logs, etc.
6) On the Windows system, I have many network-dependent processes running, and none of them are having network connection issues. Only the Zabbix agents (all 3 of them) are having an issue.
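The ping tests above only exercise ICMP, so a TCP-level check against the agent port might help narrow things down. Below is a minimal sketch, not a definitive diagnostic: it assumes the Windows agent listens on the Zabbix default port 10050, and the function names (tcp_probe, probe_repeatedly, failure_count) are made up for illustration. The Jabcat agents' ports are not given here, so they would have to be substituted in.

```python
import socket
import time


def tcp_probe(host, port, timeout=3.0):
    """Try to open one TCP connection; return (ok, detail)."""
    start = time.monotonic()
    try:
        # create_connection resolves the host and performs the TCP handshake.
        with socket.create_connection((host, port), timeout=timeout):
            return True, "connected in %.3fs" % (time.monotonic() - start)
    except OSError as exc:
        # Covers refused connections, timeouts, and name resolution failures.
        return False, "%s: %s" % (type(exc).__name__, exc)


def probe_repeatedly(host, port, attempts=5, delay=1.0, timeout=3.0):
    """Probe the same host/port several times, pausing between attempts."""
    results = []
    for i in range(attempts):
        results.append(tcp_probe(host, port, timeout))
        if i < attempts - 1:
            time.sleep(delay)
    return results


def failure_count(results):
    """Count how many probes failed."""
    return sum(1 for ok, _ in results if not ok)
```

Run from the Zabbix Server, something like probe_repeatedly("windows-host.example", 10050, attempts=60) (the host name is a placeholder) would show whether TCP connections to the agent fail intermittently even while ping stays clean.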
Any ideas, guys? I have spent a day and a half tracking this down, and I am officially at a loss.