[SOLVED] (see last post)
Ok... this is so weird...
I deployed Zabbix at one of my clients to monitor their Windows servers. I created 10 hosts, assigned them the Windows OS template, and installed agents on every host.
But every 5 minutes or so, triggers fire saying that the agent on some machines has been unreachable for 5 minutes (agent.ping).
When that happens, I try to ping those machines manually from a terminal on the Zabbix server, and 100% of the packets are lost. The only way I can ping them again is to stop the zabbix-server service; after that, the machines reply to pings again.
While this happens, I can ping those machines from every other machine on the network with 0% packet loss, so the problem is limited to the Zabbix server.
If I leave it alone and wait 10-15 minutes, the triggers go back to OK and I can ping those machines manually again, until they fail again 5-10 minutes later.
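To pin down exactly when the outages start and stop, something like the following could be left running on the Zabbix server. This is only a rough sketch, not what was actually used here: it probes the agent over TCP rather than with ICMP ping, it assumes the agents listen on the default port 10050, and the names tcp_reachable and log_reachability are made up for illustration.

```python
import socket
import time
from datetime import datetime

AGENT_PORT = 10050  # assumption: agents listen on the default Zabbix agent port


def tcp_reachable(host, port=AGENT_PORT, timeout=3.0):
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts and unreachable hosts.
        return False


def log_reachability(hosts, rounds, interval=30.0):
    """Print one timestamped up/DOWN line per host for each round."""
    for i in range(rounds):
        for host in hosts:
            state = "up" if tcp_reachable(host) else "DOWN"
            stamp = datetime.now().isoformat(timespec="seconds")
            print(f"{stamp} {host} {state}")
        if i < rounds - 1:
            time.sleep(interval)
```

Calling log_reachability(["192.0.2.10", "192.0.2.11"], rounds=40) (placeholder addresses, replace with the real hosts) would cover roughly 20 minutes, enough to see one full fail-and-recover cycle and compare the timestamps with the trigger history.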
The Zabbix server runs on an Ubuntu 14.04 LTS virtual machine, fully updated with apt-get update and upgrade. The machine doesn't seem to be under heavy load, even when the problem happens.
This is the output of top:
http://pastebin.com/mk35UkxU
This is zabbix.conf without comments:
http://pastebin.com/BRF59KVL
If you need more info, tell me what you need.
Does anyone know what is happening here?