Hi All,
I have been having a weird problem with my Zabbix 2.4 for a few weeks and I haven't been able to pinpoint it.
For the record, I have been running it for a while without any issues. Yes, it runs in a virtual machine, and the VM has 3 GB of RAM allocated to it. Before anyone points to the virtual environment, the physical server is barely doing anything at all.
The Zabbix VM has 4 CPUs assigned to it and is running CentOS 6 x64. As I said, it worked well for a year or so until it started acting up.
There are 49 monitored hosts, 2742 items and 462 triggers. This is as per the Status dashboard.
The issue is that Zabbix starts reporting that ICMP response times and packet loss are too high. This is somewhat random and can happen to pretty much any host, even one that sits right next to it on the same IP network. It obviously causes all sorts of problems: hosts appear to keep going up and down, and we get hundreds of alerts that we should never have received.
If I ping these hosts from the command line, I can see the issue happening: the response times go up for a few pings and then come back down. But if I stop Zabbix and run the same pings, the problem does not happen, and I have tested this for long periods of time.
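For anyone who wants to repeat the comparison above (same pings with Zabbix running and with it stopped), here is a minimal sketch of how the ping output could be checked for spikes automatically. It is only an illustration, not something taken from Zabbix itself. It assumes Python 3.5+ and the Linux `ping -c` output format with a `time=... ms` field; the function names and the 10 ms threshold are hypothetical choices.

```python
import re
import subprocess

# Matches the "time=12.3 ms" field printed by the Linux iputils ping.
RTT_PATTERN = re.compile(r"time=([\d.]+)\s*ms")


def parse_rtts(ping_output):
    """Return the round-trip times (in ms) found in ping output text."""
    return [float(value) for value in RTT_PATTERN.findall(ping_output)]


def find_spikes(rtts, threshold_ms):
    """Return (index, rtt) pairs for replies slower than threshold_ms."""
    return [(i, rtt) for i, rtt in enumerate(rtts) if rtt > threshold_ms]


def run_ping(host, count=60):
    """Run the system ping and return its stdout.

    Assumption: a Linux-style ping that accepts '-c <count>'.
    """
    result = subprocess.run(
        ["ping", "-c", str(count), host],
        stdout=subprocess.PIPE,
        stderr=subprocess.STDOUT,
        universal_newlines=True,
    )
    return result.stdout
```

Calling `find_spikes(parse_rtts(run_ping("192.0.2.10")), 10.0)` once while Zabbix is running and once while it is stopped gives two lists that can be compared directly (the address is a placeholder for one of the monitored hosts).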
I thought there might be an issue with the database, but that doesn't seem to be it. I am using MySQL and the database is 4.1 GB, which is not very large.
Does anyone have any ideas that could help me figure out the problem?
Thanks a lot in advance.
Vini