Hi all,
Apologies in advance if I don't have enough information, but I am trying to troubleshoot an issue that I think could have any number of reasons so at the moment I am just trying to narrow it down.
We have a Zabbix instance running version 3.2.2, originally installed using the appliance I believe. The queue numbers are consistently very high (currently 500+ delayed more than 10 minutes) and we are missing large chunks of item data from each of our hosts across pretty much all monitored servers. I have attached a couple of screenshots showing the missing data on one of our Windows servers, as well as the Zabbix Queue - Overview by proxy view. Please note that the top proxy monitors 80 of our servers, the second one down 11 and the bottom one none which is why the queue is much larger on the first proxy in the view.
I have read a few forum posts as well as the documentation and I am a bit stuck at where to start on my troubleshooting. Some other things I have noted:
Apologies in advance if I don't have enough information, but I am trying to troubleshoot an issue that I think could have any number of reasons so at the moment I am just trying to narrow it down.
We have a Zabbix instance running version 3.2.2, originally installed using the appliance I believe. The queue numbers are consistently very high (currently 500+ delayed more than 10 minutes) and we are missing large chunks of item data from each of our hosts across pretty much all monitored servers. I have attached a couple of screenshots showing the missing data on one of our Windows servers, as well as the Zabbix Queue - Overview by proxy view. Please note that the top proxy monitors 80 of our servers, the second one down 11 and the bottom one none which is why the queue is much larger on the first proxy in the view.
I have read a few forum posts as well as the documentation and I am a bit stuck at where to start on my troubleshooting. Some other things I have noted:
- The Zabbix central server monitors itself and doesn't seem to miss any data at all, nor are any of its items in the queue
- There are constant errors in the Zabbix server log (/var/log/zabbix/zabbix_server.log) relating to ESXi monitoring timeouts. I have found that the credentials for the hosts are wrong so I will be getting that fixed. I have temporarily disabled them to see if the logs clear and if there is an impact on the queue numbers
- There are some errors relating to active checks (Cannot send list of active checks) on five hosts in the Zabbix proxy logs so I will clear them by fixing the agent hostnames
- All queued items are of type 'Zabbix Agent'
- The MySQL DB on both proxies looks like it hasn't had any tuning applied (InnoDB settings in my.cnf are default)
Comment