I have been using Zabbix for several years, mostly to monitor what I considered small to medium site systems: 300 to 600 devices with 40,000 items and 6,500 triggers. Performance has mostly not been an issue. We want to upgrade all of these sites to 3.4 and then move to a master server with a proxy at each site.
I upgraded the first site from 3.2 to 3.4 and then found that the dashboard would show 10 to 15 false positives for devices being unreachable. I tried the usual steps: looking at the Zabbix performance data and adjusting the zabbix_server.conf and my.cnf files. That did not help much. I also moved the database to a second server and trimmed the history tables.
Finally, I ran atop and found that 3.4 was using 80 to 90% disk I/O. I then restored 3.2, checked atop again, and found that disk I/O was back down to 20 to 30%.
So my question is: does 3.4 really use that much more disk I/O? What can I expect in 4.0?
It is going to be very hard to sell a full-scale rollout of a nationwide Zabbix deployment if we have this issue with 3.4 / 4.0.
Any insight would be helpful.