Hello,
We have deployed Zabbix 6.2 on a VM running Debian 11. This instance monitors 487 hosts, with 93 templates, 30368 items and 13733 triggers. The VM has 4 CPUs, 8 GB of RAM and 200 GB of storage.
When Zabbix is running, we see slowness across our environment, including on our RDS servers. We investigated our disk array, an MSA 2050 SAS. It has 2 volumes, each a RAID 5 of 9 SAS HDDs; the RDS servers and the Zabbix server are on the same volume. When Zabbix is running with all hosts enabled, we see a large increase in IOPS at the MSA level. See below: first the datastore with Zabbix running, then with it turned off:

I ran a second test: I disabled all hosts in Zabbix and gradually re-enabled them over about 3 hours. The volume graph below shows that as hosts are re-enabled, IOPS rise from approximately 50 to 300.
We conclude that there is a causal link between the IOPS generated by Zabbix and the slowness in our environment, but we don't know how to concretely reduce the IOPS that Zabbix generates.
Thanks in advance for your feedback. I am at your disposal for any questions.