Hi
I am having a problem with Zabbix 2.2.3, namely that is doesn't appear to be processing all values because one History syncer iis hanging with 100% CPU
It looks similar to https://support.zabbix.com/browse/ZBX-7725 in symptoms but I am not sure how to troubleshoot it.
My Setup is
Zabbix 2.2.3 on RHEL 6 with around 80 Values per second.
Oracle 11R2 on RHEL 6 as backend
My process output looks like this:
zabbix 16120 16092 0 16:42 ? 00:00:00 zabbix_server: history syncer #4 [synced 1 items in 0.006565 sec, syncing history]
zabbix 16121 16092 99 16:42 ? 01:54:52 zabbix_server: history syncer #5 [synced 0 items in 0.000029 sec, syncing history]
zabbix 16122 16092 0 16:42 ? 00:00:00 zabbix_server: history syncer #6 [synced 0 items in 0.001868 sec, syncing history]
as you can see one of the threads are consuming 100% CPU while all the others are sitting at 0% CPU and syncing history
I have checked the queue and it appears it is the same 200+ values which are stuck in the queue, some of them for several hours i.e. the oldest entries are not being processed. It queue depth doesn't seem to increase på 80 values a sec, but that is most likely because the agengt are reporting connection failures
I have checked the database and there is no load on it at all. Load average shows 0.1 or so.
I have tried increasing the various cache sizes from default to 256M, and the DBsyncers up to 32 and now back to 16, but it was also a problem with the default value of 4.
Does someone have some pointers in how to overcome this issue? or what to look for in the Logs (I have enabled DEbug level 4 and Log slow queries)
I am having a problem with Zabbix 2.2.3, namely that is doesn't appear to be processing all values because one History syncer iis hanging with 100% CPU
It looks similar to https://support.zabbix.com/browse/ZBX-7725 in symptoms but I am not sure how to troubleshoot it.
My Setup is
Zabbix 2.2.3 on RHEL 6 with around 80 Values per second.
Oracle 11R2 on RHEL 6 as backend
My process output looks like this:
zabbix 16120 16092 0 16:42 ? 00:00:00 zabbix_server: history syncer #4 [synced 1 items in 0.006565 sec, syncing history]
zabbix 16121 16092 99 16:42 ? 01:54:52 zabbix_server: history syncer #5 [synced 0 items in 0.000029 sec, syncing history]
zabbix 16122 16092 0 16:42 ? 00:00:00 zabbix_server: history syncer #6 [synced 0 items in 0.001868 sec, syncing history]
as you can see one of the threads are consuming 100% CPU while all the others are sitting at 0% CPU and syncing history
I have checked the queue and it appears it is the same 200+ values which are stuck in the queue, some of them for several hours i.e. the oldest entries are not being processed. It queue depth doesn't seem to increase på 80 values a sec, but that is most likely because the agengt are reporting connection failures
I have checked the database and there is no load on it at all. Load average shows 0.1 or so.
I have tried increasing the various cache sizes from default to 256M, and the DBsyncers up to 32 and now back to 16, but it was also a problem with the default value of 4.
Does someone have some pointers in how to overcome this issue? or what to look for in the Logs (I have enabled DEbug level 4 and Log slow queries)

Comment