Hey everyone,
I have an issue that I have been wrestling with for a little while. I have a Zabbix server instance that cannot process events fast enough, and reporting typically lags by about 3 to 4 hours. I assume that either the server is not powerful enough or there is a problem with the configuration.
Server Specs
16 GB memory
8 CPUs
20 GB HDD
System
Number of Hosts: 594
New Values per Second: ~400
zabbix_server.conf
ExportFileSize=1G
StartPollers=40
StartIPMIPollers=0
StartPollersUnreachable=10
StartTrappers=6
StartPingers=1
StartDiscoverers=4
StartHTTPPollers=2
StartTimers=1
StartEscalators=1
StartVMwareCollectors=0
VMwareFrequency=60
VMwarePerfFrequency=60
VMwareCacheSize=8M
VMwareTimeout=10
SNMPTrapperFile=/tmp/zabbix_traps.tmp
StartSNMPTrapper=0
HousekeepingFrequency=4
MaxHousekeeperDelete=500
CacheSize=2G
CacheUpdateFrequency=60
StartDBSyncers=4
HistoryCacheSize=512M
HistoryIndexCacheSize=64M
TrendCacheSize=1G
ValueCacheSize=2G
Timeout=3
TrapperTimeout=300
UnreachablePeriod=45
UnavailableDelay=60
UnreachableDelay=15
What I am not sure about is whether I am throttling the services by having too many pollers, or whether there is some other configuration issue. The CPU load is around 1 on the host, so it doesn't seem particularly stressed.
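To sanity-check the memory side, here is a rough sketch that totals the cache settings from the config above and compares them with the 16 GB on the box. The helper names (`parse_size`, `cache_sizes`) are just mine for illustration, and I am assuming every `*CacheSize` value counts toward memory use, which may overstate things since the VMware collectors are disabled.

```python
import re

# Cache-related settings copied from the zabbix_server.conf listing above.
CONF = """\
VMwareCacheSize=8M
CacheSize=2G
HistoryCacheSize=512M
HistoryIndexCacheSize=64M
TrendCacheSize=1G
ValueCacheSize=2G
"""

# Binary multipliers for the K/M/G suffixes used in the config.
UNITS = {"K": 1024, "M": 1024**2, "G": 1024**3}


def parse_size(value):
    """Convert a size such as '512M' or '2G' to bytes."""
    match = re.fullmatch(r"(\d+)([KMG]?)", value.strip())
    if match is None:
        raise ValueError(f"unrecognised size: {value!r}")
    number, suffix = match.groups()
    return int(number) * UNITS.get(suffix, 1)


def cache_sizes(conf_text):
    """Return {parameter: bytes} for every parameter ending in 'CacheSize'."""
    sizes = {}
    for line in conf_text.splitlines():
        line = line.strip()
        # Skip blanks, comments, and anything that is not key=value.
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, value = line.split("=", 1)
        if key.strip().endswith("CacheSize"):
            sizes[key.strip()] = parse_size(value)
    return sizes


sizes = cache_sizes(CONF)
total_bytes = sum(sizes.values())
total_gib = total_bytes / 1024**3
print(f"configured caches: {total_gib:.2f} GiB of 16 GiB")
```

By this count the caches add up to roughly 5.6 GiB, which is an upper bound under the assumption above and still leaves room on a 16 GB host.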
Thoughts?