Hi,
We upgraded our Zabbix cluster from 2.0.5 yesterday to 2.2.4.
Everything went smoothly, no issues, etc...
One thing we noticed however, was that the server performance seems to be quite a bit different than the last version.
Our graphs for server performance show a lot more regular spikes than before, as well as our queues which were normally fairly empty are showing more residual delay. The items are processing, and moving through, but it's just taking longer. We're also seeing similar spikes in network traffic.
I've looked at server resources, and we're doing ok as far as I can tell. The servers have pretty good specs, and don't seem heavily utilized:
CPU: 32 cores, about 10% utilization
Memory: 128GB, about 53GB free
Our running config wasn't changed at all, as we manage it with Puppet.
Here's our config:
############ GENERAL PARAMETERS #################
LogFile=/redacted/log/zabbix/zabbix_server.log
LogFileSize=100
PidFile=/var/run/zabbix/zabbix_server.pid
DBHost=<redacted>
DBName=<redacted>
DBUser=<redacted>
DBPassword=<redacted>
DBPort=3306
StartPollers=60
StartPollersUnreachable=10
StartTrappers=20
StartPingers=30
StartDiscoverers=80
StartHTTPPollers=10
############ ADVANCED PARAMETERS ################
#DisableHousekeeping=1
CacheSize=512M
CacheUpdateFrequency=60
StartDBSyncers=50
HistoryCacheSize=64M
TrendCacheSize=128M
HistoryTextCacheSize=128M
Timeout=20
Include=/etc/zabbix/conf.d/server
AlertScriptsPath=/redacted/data/zabbix/alertscripts
JavaGateway=localhost
JavaGatewayPort=10052
StartJavaPollers=10
Here are the graphs:


Ideas:
So my first thought was that Housekeeping was somehow enabled by the upgrade, but it wasn't. We partition our databases every day, as well as they're running on their own Mysql cluster with equal level hardware. Everything there is running on a 10 drive SSD raid array as well, so response time is very fast.
Proxies, we upgraded all of them as well (due to no backward/forward compatibility), and they seem to be working as well. Values of our items appear current on all the hosts I've spot checked.
I looked to see if there was any kind of change with new required system variables/settings, but none of the new ones appeared mandatory.
Any ideas?
-Zillions
We upgraded our Zabbix cluster from 2.0.5 yesterday to 2.2.4.
Everything went smoothly, no issues, etc...
One thing we noticed however, was that the server performance seems to be quite a bit different than the last version.
Our graphs for server performance show a lot more regular spikes than before, as well as our queues which were normally fairly empty are showing more residual delay. The items are processing, and moving through, but it's just taking longer. We're also seeing similar spikes in network traffic.
I've looked at server resources, and we're doing ok as far as I can tell. The servers have pretty good specs, and don't seem heavily utilized:
CPU: 32 cores, about 10% utilization
Memory: 128GB, about 53GB free
Our running config wasn't changed at all, as we manage it with Puppet.
Here's our config:
############ GENERAL PARAMETERS #################
LogFile=/redacted/log/zabbix/zabbix_server.log
LogFileSize=100
PidFile=/var/run/zabbix/zabbix_server.pid
DBHost=<redacted>
DBName=<redacted>
DBUser=<redacted>
DBPassword=<redacted>
DBPort=3306
StartPollers=60
StartPollersUnreachable=10
StartTrappers=20
StartPingers=30
StartDiscoverers=80
StartHTTPPollers=10
############ ADVANCED PARAMETERS ################
#DisableHousekeeping=1
CacheSize=512M
CacheUpdateFrequency=60
StartDBSyncers=50
HistoryCacheSize=64M
TrendCacheSize=128M
HistoryTextCacheSize=128M
Timeout=20
Include=/etc/zabbix/conf.d/server
AlertScriptsPath=/redacted/data/zabbix/alertscripts
JavaGateway=localhost
JavaGatewayPort=10052
StartJavaPollers=10
Here are the graphs:


Ideas:
So my first thought was that Housekeeping was somehow enabled by the upgrade, but it wasn't. We partition our databases every day, as well as they're running on their own Mysql cluster with equal level hardware. Everything there is running on a 10 drive SSD raid array as well, so response time is very fast.
Proxies, we upgraded all of them as well (due to no backward/forward compatibility), and they seem to be working as well. Values of our items appear current on all the hosts I've spot checked.
I looked to see if there was any kind of change with new required system variables/settings, but none of the new ones appeared mandatory.
Any ideas?
-Zillions
Comment