Recently I began getting notifications from zabbix such as the following...
This started a few days ago and I can't think of any changes to my server or it's configuration that could effect this. At the same time my CPU usage spiked and has stayed very high.
The server runs on CentOS 5.6, Zabbix-1.8.6 (occurred during 1.8.5 also) using MySQL backend. The system is a VM with 2CPUs and 2GB RAM. Thus far it's performed well but ever since this history write cache problem the processor load has begun to spike.
I've already tried doubling the default size on both "HistoryCacheSize" to 16M and "HistoryTextCacheSize" to 32M with the problem still persisting. I get notifications from zabbix many times an hour now about this problem.
Here are a few graphs to illustrate the problem and see the config for this trigger...It's hard to see on the uploaded images but right on July 31 is when both CPU and History Write Cache began to go crazy, and the 15 minute load average has stayed around 4 (w/ 2 cores).
The first image is the graph of the History Write Cache % Free
The second is the config for the trigger
The third is the CPU spike I have seen...
Forth is the overall workload on the zabbix server.
Here is my entire zabbix_server.conf (with sensative info removed)
Thanks
- Trey
Host: Zabbix server
Less than 20% free in the history cache: PROBLEM
Last value: 10.969925
Problem started: 2011.08.09 10:01:23 Age: 10m
Less than 20% free in the history cache: PROBLEM
Last value: 10.969925
Problem started: 2011.08.09 10:01:23 Age: 10m
The server runs on CentOS 5.6, Zabbix-1.8.6 (occurred during 1.8.5 also) using MySQL backend. The system is a VM with 2CPUs and 2GB RAM. Thus far it's performed well but ever since this history write cache problem the processor load has begun to spike.
I've already tried doubling the default size on both "HistoryCacheSize" to 16M and "HistoryTextCacheSize" to 32M with the problem still persisting. I get notifications from zabbix many times an hour now about this problem.
Here are a few graphs to illustrate the problem and see the config for this trigger...It's hard to see on the uploaded images but right on July 31 is when both CPU and History Write Cache began to go crazy, and the 15 minute load average has stayed around 4 (w/ 2 cores).
The first image is the graph of the History Write Cache % Free
The second is the config for the trigger
The third is the CPU spike I have seen...
Forth is the overall workload on the zabbix server.
Here is my entire zabbix_server.conf (with sensative info removed)
LogFile=/var/log/zabbix/zabbix_server.log
PidFile=/var/run/zabbix/zabbix_server.pid
DBHost=localhost
DBName=....
DBUser=....
DBPassword=....
DBSocket=/var/lib/mysql/mysql.sock
StartPollers=10
StartIPMIPollers=3
StartPingers=5
HousekeepingFrequency=3
HistoryCacheSize=16M
TrendCacheSize=8M
HistoryTextCacheSize=32M
AlertScriptsPath=/home/zabbix/bin/
FpingLocation=/usr/sbin/fping
PidFile=/var/run/zabbix/zabbix_server.pid
DBHost=localhost
DBName=....
DBUser=....
DBPassword=....
DBSocket=/var/lib/mysql/mysql.sock
StartPollers=10
StartIPMIPollers=3
StartPingers=5
HousekeepingFrequency=3
HistoryCacheSize=16M
TrendCacheSize=8M
HistoryTextCacheSize=32M
AlertScriptsPath=/home/zabbix/bin/
FpingLocation=/usr/sbin/fping
- Trey
Comment