I am experience a problem with zabbix during high loads.
What happens is that zabbix does not retrieve any data for a period of time. My guess is that it is not getting any process time during these high loads. I also noticed that by default zabbix_agentd runs with a nice value of 5.
in the zabbix_agentd.conf:
timeout=25
in the zabbix_server.conf:
# Number of pre-forked instances of pollers
# Default value is 6
# This parameter must be between 5 and 255
StartPollers=24
# Number of pre-forked instances of trappers
# Default value is 5
# This parameter must be between 2 and 255
StartTrappers=24
# How often ZABBIX will try to send unsent alerts
# (in seconds)
# Default value is 30 seconds
SenderFrequency=30
in the zabbix_server.log i get the following:
003010:20060823:130317 The value is not stored in database.
003044:20060823:130317 Timeout while receiving data from [bbapp6]
003044:20060823:130317 Getting value of [apache.bytesperreq] from host [bbapp6] failed
003044:20060823:130317 The value is not stored in database.
003012:20060823:130317 Timeout while receiving data from [bbapp6]
003012:20060823:130317 Getting value of [net.if.in[eth0]] from host [bbapp6] failed
This happens for ALL ITEMS MONITORED
Also, immediately after the load cameback down data started coming in again. So I really think this is a zabbix_agentd issue.
Is there a way to solve this or is there at the very least a way to monitor for this?
What happens is that zabbix does not retrieve any data for a period of time. My guess is that it is not getting any process time during these high loads. I also noticed that by default zabbix_agentd runs with a nice value of 5.
in the zabbix_agentd.conf:
timeout=25
in the zabbix_server.conf:
# Number of pre-forked instances of pollers
# Default value is 6
# This parameter must be between 5 and 255
StartPollers=24
# Number of pre-forked instances of trappers
# Default value is 5
# This parameter must be between 2 and 255
StartTrappers=24
# How often ZABBIX will try to send unsent alerts
# (in seconds)
# Default value is 30 seconds
SenderFrequency=30
in the zabbix_server.log i get the following:
003010:20060823:130317 The value is not stored in database.
003044:20060823:130317 Timeout while receiving data from [bbapp6]
003044:20060823:130317 Getting value of [apache.bytesperreq] from host [bbapp6] failed
003044:20060823:130317 The value is not stored in database.
003012:20060823:130317 Timeout while receiving data from [bbapp6]
003012:20060823:130317 Getting value of [net.if.in[eth0]] from host [bbapp6] failed
This happens for ALL ITEMS MONITORED
Also, immediately after the load cameback down data started coming in again. So I really think this is a zabbix_agentd issue.
Is there a way to solve this or is there at the very least a way to monitor for this?