Every other night exactly around 11:30 - 11:40 I get these flood of alerts (unreachable for 5 minutes) from all my hosts. Before it used to usually start with "poller processes 75% busy:” and ends with “poller processes 75% busy”. But now I do not see Poller process alerts any longer may be because we switched to Active checks.
The issue won’t go away regardless of several changes to zabbix_server and mysql parameters. I also tried to delay the alert reporting (because unreachable issue corrects it self almost immediately) but that doesn't seem to be working correctly. So far the only way to avoid it is to change the severity of the alerts and opt out to receiving the text/emails. but that is very dangerous as we are using it Operationally. Here are my options:
Zabbix-server.conf
LogFile=/var/log/zabbix/zabbix_server.log
LogFileSize=0
PidFile=/var/run/zabbix/zabbix_server.pid
DBHost=localhost
DBName=db1
DBUser=dbusr
DBPassword=psswd
DBSocket=/var/lib/mysql/mysql.sock
StartPollers=1000
StartDiscoverers=150
SNMPTrapperFile=/var/log/snmptt/snmptt.log
ListenIP=0.0.0.0
HousekeepingFrequency=2
MaxHousekeeperDelete=1000
CacheSize=128M
StartDBSyncers=10
ValueCacheSize=128M
Timeout=30
UnreachableDelay=60
AlertScriptsPath=/usr/lib/zabbix/alertscripts
ExternalScripts=/usr/lib/zabbix/externalscripts
Zabbix-agent.conf
PidFile=/var/run/zabbix/zabbix_agentd.pid
LogFile=/var/log/zabbix/zabbix_agentd.log
LogFileSize=0
Server=localhost
ListenPort=10050
ServerActive=1.2.3.4
Hostname=Server1.gov
Include=/etc/zabbix/zabbix_agentd.d/
My.cnf
[mysqld]
datadir=/var/lib/mysql
socket=/var/lib/mysql/mysql.sock
user=mysql
max_connections=10000
wait_timeout=95000
max_allowed_packet=1000M
innodb_buffer_pool_size=32G
innodb_buffer_pool_instances=1
query_cache_type=1
query_cache_size=128M
join_buffer_size=300
table_open_cache=2500
binlog_format=mixed
symbolic-links=0
[mysqld_safe]
log-error=/var/log/mysqld.log
pid-file=/var/run/mysqld/mysqld.pid
The issue won’t go away regardless of several changes to zabbix_server and mysql parameters. I also tried to delay the alert reporting (because unreachable issue corrects it self almost immediately) but that doesn't seem to be working correctly. So far the only way to avoid it is to change the severity of the alerts and opt out to receiving the text/emails. but that is very dangerous as we are using it Operationally. Here are my options:
Zabbix-server.conf
LogFile=/var/log/zabbix/zabbix_server.log
LogFileSize=0
PidFile=/var/run/zabbix/zabbix_server.pid
DBHost=localhost
DBName=db1
DBUser=dbusr
DBPassword=psswd
DBSocket=/var/lib/mysql/mysql.sock
StartPollers=1000
StartDiscoverers=150
SNMPTrapperFile=/var/log/snmptt/snmptt.log
ListenIP=0.0.0.0
HousekeepingFrequency=2
MaxHousekeeperDelete=1000
CacheSize=128M
StartDBSyncers=10
ValueCacheSize=128M
Timeout=30
UnreachableDelay=60
AlertScriptsPath=/usr/lib/zabbix/alertscripts
ExternalScripts=/usr/lib/zabbix/externalscripts
Zabbix-agent.conf
PidFile=/var/run/zabbix/zabbix_agentd.pid
LogFile=/var/log/zabbix/zabbix_agentd.log
LogFileSize=0
Server=localhost
ListenPort=10050
ServerActive=1.2.3.4
Hostname=Server1.gov
Include=/etc/zabbix/zabbix_agentd.d/
My.cnf
[mysqld]
datadir=/var/lib/mysql
socket=/var/lib/mysql/mysql.sock
user=mysql
max_connections=10000
wait_timeout=95000
max_allowed_packet=1000M
innodb_buffer_pool_size=32G
innodb_buffer_pool_instances=1
query_cache_type=1
query_cache_size=128M
join_buffer_size=300
table_open_cache=2500
binlog_format=mixed
symbolic-links=0
[mysqld_safe]
log-error=/var/log/mysqld.log
pid-file=/var/run/mysqld/mysqld.pid

Comment