Zabbix b5 went ape shit on me this morning. Between 8:37 AM and 8:44 Zabbix triggered all nodata() triggers and everyone's pager went off. In addition, the CPU utilization dropped at the same time. It appears zabbix stopped allowing connections into it.
It appears a bunch of zabbix_server processes are missing. After restarting the server, I have 11 zabbix_server processes. Below I see 4.
I also saved the log files if anyone is interested. I don't understand the date format of them though.
-bb
# telnet localhost 10051
Trying 127.0.0.1...
telnet: Unable to connect to remote host: Connection refused
root 10969 0.0 0.2 2388 1064 ? Ss Jan02 0:01 /bin/sh /usr/bin/zabbix_check_server 3 120 root
zabbix 20690 0.1 0.1 2516 1012 ? SN Jan02 4:23 /usr/bin/zabbix_agentd
zabbix 20691 0.0 0.1 2516 716 ? SN Jan02 0:00 /usr/bin/zabbix_agentd
zabbix 20692 0.0 0.1 2516 716 ? SN Jan02 0:00 /usr/bin/zabbix_agentd
zabbix 20693 0.0 0.1 2516 716 ? SN Jan02 0:00 /usr/bin/zabbix_agentd
zabbix 20694 0.0 0.1 2516 716 ? SN Jan02 0:00 /usr/bin/zabbix_agentd
zabbix 20695 0.0 0.1 2516 940 ? SN Jan02 0:23 /usr/bin/zabbix_agentd
zabbix 26455 0.0 0.3 5324 1676 ? S 08:30 0:00 /usr/bin/zabbix_server
zabbix 26457 0.1 0.3 5324 1780 ? S 08:30 0:10 /usr/bin/zabbix_server
zabbix 26459 0.0 0.3 5324 1752 ? S 08:30 0:05 /usr/bin/zabbix_server
zabbix 26461 0.0 0.3 5324 1608 ? S 08:30 0:00 /usr/bin/zabbix_server
##zabbix_server.conf:
Server=1
StartSuckers=6
StartTrappers=5
ListenPort=10051
HousekeepingFrequency=1
SenderFrequency=30
DebugLevel=4
Timeout=5
PidFile=/var/run/zabbix/server.pid
LogFile=/var/log/zabbix/server.log
AlertScriptsPath=/home/zabbix/bin/
FpingLocation=/usr/sbin/fping
It appears a bunch of zabbix_server processes are missing. After restarting the server, I have 11 zabbix_server processes. Below I see 4.
I also saved the log files if anyone is interested. I don't understand the date format of them though.
-bb
# telnet localhost 10051
Trying 127.0.0.1...
telnet: Unable to connect to remote host: Connection refused
root 10969 0.0 0.2 2388 1064 ? Ss Jan02 0:01 /bin/sh /usr/bin/zabbix_check_server 3 120 root
zabbix 20690 0.1 0.1 2516 1012 ? SN Jan02 4:23 /usr/bin/zabbix_agentd
zabbix 20691 0.0 0.1 2516 716 ? SN Jan02 0:00 /usr/bin/zabbix_agentd
zabbix 20692 0.0 0.1 2516 716 ? SN Jan02 0:00 /usr/bin/zabbix_agentd
zabbix 20693 0.0 0.1 2516 716 ? SN Jan02 0:00 /usr/bin/zabbix_agentd
zabbix 20694 0.0 0.1 2516 716 ? SN Jan02 0:00 /usr/bin/zabbix_agentd
zabbix 20695 0.0 0.1 2516 940 ? SN Jan02 0:23 /usr/bin/zabbix_agentd
zabbix 26455 0.0 0.3 5324 1676 ? S 08:30 0:00 /usr/bin/zabbix_server
zabbix 26457 0.1 0.3 5324 1780 ? S 08:30 0:10 /usr/bin/zabbix_server
zabbix 26459 0.0 0.3 5324 1752 ? S 08:30 0:05 /usr/bin/zabbix_server
zabbix 26461 0.0 0.3 5324 1608 ? S 08:30 0:00 /usr/bin/zabbix_server
##zabbix_server.conf:
Server=1
StartSuckers=6
StartTrappers=5
ListenPort=10051
HousekeepingFrequency=1
SenderFrequency=30
DebugLevel=4
Timeout=5
PidFile=/var/run/zabbix/server.pid
LogFile=/var/log/zabbix/server.log
AlertScriptsPath=/home/zabbix/bin/
FpingLocation=/usr/sbin/fping

Comment