Hi,
I'm having problem with zabbix-server under Red Hat Enterprise Linux 7. After some time ( i'd say 24h maybe less maybe more) systemd flag the service as "failed"
Trying to restart zabbix-server fail s long the main zabbix-server is not killed :
I'm seeing a lot of weird restart in /var/log/message :
I'm not sure but I think the problem is systemd related.
Here my unit file :
And my config file (DB password censored...) :
Before you ask yeah I know it's localhost host and yes it is port 3308... It's a stunnel to the real database on the database server. Conection the the database is working.
Has anyone ever had such problem with RHEL 7 or even with systemd in general ?
I'm having problem with zabbix-server under Red Hat Enterprise Linux 7. After some time ( i'd say 24h maybe less maybe more) systemd flag the service as "failed"
Code:
[root@AnonZabbixSrv ~]# systemctl status zabbix-server zabbix-server.service - Zabbix Server Loaded: loaded (/usr/lib/systemd/system/zabbix-server.service; enabled) Active: failed (Result: start-limit) since Wed 2015-10-14 09:48:02 EDT; 15min ago Process: 58200 ExecStart=/usr/sbin/zabbix_server -c $CONFFILE (code=exited, status=0/SUCCESS) Main PID: 58202 (code=exited, status=1/FAILURE) Oct 14 09:48:02 AnonZabbixSrv systemd[1]: zabbix-server.service: main process exited, code=exited, status=1/FAILURE Oct 14 09:48:02 AnonZabbixSrv systemd[1]: Unit zabbix-server.service entered failed state. Oct 14 09:48:02 AnonZabbixSrv systemd[1]: zabbix-server.service holdoff time over, scheduling restart. Oct 14 09:48:02 AnonZabbixSrv systemd[1]: Stopping Zabbix Server... Oct 14 09:48:02 AnonZabbixSrv systemd[1]: Starting Zabbix Server... Oct 14 09:48:02 AnonZabbixSrv systemd[1]: zabbix-server.service start request repeated too quickly, refusing to start. Oct 14 09:48:02 AnonZabbixSrv systemd[1]: Failed to start Zabbix Server. Oct 14 09:48:02 AnonZabbixSrv systemd[1]: Unit zabbix-server.service entered failed state.
Code:
[root@AnonZabbixSrv ~]# systemctl restart zabbix-server
[root@AnonZabbixSrv ~]# systemctl status zabbix-server
zabbix-server.service - Zabbix Server
Loaded: loaded (/usr/lib/systemd/system/zabbix-server.service; enabled)
Active: failed (Result: start-limit) since Wed 2015-10-14 09:48:02 EDT; 44s ago
Process: 58200 ExecStart=/usr/sbin/zabbix_server -c $CONFFILE (code=exited, status=0/SUCCESS)
Main PID: 58202 (code=exited, status=1/FAILURE)
Oct 14 09:48:02 AnonZabbixSrv systemd[1]: zabbix-server.service: main process exited, code=exited, status=1/FAILURE
Oct 14 09:48:02 AnonZabbixSrv systemd[1]: Unit zabbix-server.service entered failed state.
Oct 14 09:48:02 AnonZabbixSrv systemd[1]: zabbix-server.service holdoff time over, scheduling restart.
Oct 14 09:48:02 AnonZabbixSrv systemd[1]: Stopping Zabbix Server...
Oct 14 09:48:02 AnonZabbixSrv systemd[1]: Starting Zabbix Server...
Oct 14 09:48:02 AnonZabbixSrv systemd[1]: zabbix-server.service start request repeated too quickly, refusing to start.
Oct 14 09:48:02 AnonZabbixSrv systemd[1]: Failed to start Zabbix Server.
Oct 14 09:48:02 AnonZabbixSrv systemd[1]: Unit zabbix-server.service entered failed state.
[root@AnonZabbixSrv ~]# ps -A |grep zabbix
2375 ? 00:00:00 zabbix_agentd
2378 ? 00:00:16 zabbix_agentd
2379 ? 00:00:13 zabbix_agentd
2380 ? 00:00:15 zabbix_agentd
2381 ? 00:00:14 zabbix_agentd
7849 ? 00:00:00 zabbix_server
[root@AnonZabbixSrv ~]# kill -9 7849
[root@AnonZabbixSrv ~]# systemctl start zabbix-server
[root@AnonZabbixSrv ~]# systemctl status zabbix-server
zabbix-server.service - Zabbix Server
Loaded: loaded (/usr/lib/systemd/system/zabbix-server.service; enabled)
Active: active (running) since Wed 2015-10-14 10:04:13 EDT; 7s ago
Process: 60196 ExecStart=/usr/sbin/zabbix_server -c $CONFFILE (code=exited, status=0/SUCCESS)
Main PID: 60198 (zabbix_server)
CGroup: /system.slice/zabbix-server.service
├─60198 /usr/sbin/zabbix_server -c /etc/zabbix/zabbix_server.conf
├─60203 /usr/sbin/zabbix_server: configuration syncer [waiting 60 sec for processes]
├─60204 /usr/sbin/zabbix_server: db watchdog [synced alerts config in 0.025212 sec, idle 60 sec]
├─60205 /usr/sbin/zabbix_server: poller #1 [got 136 values in 3.704853 sec, getting values]
├─60207 /usr/sbin/zabbix_server: poller #2 [got 98 values in 3.901976 sec, getting values]
├─60208 /usr/sbin/zabbix_server: poller #3 [got 57 values in 3.878049 sec, getting values]
├─60209 /usr/sbin/zabbix_server: poller #4 [got 23 values in 5.271251 sec, getting values]
├─60210 /usr/sbin/zabbix_server: poller #5 [got 62 values in 4.595845 sec, getting values]
├─60211 /usr/sbin/zabbix_server: unreachable poller #1 [got 1 values in 0.003109 sec, getting values]
├─60212 /usr/sbin/zabbix_server: trapper #1 [processed data in 0.006886 sec, waiting for connection]
├─60213 /usr/sbin/zabbix_server: trapper #2 [processed data in 0.000000 sec, waiting for connection]
├─60214 /usr/sbin/zabbix_server: trapper #3 [processed data in 0.000000 sec, waiting for connection]
├─60215 /usr/sbin/zabbix_server: trapper #4 [processed data in 0.000000 sec, waiting for connection]
├─60216 /usr/sbin/zabbix_server: trapper #5 [processed data in 0.000000 sec, waiting for connection]
├─60217 /usr/sbin/zabbix_server: icmp pinger #1 [pinging hosts]
├─60218 /usr/sbin/zabbix_server: alerter [sent alerts: 0 success, 0 fail in 0.000953 sec, idle 30 sec]
├─60219 /usr/sbin/zabbix_server: housekeeper [startup idle for 30 minutes]
├─60220 /usr/sbin/zabbix_server: timer #1 [processed 0 triggers, 0 events in 0.000000 sec, 0 maintenances in 0.000000 s...
├─60221 /usr/sbin/zabbix_server: http poller #1 [got 0 values in 0.000000 sec, getting values]
├─60222 /usr/sbin/zabbix_server: discoverer #1 [processed 1 rules in 0.006757 sec, idle 10 sec]
├─60223 /usr/sbin/zabbix_server: history syncer #1 [synced 2 items in 0.078872 sec, idle 5 sec]
├─60224 /usr/sbin/zabbix_server: history syncer #2 [synced 430 items in 0.721895 sec, idle 5 sec]
├─60225 /usr/sbin/zabbix_server: history syncer #3 [synced 1 items in 0.042046 sec, idle 5 sec]
├─60226 /usr/sbin/zabbix_server: history syncer #4 [synced 0 items in 0.000001 sec, idle 5 sec]
├─60227 /usr/sbin/zabbix_server: escalator [processed 0 escalations in 0.000845 sec, idle 3 sec]
├─60228 /usr/sbin/zabbix_server: proxy poller #1 [exchanged data with 0 proxies in 0.000003 sec, idle 5 sec]
├─60229 /usr/sbin/zabbix_server: self-monitoring [processed data in 0.000003 sec, idle 1 sec]
├─60449 sh -c /usr/sbin/fping -C3 2>&1 </tmp/zabbix_server_60217.pinger;/usr/sbin/fping6 -C3 2>&1 </tmp/zabbix_server_6...
└─60452 /usr/sbin/fping -C3
Oct 14 10:04:13 AnonZabbixSrv systemd[1]: Starting Zabbix Server...
Oct 14 10:04:13 AnonZabbixSrv systemd[1]: PID file /run/zabbix/zabbix_server.pid not readable (yet?) after start.
Oct 14 10:04:13 AnonZabbixSrv systemd[1]: Started Zabbix Server.
Code:
[root@AnonZabbixSrv ~]# grep zabbix-server /var/log/messages | grep "Oct 13" Oct 13 10:35:50 AnonZabbixSrv systemd:Cannot add dependency job for unit zabbix-server-mysql.service, ignoring: Unit zabbix-server-mysql.service failed to load: No such file or directory. Oct 13 11:55:25 AnonZabbixSrv systemd:zabbix-server.service: Supervising process 49544 which is not our child. We'll most likely not notice when it exits. Oct 13 11:57:43 AnonZabbixSrv systemd:zabbix-server.service holdoff time over, scheduling restart. Oct 13 11:57:43 AnonZabbixSrv systemd:zabbix-server.service: Supervising process 56458 which is not our child. We'll most likely not notice when it exits. Oct 13 11:59:50 AnonZabbixSrv systemd:zabbix-server.service holdoff time over, scheduling restart. Oct 13 12:03:40 AnonZabbixSrv systemd:zabbix-server.service holdoff time over, scheduling restart. Oct 13 12:08:00 AnonZabbixSrv systemd:zabbix-server.service holdoff time over, scheduling restart. Oct 13 12:11:08 AnonZabbixSrv systemd:zabbix-server.service holdoff time over, scheduling restart. Oct 13 12:11:08 AnonZabbixSrv systemd:zabbix-server.service: Supervising process 7990 which is not our child. We'll most likely not notice when it exits. Oct 13 12:15:01 AnonZabbixSrv systemd:zabbix-server.service holdoff time over, scheduling restart. Oct 13 12:15:01 AnonZabbixSrv systemd:zabbix-server.service: Supervising process 20002 which is not our child. We'll most likely not notice when it exits. Oct 13 12:18:52 AnonZabbixSrv systemd:zabbix-server.service holdoff time over, scheduling restart. Oct 13 12:18:52 AnonZabbixSrv systemd:zabbix-server.service: Supervising process 31427 which is not our child. We'll most likely not notice when it exits. Oct 13 12:22:39 AnonZabbixSrv systemd:zabbix-server.service holdoff time over, scheduling restart. Oct 13 12:22:39 AnonZabbixSrv systemd:zabbix-server.service: Supervising process 42826 which is not our child. We'll most likely not notice when it exits. Oct 13 12:26:37 AnonZabbixSrv systemd:zabbix-server.service holdoff time over, scheduling restart. Oct 13 12:30:22 AnonZabbixSrv systemd:zabbix-server.service holdoff time over, scheduling restart. Oct 13 12:30:22 AnonZabbixSrv systemd:zabbix-server.service: Supervising process 926 which is not our child. We'll most likely not notice when it exits. Oct 13 12:34:15 AnonZabbixSrv systemd:zabbix-server.service holdoff time over, scheduling restart. Oct 13 12:34:15 AnonZabbixSrv systemd:zabbix-server.service: Supervising process 12682 which is not our child. We'll most likely not notice when it exits. Oct 13 12:38:09 AnonZabbixSrv systemd:zabbix-server.service holdoff time over, scheduling restart. Oct 13 12:41:36 AnonZabbixSrv systemd:zabbix-server.service holdoff time over, scheduling restart. Oct 13 12:45:42 AnonZabbixSrv systemd:zabbix-server.service holdoff time over, scheduling restart. Oct 13 12:45:42 AnonZabbixSrv systemd:zabbix-server.service: Supervising process 47380 which is not our child. We'll most likely not notice when it exits. Oct 13 12:49:40 AnonZabbixSrv systemd:zabbix-server.service holdoff time over, scheduling restart. Oct 13 12:49:40 AnonZabbixSrv systemd:zabbix-server.service: Supervising process 59903 which is not our child. We'll most likely not notice when it exits. Oct 13 12:53:26 AnonZabbixSrv systemd:zabbix-server.service holdoff time over, scheduling restart. Oct 13 12:53:26 AnonZabbixSrv systemd:zabbix-server.service: Supervising process 6258 which is not our child. We'll most likely not notice when it exits. Oct 13 12:57:41 AnonZabbixSrv systemd:zabbix-server.service holdoff time over, scheduling restart. Oct 13 12:57:41 AnonZabbixSrv systemd:zabbix-server.service: Supervising process 19182 which is not our child. We'll most likely not notice when it exits. Oct 13 13:01:31 AnonZabbixSrv systemd:zabbix-server.service holdoff time over, scheduling restart. Oct 13 13:05:15 AnonZabbixSrv systemd:zabbix-server.service holdoff time over, scheduling restart. Oct 13 13:09:04 AnonZabbixSrv systemd:zabbix-server.service holdoff time over, scheduling restart. Oct 13 13:12:47 AnonZabbixSrv systemd:zabbix-server.service holdoff time over, scheduling restart. Oct 13 13:16:38 AnonZabbixSrv systemd:zabbix-server.service holdoff time over, scheduling restart. Oct 13 13:19:52 AnonZabbixSrv systemd:zabbix-server.service holdoff time over, scheduling restart. Oct 13 13:19:52 AnonZabbixSrv systemd:zabbix-server.service: Supervising process 63208 which is not our child. We'll most likely not notice when it exits. Oct 13 13:23:41 AnonZabbixSrv systemd:zabbix-server.service holdoff time over, scheduling restart. Oct 13 13:27:44 AnonZabbixSrv systemd:zabbix-server.service holdoff time over, scheduling restart. Oct 13 13:31:21 AnonZabbixSrv systemd:zabbix-server.service holdoff time over, scheduling restart. Oct 13 13:31:21 AnonZabbixSrv systemd:zabbix-server.service: Supervising process 34252 which is not our child. We'll most likely not notice when it exits.
Here my unit file :
Code:
[root@lgp002 ~]# cat /usr/lib/systemd/system/zabbix-server.service [Unit] Description=Zabbix Server After=syslog.target After=network.target [Service] Environment="CONFFILE=/etc/zabbix/zabbix_server.conf" EnvironmentFile=-/etc/sysconfig/zabbix-server Type=forking Restart=always PIDFile=/run/zabbix/zabbix_server.pid KillMode=process ExecStart=/usr/sbin/zabbix_server -c $CONFFILE [Install] WantedBy=multi-user.target
Code:
[root@AnonZabbixSrv ~]# grep -v ^# /etc/zabbix/zabbix_server.conf LogFile=/var/log/zabbix/zabbix_server.log LogFileSize=0 PidFile=/run/zabbix/zabbix_server.pid DBHost=127.0.0.1 DBName=zabbix DBUser=zabbix DBPassword=********* DBPort=3308 SNMPTrapperFile=/var/log/snmptt/snmptt.log AlertScriptsPath=/usr/lib/zabbix/alertscripts ExternalScripts=/usr/lib/zabbix/externalscripts
Has anyone ever had such problem with RHEL 7 or even with systemd in general ?
Comment