PDA

View Full Version : zabbix_server dies when taking action


Delta
24-12-2007, 10:18
I have a fairly new installation of Zabbix 1.4.4 (upgraded from 1.4.3, which I hoped would solve my problem below, which it didn't) on FreeBSD 6.2 that monitors correctly but unfortunately seems to crash when an action is performed.

The situation is as follows: about 15 servers are monitored. I can produce all graphs and when a host or service is down it is correctly displayed as down. Now I have created an action that sends me an email when one of the hosts is down. When I activate the action and when it sees that the host is down, zabbix_server crashes.

When I restart zabbix_server it crashes again after a few seconds. When I disable the action (or undo whatever is triggering the action), all is fine and zabbix_server will continue running.

I added the level 3 and level 4 debug output (I took out quite a few rows in the level 4 output that started with 'In delete_history' because otherwise it would be too large to post; even zipped) from zabbix_server.log. The zabbix_client processes on the monitored hosts are not affected and keep running without any errors.

In this case there is an action that triggers on Host availability of the host EU-DBTEST. As you can see at the bottom of the log, it correctly notes that the host is down and then crashes.

This is the message in /var/log/messages from the level 3 debug:
Dec 24 09:28:02 zabbix kernel: pid 59829 (zabbix_server), uid 1003: exited on signal 11

This is the message in /var/log/messages from the level 4 debug:
Dec 24 09:30:58 zabbix kernel: pid 59879 (zabbix_server), uid 1003: exited on signal 11

uid 1003 is the zabbix account

Does anybody have any idea? This is of course a show stopper, since Zabbix is supposed to notify me instead of crashing and thus stop monitoring at all.

wanga
26-12-2007, 15:06
I have a fairly new installation of Zabbix 1.4.4 (upgraded from 1.4.3, which I hoped would solve my problem below, which it didn't) on FreeBSD 6.2 that monitors correctly but unfortunately seems to crash when an action is performed.

The situation is as follows: about 15 servers are monitored. I can produce all graphs and when a host or service is down it is correctly displayed as down. Now I have created an action that sends me an email when one of the hosts is down. When I activate the action and when it sees that the host is down, zabbix_server crashes.

When I restart zabbix_server it crashes again after a few seconds. When I disable the action (or undo whatever is triggering the action), all is fine and zabbix_server will continue running.

I added the level 3 and level 4 debug output (I took out quite a few rows in the level 4 output that started with 'In delete_history' because otherwise it would be too large to post; even zipped) from zabbix_server.log. The zabbix_client processes on the monitored hosts are not affected and keep running without any errors.

In this case there is an action that triggers on Host availability of the host EU-DBTEST. As you can see at the bottom of the log, it correctly notes that the host is down and then crashes.

This is the message in /var/log/messages from the level 3 debug:
Dec 24 09:28:02 zabbix kernel: pid 59829 (zabbix_server), uid 1003: exited on signal 11

This is the message in /var/log/messages from the level 4 debug:
Dec 24 09:30:58 zabbix kernel: pid 59879 (zabbix_server), uid 1003: exited on signal 11

uid 1003 is the zabbix account

Does anybody have any idea? This is of course a show stopper, since Zabbix is supposed to notify me instead of crashing and thus stop monitoring at all.

I've got a very similar problem here as well. Running on FreeBSD 5.5 and 1.4.4

wit
27-12-2007, 10:36
I've got the same ploblem.
Running on FreeBSD 6.2 and 1.4.4

When I disable the action (or undo whatever is triggering the action), all is fine and zabbix_server will continue running.

wit
28-12-2007, 14:51
Sorry my bad english.

I added some value in table ids (i think zabbix not found this value and shutdown, its very strange for me!!!)

MYSQL v.5.0.51
INSERT INTO `ids`
(`nodeid`, `table_name`, `field_name`, `nextid`) VALUES
(0, 'alerts', 'alertid', 1)

Zabbix 1.4.2 have this values in this table.

dsipe
03-01-2008, 00:14
Just upgraded from 1.4.2 to 1.4.4 and got the same problem.

I need to restart zabbix server several times per day ...

Wondering what to do ! :mad:

EtherTrogg
29-01-2008, 18:23
This problem just started happening to me today. I've been running 1.4.4 for a couple of weeks now, with mixed results. Now it's completely unusable.

I added two new hosts today.

Zabbix server starts, runs for less that a minute, then dies. /var/log/messages reports "exiting on signal 11".

No problems with MySQL server running on this machine.

Suggestions?

EtherTrogg
29-01-2008, 21:48
Dropping my Zabbix DB and restoring from last night's backup seems to have fixed the problem. My 1.4.4 installation is back up and running with only minimal lose of data.

Still, it concerns me that this happened. I know the developers won't be able to do too much to address this since I can't give you any significant information beyond "it broke".

If I can re-create the problem, I'll post more information.

And I still think that Zabbix is an excellent product, even with the occasional problems.