I have been using Zabbix v1.4.4 for several months and it was always stable. However, a few days ago it died suddenly and has been very unstable since. Below are the last 10+ lines of the server log at debug level 4, captured during startup.
14337:20080225:093514 In delete_history(history,17152,31268674383,699408)
14337:20080225:093514 Query [select min(clock) from history where itemid=17152]
14337:20080225:093514 In delete_history(history_uint,17152,31268674383,699408)
14337:20080225:093514 Query [select min(clock) from history_uint where itemid=17152]
14337:20080225:093514 In delete_history(history_str,17152,31268674383,699408)
14337:20080225:093514 Query [select min(clock) from history_str where itemid=17152]
14307:20080225:093514 One child process died. Exiting ...
14324:20080225:093514 Got signal. Exiting ...
14325:20080225:093514 Got signal. Exiting ...
14327:20080225:093514 Got signal. Exiting ...
14328:20080225:093514 Got signal. Exiting ...
14329:20080225:093514 Got signal. Exiting ...
14330:20080225:093514 Got signal. Exiting ...
14331:20080225:093514 Got signal. Exiting ...
14332:20080225:093514 Got signal. Exiting ...
14333:20080225:093514 Got signal. Exiting ...
14335:20080225:093514 Got signal. Exiting ...
14339:20080225:093514 Got signal. Exiting ...
14340:20080225:093514 Got signal. Exiting ...
14341:20080225:093514 Got signal. Exiting ...
14342:20080225:093514 Got signal. Exiting ...
14343:20080225:093514 Got signal. Exiting ...
14344:20080225:093514 Got signal. Exiting ...
14347:20080225:093514 Got signal. Exiting ...
14337:20080225:093514 Got signal. Exiting ...
14307:20080225:093516 ZABBIX Server stopped
From this log, I can't figure out which child process died and brought Zabbix down, so I don't know where to start troubleshooting.
I hope I can fix this problem, because I think Zabbix is a good monitoring tool. If needed, I can provide the whole server log as well as the audit log for investigation.
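In case it helps narrow things down, here is a minimal sketch of how I might look for the dead child in the full log. It assumes every line has the "PID:date:time message" layout shown above, and that a process whose last logged line is not one of the shutdown messages is a likely suspect. The function name find_silent_pids is my own, not part of Zabbix, and the list of "clean" messages is taken only from the excerpt above.

```python
import re

# Assumed line layout, based on the excerpt: "PID:YYYYMMDD:HHMMSS message".
LINE_RE = re.compile(r"^(\d+):(\d{8}):(\d{6}) (.*)$")

# Messages that indicate an orderly exit (taken from the excerpt only).
CLEAN_EXIT = (
    "Got signal. Exiting",
    "One child process died",
    "ZABBIX Server stopped",
)


def find_silent_pids(lines):
    """Return sorted PIDs whose last logged message is not a shutdown message."""
    last_message = {}
    for line in lines:
        match = LINE_RE.match(line.strip())
        if match:
            # Later lines overwrite earlier ones, so we keep the final message per PID.
            last_message[int(match.group(1))] = match.group(4)
    return sorted(
        pid for pid, msg in last_message.items() if not msg.startswith(CLEAN_EXIT)
    )
```

Passing the lines of the server log to find_silent_pids should list the processes that stopped logging without a "Got signal" line. If the log covers several restarts, only the portion for the last run should be passed in, otherwise PIDs from earlier runs get mixed in.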
