I start from a clean installation of Zabbix 1.4.4, which hums along smoothly as long as there are no hosts. I have also compiled the agent on a machine I wish to monitor. I made no configuration changes on either the client or the server, except for the MySQL login details. The only thing I did through the front-end was add a single host, the one with the agent running on it, and associate it with the Template_Solaris template.
When I start the agent and then add the host via the Zabbix front-end (or do it in the opposite order), the server immediately shuts down; the log excerpt is reproduced further down.
The data for the new host is successfully fetched from the client and shows up in the web front-end (on the "Latest data" page), so the fault isn't with the client connections themselves; it seems to be something that happens later.
Running pstack on the main server process shows that it receives signal 18 (SIGCLD), which makes sense given the "One child process died" message in the log. The stack trace is reproduced further down.
I have a simple set of shell scripts that runs pstack on each zabbix_server process and writes the result to a text file. pstack is not catching any of the main process's children dying, which is perplexing: it should find something, since they are exiting on a signal. It is a rather coarse "monitoring system", though, recording pstack output only about 3 times per second on average, so that probably explains it. I wish I had DTrace on this machine.
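A minimal sketch of a sampling loop of the kind described above (not the actual scripts used here). It assumes a POSIX-style shell such as ksh or bash, and that `pgrep` and `pstack` are on the PATH; the variable names, the output file name and the sample count are placeholders chosen for illustration. Because it polls, a child that crashes between two samples will still be missed.

```shell
# Sketch only: sample the stack of every matching process into a text file.
# All names below are illustrative defaults, not taken from the real scripts.
STACK_CMD=${STACK_CMD:-pstack}            # command that dumps one process's stack
PROC_NAME=${PROC_NAME:-zabbix_server}     # process name to look for
OUT_FILE=${OUT_FILE:-pstack-samples.txt}  # where the samples are appended
SAMPLES=${SAMPLES:-100}                   # number of sampling rounds

# Dump the stack of each PID given as an argument, with a header per PID.
sample_pids() {
    for pid in "$@"; do
        printf '=== pid %s ===\n' "$pid"
        # The process may have exited since the PID list was built.
        "$STACK_CMD" "$pid" 2>&1 || printf '(pid %s is gone)\n' "$pid"
    done
}

# Run SAMPLES rounds back to back, appending everything to OUT_FILE.
sample_loop() {
    i=0
    while [ "$i" -lt "$SAMPLES" ]; do
        # Word splitting of the pgrep output is intentional: one PID per word.
        sample_pids $(pgrep -x "$PROC_NAME") >> "$OUT_FILE"
        i=$((i + 1))
    done
}
```

Calling `sample_loop` right before adding the host in the front-end starts the sampling; the resulting file can then be searched for the last stack recorded for each child.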
Any ideas what might be going on here? The log for the server run can be found here
Thanks in advance!
--
Adam Lundrigan
Computer Systems Programmer
Biological & Physical Oceanography Section
Science, Oceans & Environment Branch
Department of Fisheries and Oceans Canada
Northwest Atlantic Fisheries Centre
St. John's, Newfoundland & Labrador
CANADA A1C 5X1
Tel: (709) 772-8136
Fax: (709) 772-8138
Cell: (709) 277-4575
Office: G10-117J
Email: [email protected]
Server log excerpt from the moment the server shuts down:
Code:
20146:20080130:103328 End update_triggers [19821]
20149:20080130:103328 End update_functions()
20146:20080130:103328 Query [commit;]
20149:20080130:103328 In update_triggers [itemid:19824]
20149:20080130:103328 Query [select distinct t.triggerid,t.expression,t.description,t.url,t.comments,t.status,t.value,t.priority from triggers t,functions f,items i where i.status<>3 and i.itemid=f.itemid and t.status=0 and f.triggerid=t.triggerid and f.itemid=19824]
20147:20080130:103328 End update_functions()
20147:20080130:103328 In update_triggers [itemid:19767]
20147:20080130:103328 Query [select distinct t.triggerid,t.expression,t.description,t.url,t.comments,t.status,t.value,t.priority from triggers t,functions f,items i where i.status<>3 and i.itemid=f.itemid and t.status=0 and f.triggerid=t.triggerid and f.itemid=19767]
20145:20080130:103328 In evaluate_expression({12088}=0)
20147:20080130:103328 In evaluate_expression({12090}=0)
20145:20080130:103328 In substitute_simple_macros()
20145:20080130:103328 In substitute_simple_macros (data:{12088}=0)
20147:20080130:103328 In substitute_simple_macros()
20147:20080130:103328 In substitute_simple_macros (data:{12090}=0)
20144:20080130:103328 One child process died. Exiting ...
20146:20080130:103328 Got signal. Exiting ...
....
20149:20080130:103328 Got signal. Exiting ...
20144:20080130:103331 ZABBIX Server stopped
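One way to narrow down which child died might be to look for a PID that never logged its own exit message. The sketch below assumes the `PID:YYYYMMDD:HHMMSS message` line format seen in the excerpt above and a POSIX awk (on Solaris that is probably `nawk` or `/usr/xpg4/bin/awk` rather than the legacy `awk`). The function name `pids_without_exit_message` is made up for illustration. It is only meaningful on the full log, since the excerpt elides lines with `....`.

```shell
# Sketch only: print the last log line of every PID that never logged
# one of the exit messages seen in the excerpt.
pids_without_exit_message() {
    awk -F: '
        # Only consider lines that start with PID:DATE:TIME followed by a space.
        /^[0-9]+:[0-9]+:[0-9]+ / {
            pid = $1
            if (!(pid in last)) order[++n] = pid   # remember first-seen order
            last[pid] = $0                         # keep the most recent line
            if ($0 ~ /Got signal\. Exiting|One child process died|Server stopped/)
                exited[pid] = 1                    # this PID announced its exit
        }
        END {
            for (i = 1; i <= n; i++) {
                pid = order[i]
                if (!(pid in exited)) print last[pid]
            }
        }
    ' "$@"
}
```

Run as `pids_without_exit_message /path/to/zabbix_server.log` (the path is a placeholder). Any PID it prints stopped logging without announcing an exit, and the printed line shows the last function that process reported entering or leaving.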
pstack output for the main server process, taken as it handles signal 18 (SIGCLD):
Code:
20144: zabbix_server
 feea079c unlink (13ee08, 0, 0, 0, 0, 0) + 8
 00052e84 daemon_stop (0, f, ffbfee14, 7d8, 1, 1e) + c
 00036a68 zbx_on_exit (a, 21, 0, 0, 0, 0) + 188
 000529f4 parent_signal_handler (12, 0, ffbfefe8, 0, 0, 0) + 3c
 feba56c8 __sighndlr (12, 0, ffbfefe8, 529b8, 0, 0) + c
 feb9f320 call_user_handler (12, 0, ffbfefe8, 0, 0, 0) + 234
 feb9f4d0 sigacthandler (12, 0, ffbfefe8, 0, 0, 0) + 64
 --- called from signal handler with signal 18 (SIGCLD) ---
 fee9ccd0 _libc_nanosleep (3c, 0, 0, 0, 0, 0) + 8
 00049d80 main_watchdog_loop (a, 1f, 0, fffffff8, 0, 145ea5) + 28
 0003624c MAIN_ZABBIX_ENTRY (a6710, ffbffc74, 0, 0, 8d, 21) + 4ec
 00052e38 daemon_start (0, ffbffd74, c8ff0, c8ff8, 0, 2) + 3d8
 00035d28 main (1, ffbffd74, ffbffd7c, c8c00, 0, 0) + 248
 00024e40 _start (0, 0, 0, 0, 0, 0) + 108
