hello everyone. i have installed zabbix 1.1.1 on sparc-solaris9 host. mysql 5.0.21, php 4.4.2, netsnmp 5.2.0 already installed. compilation and agentrun was no problem. webintterface working fine, but i cant understand why zabbix_server don't wont to run. it starting and just quit like its all right. log in debug mode contain following records in tail:
022750:20060807:151842 7. PID=[22758]
022750:20060807:151842 8. PID=[22759]
022750:20060807:151842 9. PID=[22760]
022750:20060807:151842 ZABBIX server is up.
022758:20060807:151842 In child_main()
022751:20060807:151842 Executing query:select a.alertid,a.mediatypeid,a.sendto,a.subject,a.messa ge,a.status,a.retries,mt.mediatypeid,mt.type,mt.de scription,mt.smtp_server,mt.smtp_helo,mt.smtp_emai l,mt.exec_path,a.delay,mt.gsm_modem from alerts a,media_type mt where a.status=0 and a.retries<3 and (a.repeats<a.maxrepeats or a.maxrepeats=0) and a.nextcheck<=1154949522 and a.mediatypeid=mt.mediatypeid order by a.clock
022752:20060807:151842 Executing query:select distinct i.itemid,i.key_,h.host,h.port,i.delay,i.descriptio n,i.nextcheck,i.type,i.snmp_community,i.snmp_oid,h .useip,h.ip,i.history,i.lastvalue,i.prevvalue,i.ho stid,h.status,i.value_type,h.errors_from,i.snmp_po rt,i.delta,i.prevorgvalue,i.lastclock,i.units,i.mu ltiplier,i.snmpv3_securityname,i.snmpv3_securityle vel,i.snmpv3_authpassphrase,i.snmpv3_privpassphras e,i.formula,h.available,i.status,i.trapper_hosts,i .logtimefmt,i.valuemapid from hosts h, items i, functions f where h.hostid=i.hostid and h.status=0 and i.status=0 and f.function in ('nodata','date','dayofweek','time','now') and i.itemid=f.itemid
022756:20060807:151842 In child_main()
022756:20060807:151842 server #6 started [Trapper]
022756:20060807:151842 Before DBconnect()
022756:20060807:151842 After DBconnect()
022756:20060807:151842 Before accept()
022760:20060807:151842 In child_main()
022760:20060807:151842 server #10 started [Trapper]
022760:20060807:151842 Before DBconnect()
022753:20060807:151842 In create_host_file()
022753:20060807:151842 Executing query:select distinct h.ip from hosts h,items i where i.hostid=h.hostid and (h.status=0 or (h.status=0 and h.available=2 and h.disable_until<=1154949522)) and (i.key_='icmpping' or i.key_='icmppingsec') and i.type=3 and i.status=0 and h.useip=1
022757:20060807:151842 In child_main()
022757:20060807:151842 server #7 started [Trapper]
022757:20060807:151842 Before DBconnect()
022758:20060807:151842 server #8 started [Trapper]
022758:20060807:151842 Before DBconnect()
022753:20060807:151842 Executing query:select distinct h.host from hosts h,items i where i.hostid=h.hostid and (h.status=0 or (h.status=0 and h.available=2 and h.disable_until<=1154949522)) and (i.key_='icmpping' or i.key_='icmppingsec') and i.type=3 and i.status=0 and h.useip=0
022753:20060807:151842 In do_ping()
022757:20060807:151842 After DBconnect()
022757:20060807:151842 Before accept()
022758:20060807:151842 After DBconnect()
022758:20060807:151842 Before accept()
022759:20060807:151842 In child_main()
022759:20060807:151842 server #9 started [Trapper]
022759:20060807:151842 Before DBconnect()
022750:20060807:151842 In housekeeping_process_log()
022750:20060807:151842 Executing query:select housekeeperid, tablename, field, value from housekeeper order by tablename
022760:20060807:151842 After DBconnect()
022760:20060807:151842 Before accept()
022750:20060807:151842 In housekeeping_alarms(1154949522)
022750:20060807:151842 Executing query:select alarm_history from config
022750:20060807:151842 Executing query:select alarmid from alarms where clock<1123413522
022750:20060807:151842 In housekeeping_alerts(1154949522)
022750:20060807:151842 Executing query:select alert_history from config
022750:20060807:151842 Executing query:delete from alerts where clock<1123413522
022759:20060807:151842 After DBconnect()
022759:20060807:151842 Before accept()
022750:20060807:151842 Deleted [0] records from table [alerts]
022750:20060807:151842 In housekeeping_sessions(1154949522)
022750:20060807:151842 Executing query:delete from sessions where lastaccess<1154863122
022750:20060807:151842 Deleted [0] records from table [sessions]
022750:20060807:151842 Sleeping for 1 hours
022750:20060807:151842 One server process died. Shutting down...
022750:20060807:151842 0. Killing PID=[22751]
022750:20060807:151842 1. Killing PID=[22752]
022751:20060807:151842 Server [1]. Got QUIT or INT or TERM or PIPE signal. Exiting...
022750:20060807:151842 2. Killing PID=[22753]
022752:20060807:151842 Server [2]. Got QUIT or INT or TERM or PIPE signal. Exiting...
022750:20060807:151842 3. Killing PID=[22754]
022753:20060807:151842 Server [3]. Got QUIT or INT or TERM or PIPE signal. Exiting...
022750:20060807:151842 4. Killing PID=[22755]
022750:20060807:151842 5. Killing PID=[22756]
022750:20060807:151842 6. Killing PID=[22757]
022756:20060807:151842 Server [6]. Got QUIT or INT or TERM or PIPE signal. Exiting...
022750:20060807:151842 7. Killing PID=[22758]
022757:20060807:151842 Server [7]. Got QUIT or INT or TERM or PIPE signal. Exiting...
022750:20060807:151842 8. Killing PID=[22759]
022758:20060807:151842 Server [8]. Got QUIT or INT or TERM or PIPE signal. Exiting...
022759:20060807:151842 Server [9]. Got QUIT or INT or TERM or PIPE signal. Exiting...
022750:20060807:151842 9. Killing PID=[22760]
022760:20060807:151842 Server [10]. Got QUIT or INT or TERM or PIPE signal. Exiting...
022750:20060807:151842 ZABBIX server is down.
one process die (?) without any reason, and after that all other process killed by main and it quit. truss show this:
23098: open("/opt/csw/etc/snmp/snmp.conf", O_RDONLY) = 0
23098: ioctl(0, TCGETA, 0xFFBFE25C) Err#25 ENOTTY
23098: fstat64(0, 0xFFBFE2D0) = 0
23098: fstat64(0, 0xFFBFE178) = 0
23098: read(0, 0x001F392C, 8192) = 53
23098: \n d e f a u l t P o r t 1 9 1\n d e f V e r s i o n v 2 c\n
23098: d e f C o m m u n i t y p u b l i c\n\n
23098: read(0, 0x001F392C, 8192) = 0
23098: llseek(0, 0, SEEK_CUR) = 53
23098: close(0) = 0
23098: open("//.snmp/snmp.conf", O_RDONLY) Err#2 ENOENT
....many other places
23098: open("/opt/csw/var/net-snmp/zabbix_server.conf", O_RDONLY) Err#2 ENOENT
23098: open("/opt/csw/var/net-snmp/zabbix_server.local.conf", O_RDONLY) Err#2 ENOENT
23098: times(0xFFBFF1D8) = 172205057
23098: Incurred fault #1, FLTILL %pc = 0x002535E0
23098: siginfo: SIGILL ILL_ILLOPC addr=0x002535E0
23098: Received signal #4, SIGILL [default]
23098: siginfo: SIGILL ILL_ILLOPC addr=0x002535E0
23090: Received signal #18, SIGCLD, in sigsuspend() [caught]
23090: siginfo: SIGCLD CLD_KILLED pid=23098 status=0x0004
yeah, it look for specific option in snmp.conf but trying not only snmp.conf, but zabbix_server.conf too..? looks like my conf not enough. what it want ?
(server started with -c pathtoconfig option)
snmp.conf
------
defaultPort 191
defVersion v2c
defCommunity public
------
022750:20060807:151842 7. PID=[22758]
022750:20060807:151842 8. PID=[22759]
022750:20060807:151842 9. PID=[22760]
022750:20060807:151842 ZABBIX server is up.
022758:20060807:151842 In child_main()
022751:20060807:151842 Executing query:select a.alertid,a.mediatypeid,a.sendto,a.subject,a.messa ge,a.status,a.retries,mt.mediatypeid,mt.type,mt.de scription,mt.smtp_server,mt.smtp_helo,mt.smtp_emai l,mt.exec_path,a.delay,mt.gsm_modem from alerts a,media_type mt where a.status=0 and a.retries<3 and (a.repeats<a.maxrepeats or a.maxrepeats=0) and a.nextcheck<=1154949522 and a.mediatypeid=mt.mediatypeid order by a.clock
022752:20060807:151842 Executing query:select distinct i.itemid,i.key_,h.host,h.port,i.delay,i.descriptio n,i.nextcheck,i.type,i.snmp_community,i.snmp_oid,h .useip,h.ip,i.history,i.lastvalue,i.prevvalue,i.ho stid,h.status,i.value_type,h.errors_from,i.snmp_po rt,i.delta,i.prevorgvalue,i.lastclock,i.units,i.mu ltiplier,i.snmpv3_securityname,i.snmpv3_securityle vel,i.snmpv3_authpassphrase,i.snmpv3_privpassphras e,i.formula,h.available,i.status,i.trapper_hosts,i .logtimefmt,i.valuemapid from hosts h, items i, functions f where h.hostid=i.hostid and h.status=0 and i.status=0 and f.function in ('nodata','date','dayofweek','time','now') and i.itemid=f.itemid
022756:20060807:151842 In child_main()
022756:20060807:151842 server #6 started [Trapper]
022756:20060807:151842 Before DBconnect()
022756:20060807:151842 After DBconnect()
022756:20060807:151842 Before accept()
022760:20060807:151842 In child_main()
022760:20060807:151842 server #10 started [Trapper]
022760:20060807:151842 Before DBconnect()
022753:20060807:151842 In create_host_file()
022753:20060807:151842 Executing query:select distinct h.ip from hosts h,items i where i.hostid=h.hostid and (h.status=0 or (h.status=0 and h.available=2 and h.disable_until<=1154949522)) and (i.key_='icmpping' or i.key_='icmppingsec') and i.type=3 and i.status=0 and h.useip=1
022757:20060807:151842 In child_main()
022757:20060807:151842 server #7 started [Trapper]
022757:20060807:151842 Before DBconnect()
022758:20060807:151842 server #8 started [Trapper]
022758:20060807:151842 Before DBconnect()
022753:20060807:151842 Executing query:select distinct h.host from hosts h,items i where i.hostid=h.hostid and (h.status=0 or (h.status=0 and h.available=2 and h.disable_until<=1154949522)) and (i.key_='icmpping' or i.key_='icmppingsec') and i.type=3 and i.status=0 and h.useip=0
022753:20060807:151842 In do_ping()
022757:20060807:151842 After DBconnect()
022757:20060807:151842 Before accept()
022758:20060807:151842 After DBconnect()
022758:20060807:151842 Before accept()
022759:20060807:151842 In child_main()
022759:20060807:151842 server #9 started [Trapper]
022759:20060807:151842 Before DBconnect()
022750:20060807:151842 In housekeeping_process_log()
022750:20060807:151842 Executing query:select housekeeperid, tablename, field, value from housekeeper order by tablename
022760:20060807:151842 After DBconnect()
022760:20060807:151842 Before accept()
022750:20060807:151842 In housekeeping_alarms(1154949522)
022750:20060807:151842 Executing query:select alarm_history from config
022750:20060807:151842 Executing query:select alarmid from alarms where clock<1123413522
022750:20060807:151842 In housekeeping_alerts(1154949522)
022750:20060807:151842 Executing query:select alert_history from config
022750:20060807:151842 Executing query:delete from alerts where clock<1123413522
022759:20060807:151842 After DBconnect()
022759:20060807:151842 Before accept()
022750:20060807:151842 Deleted [0] records from table [alerts]
022750:20060807:151842 In housekeeping_sessions(1154949522)
022750:20060807:151842 Executing query:delete from sessions where lastaccess<1154863122
022750:20060807:151842 Deleted [0] records from table [sessions]
022750:20060807:151842 Sleeping for 1 hours
022750:20060807:151842 One server process died. Shutting down...
022750:20060807:151842 0. Killing PID=[22751]
022750:20060807:151842 1. Killing PID=[22752]
022751:20060807:151842 Server [1]. Got QUIT or INT or TERM or PIPE signal. Exiting...
022750:20060807:151842 2. Killing PID=[22753]
022752:20060807:151842 Server [2]. Got QUIT or INT or TERM or PIPE signal. Exiting...
022750:20060807:151842 3. Killing PID=[22754]
022753:20060807:151842 Server [3]. Got QUIT or INT or TERM or PIPE signal. Exiting...
022750:20060807:151842 4. Killing PID=[22755]
022750:20060807:151842 5. Killing PID=[22756]
022750:20060807:151842 6. Killing PID=[22757]
022756:20060807:151842 Server [6]. Got QUIT or INT or TERM or PIPE signal. Exiting...
022750:20060807:151842 7. Killing PID=[22758]
022757:20060807:151842 Server [7]. Got QUIT or INT or TERM or PIPE signal. Exiting...
022750:20060807:151842 8. Killing PID=[22759]
022758:20060807:151842 Server [8]. Got QUIT or INT or TERM or PIPE signal. Exiting...
022759:20060807:151842 Server [9]. Got QUIT or INT or TERM or PIPE signal. Exiting...
022750:20060807:151842 9. Killing PID=[22760]
022760:20060807:151842 Server [10]. Got QUIT or INT or TERM or PIPE signal. Exiting...
022750:20060807:151842 ZABBIX server is down.
one process die (?) without any reason, and after that all other process killed by main and it quit. truss show this:
23098: open("/opt/csw/etc/snmp/snmp.conf", O_RDONLY) = 0
23098: ioctl(0, TCGETA, 0xFFBFE25C) Err#25 ENOTTY
23098: fstat64(0, 0xFFBFE2D0) = 0
23098: fstat64(0, 0xFFBFE178) = 0
23098: read(0, 0x001F392C, 8192) = 53
23098: \n d e f a u l t P o r t 1 9 1\n d e f V e r s i o n v 2 c\n
23098: d e f C o m m u n i t y p u b l i c\n\n
23098: read(0, 0x001F392C, 8192) = 0
23098: llseek(0, 0, SEEK_CUR) = 53
23098: close(0) = 0
23098: open("//.snmp/snmp.conf", O_RDONLY) Err#2 ENOENT
....many other places
23098: open("/opt/csw/var/net-snmp/zabbix_server.conf", O_RDONLY) Err#2 ENOENT
23098: open("/opt/csw/var/net-snmp/zabbix_server.local.conf", O_RDONLY) Err#2 ENOENT
23098: times(0xFFBFF1D8) = 172205057
23098: Incurred fault #1, FLTILL %pc = 0x002535E0
23098: siginfo: SIGILL ILL_ILLOPC addr=0x002535E0
23098: Received signal #4, SIGILL [default]
23098: siginfo: SIGILL ILL_ILLOPC addr=0x002535E0
23090: Received signal #18, SIGCLD, in sigsuspend() [caught]
23090: siginfo: SIGCLD CLD_KILLED pid=23098 status=0x0004
yeah, it look for specific option in snmp.conf but trying not only snmp.conf, but zabbix_server.conf too..? looks like my conf not enough. what it want ?
(server started with -c pathtoconfig option)
snmp.conf
------
defaultPort 191
defVersion v2c
defCommunity public
------
Comment