Ad Widget

Collapse

Zabbix 1.4 Crashing !!

Collapse
X
 
  • Time
  • Show
Clear All
new posts
  • Umair
    Member
    • Feb 2007
    • 86

    #1

    Zabbix 1.4 Crashing !!

    There is an unexpected bahvior in the execution of Zabbix server 1.4
    heres the log

    3870:20070619:164412 Query::select i.itemid,i.key_,h.host,h.port,i.delay,i.descriptio n,i.nextcheck,i.type,i.snmp_community,i.snmp_oid,h .useip,h.ip,i.history,i.lastvalue,i.prevvalue,i.ho stid,h.status,i.value_type,h.errors_from,i.snmp_po rt,i.delta,i.prevorgvalue,i.lastclock,i.units,i.mu ltiplier,i.snmpv3_securityname,i.snmpv3_securityle vel,i.snmpv3_authpassphrase,i.snmpv3_privpassphras e,i.formula,h.available,i.status,i.trapper_hosts,i .logtimefmt,i.valuemapid,i.delay_flex,h.dns from hosts h, items i where i.nextcheck<=1182264252 and i.status in (0,3) and i.type not in (2,7,9) and h.status=0 and h.disable_until<=1182264252 and h.errors_from=0 and h.hostid=i.hostid and mod(i.itemid,5)=2 and i.key_ not in ('status','icmpping','icmppingsec','zabbix[log]') and h.hostid>=100000000000000*0 and h.hostid<=(100000000000000*0+99999999999999) order by i.nextcheck
    3870:20070619:164412 Query failed:MySQL server has gone away [2006]
    3893:20070619:164413 Query::select druleid,iprange,delay,nextcheck,name,status from drules where status=0 and nextcheck<=ee1182264253 and mod(druleid,1)=0 and druleid>=100000000000000*0 and druleid<=(100000000000000*0+99999999999999)
    3893:20070619:164413 Query failed:MySQL server has gone away [2006]
    3871:20070619:164413 Query::select i.itemid,i.key_,h.host,h.port,i.delay,i.descriptio n,i.nextcheck,i.type,i.snmp_community,i.snmp_oid,h .useip,h.ip,i.history,i.lastvalue,i.prevvalue,i.ho stid,h.status,i.value_type,h.errors_from,i.snmp_po rt,i.delta,i.prevorgvalue,i.lastclock,i.units,i.mu ltiplier,i.snmpv3_securityname,i.snmpv3_securityle vel,i.snmpv3_authpassphrase,i.snmpv3_privpassphras e,i.formula,h.available,i.status,i.trapper_hosts,i .logtimefmt,i.valuemapid,i.delay_flex,h.dns from hosts h, items i where i.nextcheck<=1182264253 and i.status in (0,3) and i.type not in (2,7,9) and h.status=0 and h.disable_until<=1182264253 and h.errors_from=0 and h.hostid=i.hostid and mod(i.itemid,5)=3 and i.key_ not in ('status','icmpping','icmppingsec','zabbix[log]') and h.hostid>=100000000000000*0 and h.hostid<=(100000000000000*0+99999999999999) order by i.nextcheck
    3871:20070619:164413 Query failed:MySQL server has gone away [2006]
    3872:20070619:164414 Query::select i.itemid,i.key_,h.host,h.port,i.delay,i.descriptio n,i.nextcheck,i.type,i.snmp_community,i.snmp_oid,h .useip,h.ip,i.history,i.lastvalue,i.prevvalue,i.ho stid,h.status,i.value_type,h.errors_from,i.snmp_po rt,i.delta,i.prevorgvalue,i.lastclock,i.units,i.mu ltiplier,i.snmpv3_securityname,i.snmpv3_securityle vel,i.snmpv3_authpassphrase,i.snmpv3_privpassphras e,i.formula,h.available,i.status,i.trapper_hosts,i .logtimefmt,i.valuemapid,i.delay_flex,h.dns from hosts h, items i where i.nextcheck<=1182264254 and i.status in (0,3) and i.type not in (2,7,9) and h.status=0 and h.disable_until<=1182264254 and h.errors_from=0 and h.hostid=i.hostid and mod(i.itemid,5)=4 and i.key_ not in ('status','icmpping','icmppingsec','zabbix[log]') and h.hostid>=100000000000000*0 and h.hostid<=(100000000000000*0+99999999999999) order by i.nextcheck
    3872:20070619:164414 Query failed:MySQL server has gone away [2006]
    3868:20070619:164415 Query::select i.itemid,i.key_,h.host,h.port,i.delay,i.descriptio n,i.nextcheck,i.type,i.snmp_community,i.snmp_oid,h .useip,h.ip,i.history,i.lastvalue,i.prevvalue,i.ho stid,h.status,i.value_type,h.errors_from,i.snmp_po rt,i.delta,i.prevorgvalue,i.lastclock,i.units,i.mu ltiplier,i.snmpv3_securityname,i.snmpv3_securityle vel,i.snmpv3_authpassphrase,i.snmpv3_privpassphras e,i.formula,h.available,i.status,i.trapper_hosts,i .logtimefmt,i.valuemapid,i.delay_flex,h.dns from hosts h, items i where i.nextcheck<=1182264255 and i.status in (0,3) and i.type not in (2,7,9) and h.status=0 and h.disable_until<=1182264255 and h.errors_from=0 and h.hostid=i.hostid and mod(i.itemid,5)=0 and i.key_ not in ('status','icmpping','icmppingsec','zabbix[log]') and h.hostid>=100000000000000*0 and h.hostid<=(100000000000000*0+99999999999999) order by i.nextcheck
    3868:20070619:164415 Query failed:MySQL server has gone away [2006]
    3869:20070619:164416 Query::select i.itemid,i.key_,h.host,h.port,i.delay,i.descriptio n,i.nextcheck,i.type,i.snmp_community,i.snmp_oid,h .useip,h.ip,i.history,i.lastvalue,i.prevvalue,i.ho stid,h.status,i.value_type,h.errors_from,i.snmp_po rt,i.delta,i.prevorgvalue,i.lastclock,i.units,i.mu ltiplier,i.snmpv3_securityname,i.snmpv3_securityle vel,i.snmpv3_authpassphrase,i.snmpv3_privpassphras e,i.formula,h.available,i.status,i.trapper_hosts,i .logtimefmt,i.valuemapid,i.delay_flex,h.dns from hosts h, items i where i.nextcheck<=1182264256 and i.status in (0,3) and i.type not in (2,7,9) and h.status=0 and h.disable_until<=1182264256 and h.errors_from=0 and h.hostid=i.hostid and mod(i.itemid,5)=1 and i.key_ not in ('status','icmpping','icmppingsec','zabbix[log]') and h.hostid>=100000000000000*0 and h.hostid<=(100000000000000*0+99999999999999) order by i.nextcheck
    3869:20070619:164416 Query failed:MySQL server has gone away [2006]
    3890:20070619:164417 Query::select h.hostid,min(i.itemid) from hosts h,items i where mod(h.hostid,1)=0 and i.nextcheck<=1182264257 and i.status in (0) and i.type not in (2,7,9) and h.status=0 and h.disable_until<=1182264257 and h.errors_from!=0 and h.hostid=i.hostid and i.key_ not in ('status','icmpping','icmppingsec','zabbix[log]') and h.hostid>=100000000000000*0 and h.hostid<=(100000000000000*0+99999999999999) group by h.hostid
    After this, the server tries to restart itself :

    3890:20070619:164417 Query failed:MySQL server has gone away [2006]
    3865:20070619:164527 ZABBIX Server stopped
    3916:20070620:092914 Starting zabbix_server. ZABBIX 1.4.
    3916:20070620:092914 **** Enabled features ****
    3916:20070620:092914 SNMP monitoring: YES
    3916:20070620:092914 WEB monitoring: NO
    3916:20070620:092914 Jabber notifications: NO
    3916:20070620:092914 **************************
    3924:20070620:092914 server #6 started [Trapper]
    3925:20070620:092914 server #7 started [Trapper]
    3927:20070620:092914 server #8 started [Trapper]
    3930:20070620:092914 server #9 started [Trapper]
    3932:20070620:092914 server #10 started [Trapper]
    3934:20070620:092914 server #11 started [ICMP pinger]
    3936:20070620:092914 server #12 started [Alerter]
    3938:20070620:092914 server #13 started [Housekeeper]
    3938:20070620:092914 Executing housekeeper
    3941:20070620:092914 server #14 started [Timer]
    cat: write error: Broken pipe
    3946:20070620:092914 server #16 started [Node watcher. Node ID:0]
    3916:20070620:092914 server #0 started [Watchdog]
    No log handling enabled - turning on stderr logging
    Cannot find module (LM-SENSORS-MIB): At line 1 in (none)
    3921:20070620:092914 server #3 started [Poller. SNMP:ON]
    No log handling enabled - turning on stderr logging
    Cannot find module (LM-SENSORS-MIB): At line 1 in (none)
    No log handling enabled - turning on stderr logging
    Cannot find module (LM-SENSORS-MIB): At line 1 in (none)
    No log handling enabled - turning on stderr logging
    Cannot find module (LM-SENSORS-MIB): At line 1 in (none)
    3919:20070620:092914 server #1 started [Poller. SNMP:ON]
    No log handling enabled - turning on stderr logging
    Cannot find module (LM-SENSORS-MIB): At line 1 in (none)
    3922:20070620:092914 server #4 started [Poller. SNMP:ON]
    3943:20070620:092914 server #15 started [Poller for unreachable hosts. SNMP:ON]
    3920:20070620:092914 server #2 started [Poller. SNMP:ON]
    No log handling enabled - turning on stderr logging
    Cannot find module (LM-SENSORS-MIB): At line 1 in (none)
    3948:20070620:092914 server #17 started [Discoverer. SNMP:ON]
    No log handling enabled - turning on stderr logging
    Cannot find module (LM-SENSORS-MIB): At line 1 in (none)
    3923:20070620:092914 server #5 started [Poller. SNMP:ON]
    3938:20070620:092914 Deleted 0 records from history and trends

    But the same problem occurs again...
    It crashes once more...!!

    Any clue guys ???
  • Alexei
    Founder, CEO
    Zabbix Certified Trainer
    Zabbix Certified SpecialistZabbix Certified Professional
    • Sep 2004
    • 5654

    #2
    Who crashes?! MySQL?

    If MySQL is not up ZABBIX cannot do much, it will wait and pray that someone will detect and restart it. You may configure database watchdog to get a message when this happens again, by the way.
    Alexei Vladishev
    Creator of Zabbix, Product manager
    New York | Tokyo | Riga
    My Twitter

    Comment

    • Umair
      Member
      • Feb 2007
      • 86

      #3
      umm,
      Pretty strange, but neither of the two crashes !

      I mean, if MYSQL fails, then Zabbix will wait until MYSQL has been manually restarted.
      If MySQL is not up ZABBIX cannot do much, it will wait and pray that someone will detect and restart it

      But i did not have to restart MYSQL.

      When log generated the error, Zabbix itself restarted the Zabbix Server.!
      There was no indication about MYSQL.

      Do you think that MYSQL also restarts itself automatically ?
      And in such a case should i be assuming that there is a problem with my MYSQL ?

      Comment

      • Alexei
        Founder, CEO
        Zabbix Certified Trainer
        Zabbix Certified SpecialistZabbix Certified Professional
        • Sep 2004
        • 5654

        #4
        The message "Query failed:MySQL server has gone away [2006]" clearly suggegsts that MySQL server was stopped or restarted.
        Alexei Vladishev
        Creator of Zabbix, Product manager
        New York | Tokyo | Riga
        My Twitter

        Comment

        Working...