Actions sent for some but not others

  • cirrhus9.com
    Member
    • Feb 2012
    • 58

    #1

    Actions sent for some but not others

    Hello:
    We have zabbix server 1.8.10 on a CentOS 5.9 host.
    We have 76 hosts in 9 host groups. (This doesn't include the zabbixserver in the zabbixserver group)

    We have an [email protected] media that is present in all our Actions. One to the client and one to us.
    Those all are working.

    The ones we have labeled "c9Internal" were noticed today not to be sending. These only have one media - [email protected]

    We moved some servers from one grid to another. I changed the hosts' IP to point to the new ones. One server did not get migrated and remained on the old host.

    I enabled this group of hosts after making the changes, and the one left on the old host showed up in the Dashboard as having an issue, but no action/alert was ever sent. That is when this issue became apparent.

    I have taken a client's working Action and cloned it and removed their media and assigned it an "AND / OR (A) and (B)" Action condition with the following criteria:
    Trigger value = "PROBLEM"
    Host group = "c9Internal"

    But this problem persists.

    Some interesting issues I notice in the zabbix_server.log are:
    Almost 8000 of these entries for these 2 hosts.
    Code:
    2227:20140207:064057.791 Sending list of active checks to [184.73.207.18] failed: host [web] not found
    2227:20140207:064146.181 Sending list of active checks to [75.101.139.254] failed: host [cirrhus9b] not found
    Those are very old hosts I used back when I originally started setting up this server. And...
    Code:
    2227:20140212:141631.362 [Z3005] query failed: [2006] MySQL server has gone away [select hostid,status from hosts where host='cirrhus9b' and status in (0,1) and proxy_hostid=0 and hostid between 000000000000000 and 099999999999999]
    2252:20140212:141631.716 [Z3005] query failed: [2006] MySQL server has gone away [begin;]
    2248:20140212:141631.736 [Z3005] query failed: [2006] MySQL server has gone away [begin;]
    2256:20140212:141632.136 [Z3005] query failed: [2006] MySQL server has gone away [select escalationid,actionid,triggerid,eventid,r_eventid,esc_step,status from escalations where status in (0,4,5,1) and nextcheck<=1392243392 and escalationid between 000000000000000 and 099999999999999]
    2231:20140212:141634.426 [Z3005] query failed: [2006] MySQL server has gone away [select hostid,status from hosts where host='Alfresco' and status in (0,1) and proxy_hostid=0 and hostid between 000000000000000 and 099999999999999]
    2244:20140212:141634.449 [Z3005] query failed: [2006] MySQL server has gone away [select t.httptestid,t.name,t.applicationid,t.nextcheck,t.status,t.macros,t.agent,t.authentication,t.http_user,t.http_password from httptest t,applications a,hosts h where t.applicationid=a.applicationid and a.hostid=h.hostid and t.nextcheck<=1392243394 and mod(t.httptestid,1)=0 and t.status=0 and h.status=0 and (h.maintenance_status=0 or h.maintenance_type=0) and t.httptestid between 000000000000000 and 099999999999999]
    2250:20140212:141636.745 [Z3005] query failed: [2006] MySQL server has gone away [begin;]
    2229:20140212:141636.897 [Z3005] query failed: [2006] MySQL server has gone away [select hostid,status from hosts where host='s-db2' and status in (0,1) and proxy_hostid=0 and hostid between 000000000000000 and 099999999999999]
    I haven't seen any further "MySQL server has gone away" messages in the server.log since rebooting.
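
    To check whether the "MySQL server has gone away" errors come in bursts or trickle in steadily, the entries can be tallied per minute. This is only a rough sketch: it assumes the log prefix is PID:YYYYMMDD:HHMMSS.mmm as in the excerpt above, and the function name is made up for illustration.

```python
import re
from collections import Counter

# Assumed log prefix, taken from the excerpt above: PID:YYYYMMDD:HHMMSS.mmm
GONE_AWAY = re.compile(
    r"^\s*\d+:(?P<date>\d{8}):(?P<hhmm>\d{4})\d{2}\.\d{3} "
    r"\[Z3005\] query failed: \[2006\] MySQL server has gone away"
)


def gone_away_per_minute(lines):
    """Count 'MySQL server has gone away' entries per (date, HHMM) minute."""
    counts = Counter()
    for line in lines:
        match = GONE_AWAY.match(line)
        if match:
            counts[(match.group("date"), match.group("hhmm"))] += 1
    return counts


# Two entries from the excerpt above (query text shortened to "begin;").
sample = [
    "2252:20140212:141631.716 [Z3005] query failed: [2006] MySQL server has gone away [begin;]",
    "2250:20140212:141636.745 [Z3005] query failed: [2006] MySQL server has gone away [begin;]",
]
for (date, minute), n in sorted(gone_away_per_minute(sample).items()):
    print(date, minute, n)
```

    Passing an open file handle for zabbix_server.log instead of the sample list works the same way, since the function only iterates over lines.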
    I thought maybe it was the "internal" designation in the identifier, so I moved our hosts to a "Test" group and altered the action but it didn't make any difference.

    I've tried adding new individual media to our actions and even a foreign non work-related email host. None is ever received.

    Can anyone shed some light on what may be causing this or some additional troubleshooting steps I may take to resolve this?
    I'd sure appreciate it.

    Thank you for your time.
    Last edited by cirrhus9.com; 13-02-2014, 02:53.
  • fgallese
    Junior Member
    Zabbix Certified Specialist
    • Sep 2009
    • 20

    #2
    Regarding your problem with notifications, I suggest checking that the user you configured in the Action to be notified has the right permissions over the Host/HostGroup being alarmed.

    You are probably getting the "Sending list of active checks to.." error message because the "Hostname=" setting in the zabbix_agentd.conf file does not match the Hostname configured in the Zabbix Frontend. Those two have to be the same, otherwise this error appears.
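
    To see which agents are still announcing a Hostname the server does not know, the "not found" entries can be tallied per hostname and source IP. A minimal sketch, assuming the message format matches the two log lines quoted in the first post; the function name is hypothetical.

```python
import re
from collections import Counter

# Matches entries like:
#   ... Sending list of active checks to [184.73.207.18] failed: host [web] not found
NOT_FOUND = re.compile(
    r"Sending list of active checks to \[(?P<ip>[^\]]+)\] failed: "
    r"host \[(?P<host>[^\]]+)\] not found"
)


def count_unknown_hosts(lines):
    """Count 'host not found' entries per (hostname, source IP) pair."""
    counts = Counter()
    for line in lines:
        match = NOT_FOUND.search(line)
        if match:
            counts[(match.group("host"), match.group("ip"))] += 1
    return counts


# The two entries quoted in the first post.
sample = [
    "2227:20140207:064057.791 Sending list of active checks to [184.73.207.18] failed: host [web] not found",
    "2227:20140207:064146.181 Sending list of active checks to [75.101.139.254] failed: host [cirrhus9b] not found",
]
for (host, ip), n in count_unknown_hosts(sample).most_common():
    print(n, host, ip)
```

    Each (hostname, IP) pair that shows up points at an agent whose "Hostname=" value should either be added to the Frontend or corrected (or the agent stopped, if the host is retired).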

    Regarding the MySQL error, I myself have the same problem and I haven't found a solution yet; however, this doesn't affect the server in a noticeable way.


    • cirrhus9.com
      Member
      • Feb 2012
      • 58

      #3
      Originally posted by fgallese
      Regarding your problem with notifications, I suggest checking that the user you configured in the Action to be notified has the right permissions over the Host/HostGroup being alarmed.
      Read/write on all hosts and hostgroups.
      Originally posted by fgallese
      You are probably getting the "Sending list of active checks to.." error message because the "Hostname=" setting in the zabbix_agentd.conf file does not match the Hostname configured in the Zabbix Frontend. Those two have to be the same, otherwise this error appears.
      All the same, though I'll have to re-verify, as the zabbix server is on the "old" grid and will be moved soon.

      Originally posted by fgallese
      Regarding the MySQL error, I myself have the same problem and I haven't found a solution yet; however, this doesn't affect the server in a noticeable way.
      Well, I guess it's not "me" and that's ok.

      I got alerts for the host left on the old grid overnight. I guess it's a "good thing" as it's not affecting clients in any way, just our internal stuff.

      Thank you.
      Last edited by cirrhus9.com; 13-02-2014, 16:24.


      • fgallese
        Junior Member
        Zabbix Certified Specialist
        • Sep 2009
        • 20

        #4
        I think I found a solution for the MySQL problem:

        On the zabbix_server host I had a different (older) version of MySQL than the one running on the DB server.

        This was the app Server:

        [root@myServer myUser]# mysql --version
        mysql Ver 14.14 Distrib 5.1.69, for redhat-linux-gnu (x86_64) using readline 5.1

        And this is the DB server:

        [root@myServer myUser]# mysql --version
        mysql Ver 14.14 Distrib 5.5.34, for Linux (x86_64) using readline 5.1

        So I ran "yum update mysql" on the first one and restarted the zabbix_server application.

        The problem hasn't reappeared yet; it seems the update fixed it.
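
        For anyone comparing their own hosts, the "Distrib" part of the "mysql --version" output can be compared programmatically. This is just a sketch built on the two outputs above; the helper names are made up, and whether a client/server mismatch really causes the "gone away" errors is only my guess from this one case.

```python
import re


def distrib_version(version_line):
    """Extract the 'Distrib x.y.z' part of `mysql --version` output as a tuple of ints."""
    match = re.search(r"Distrib (\d+)\.(\d+)\.(\d+)", version_line)
    if match is None:
        raise ValueError(f"no Distrib version in: {version_line!r}")
    return tuple(int(part) for part in match.groups())


def same_release_series(line_a, line_b):
    """True when both outputs report the same major.minor series (e.g. 5.5.x)."""
    return distrib_version(line_a)[:2] == distrib_version(line_b)[:2]


# The two outputs shown above.
app_server = "mysql Ver 14.14 Distrib 5.1.69, for redhat-linux-gnu (x86_64) using readline 5.1"
db_server = "mysql Ver 14.14 Distrib 5.5.34, for Linux (x86_64) using readline 5.1"

print(distrib_version(app_server))                  # (5, 1, 69)
print(distrib_version(db_server))                   # (5, 5, 34)
print(same_release_series(app_server, db_server))   # False
```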


        • cirrhus9.com
          Member
          • Feb 2012
          • 58

          #5
          That's good news!

          Maybe some internal process in the update corrected something not evident by external examination?

          Maybe a repair option is necessary even though I have no logging that says so? Just an idea.

          Code:
          mysql> status;
          --------------
          mysql  Ver 14.12 Distrib 5.0.95, for redhat-linux-gnu (x86_64) using readline 5.1
          CentOS release 5.9 (Final)
          yum update mysql yields "No Packages marked for Update"
          but yum update shows "centos-release-notes 5.10-0", and this says mysql has a 5.5 update in there.

          Thanks!
          JJ
