new installation - hosts added but not monitored

  • lorell
    Junior Member
    • Oct 2005
    • 14

    #1

    new installation - hosts added but not monitored

    I am trying to bring up Zabbix for the first time. I have five hosts, and all I want to do is check whether they are up via ICMP. I cannot figure out how to get the monitoring, triggers and such running.

    Here is what it says in [STATUS OF ZABBIX]:

    Is zabbix_suckerd running ? Yes
    Is zabbix_trapperd running ? Yes
    Number of values stored 0
    Number of trends stored 0
    Number of alarms 208
    Number of alerts 0
    Number of triggers (enabled/disabled) 209(209/0)
    Number of items (active/trapper/not active/not supported) 234(234/0/0/0)
    Number of users 4
    Number of hosts (monitored/not monitored/templates) 7(0/0/1)

    How do I get it to start monitoring these hosts?
  • lorell
    Junior Member
    • Oct 2005
    • 14

    #2
    This may also help in the diagnosis:

    /etc/rc.d/init.d/zabbix_suckerd_ctl start: zabbix_suckerd started
    [root@zabbix ~]# No log handling enabled - turning on stderr logging
    snmpget: Timeout
    snmpget: Timeout
    snmpget: Timeout
    snmpget: Timeout
    snmpget: Timeout
    snmpget: Timeout


    • lorell
      Junior Member
      • Oct 2005
      • 14

      #3
      Here's more info.

      I can see the box making attempts to contact other devices on port 10000, but I don't need any of that extra stuff. The devices are just video encoders and decoders. They won't ever listen on port 10000 for instructions from zabbix.

      Here is my tcpdump. zabbix.emcsol.com is the Zabbix server; the other two addresses, 192.168.1.14 and 192.168.1.12, are devices on the network that I just want to check are alive via ICMP echo requests (pings).

      00:48:32.841514 IP zabbix.emcsol.com.33363 > 192.168.1.14.10000: S 2118923753:2118923753(0) win 5840 <mss 1460,sackOK,timestamp 286730814 0,nop,wscale 2>
      00:48:32.841736 IP 192.168.1.14.10000 > zabbix.emcsol.com.33363: R 0:0(0) ack 2118923754 win 0
      00:48:32.864393 IP zabbix.emcsol.com.33364 > 192.168.1.12.10000: S 2122995833:2122995833(0) win 5840 <mss 1460,sackOK,timestamp 286730837 0,nop,wscale 2>
      00:48:32.885354 IP 192.168.1.12.10000 > zabbix.emcsol.com.33364: R 0:0(0) ack 2122995834 win 0
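In the capture above, each SYN (`S`) from the server to port 10000 is answered with a reset (`R`), which is what a host sends when nothing is listening on that port. The following is a minimal sketch of how such a capture could be scanned for refused ports. The function name `refused_ports` and the regular expression are illustrative assumptions that only cover the old-style tcpdump line format shown in this thread.

```python
import re

# Assumed line shape (old tcpdump TCP output): "time IP src.port > dst.port: FLAGS ..."
LINE_RE = re.compile(
    r"^\S+ IP (?P<src>\S+)\.(?P<sport>\d+) > (?P<dst>\S+)\.(?P<dport>\d+): "
    r"(?P<flags>[SFRP.]+) "
)


def refused_ports(capture):
    """Return the set of (host, port) pairs that answered a SYN with a RST."""
    syn_sent = set()   # (source endpoint, destination endpoint) of every SYN seen
    refused = set()
    for line in capture.splitlines():
        m = LINE_RE.match(line.strip())
        if not m:
            continue  # skip lines that do not look like TCP packets
        src = (m["src"], int(m["sport"]))
        dst = (m["dst"], int(m["dport"]))
        flags = m["flags"]
        if flags == "S":
            syn_sent.add((src, dst))
        elif "R" in flags and (dst, src) in syn_sent:
            # The reset travels back along the path of an earlier SYN
            refused.add(src)
    return refused


SAMPLE = """\
00:48:32.841514 IP zabbix.emcsol.com.33363 > 192.168.1.14.10000: S 2118923753:2118923753(0) win 5840 <mss 1460,sackOK,timestamp 286730814 0,nop,wscale 2>
00:48:32.841736 IP 192.168.1.14.10000 > zabbix.emcsol.com.33363: R 0:0(0) ack 2118923754 win 0
00:48:32.864393 IP zabbix.emcsol.com.33364 > 192.168.1.12.10000: S 2122995833:2122995833(0) win 5840 <mss 1460,sackOK,timestamp 286730837 0,nop,wscale 2>
00:48:32.885354 IP 192.168.1.12.10000 > zabbix.emcsol.com.33364: R 0:0(0) ack 2122995834 win 0
"""

if __name__ == "__main__":
    for host, port in sorted(refused_ports(SAMPLE)):
        print(f"{host} refused a connection on port {port}")
```

A reset on port 10000 says nothing about whether the device itself is up; it only shows that no agent is listening there.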


      • lorell
        Junior Member
        • Oct 2005
        • 14

        #4
        Also, my triggers are not sending me alerts about these devices, which are down (or at least which Zabbix thinks are down). How do I get a good set of generic triggers/alerts applied to each host?

        The zabbix manual is very terse about these topics and seems to assume that I will be running zabbix_agentd on each of my monitored devices.

        Thanks!

        Lorell


        • lorell
          Junior Member
          • Oct 2005
          • 14

          #5
          Another piece of the puzzle.

          [root@zabbix log]# ps aux | grep zabbix
          zabbix 2253 0.0 0.1 2780 772 ? SN Oct13 0:03 /usr/local//zabbix/bin/zabbix_agentd
          zabbix 2255 0.0 0.1 2780 604 ? SN Oct13 0:00 /usr/local//zabbix/bin/zabbix_agentd
          zabbix 2256 0.0 0.1 2780 604 ? SN Oct13 0:00 /usr/local//zabbix/bin/zabbix_agentd
          zabbix 2257 0.0 0.1 2780 604 ? SN Oct13 0:00 /usr/local//zabbix/bin/zabbix_agentd
          zabbix 2258 0.0 0.1 2780 604 ? SN Oct13 0:00 /usr/local//zabbix/bin/zabbix_agentd
          zabbix 2259 0.0 0.1 2780 604 ? SN Oct13 0:00 /usr/local//zabbix/bin/zabbix_agentd
          zabbix 2265 0.0 0.1 4244 780 ? S Oct13 0:00 /usr/local/zabbix/bin/zabbix_trapperd
          zabbix 2361 0.0 0.1 4244 780 ? S Oct13 0:00 /usr/local/zabbix/bin/zabbix_trapperd
          zabbix 2362 0.0 0.1 4244 780 ? S Oct13 0:00 /usr/local/zabbix/bin/zabbix_trapperd
          zabbix 2612 0.0 0.1 4244 780 ? S Oct13 0:00 /usr/local/zabbix/bin/zabbix_trapperd
          zabbix 2613 0.0 0.1 4244 780 ? S Oct13 0:00 /usr/local/zabbix/bin/zabbix_trapperd
          zabbix 2614 0.0 0.1 4244 780 ? S Oct13 0:00 /usr/local/zabbix/bin/zabbix_trapperd
          zabbix 18812 0.0 0.4 7924 2196 ? S 00:39 0:00 /usr/local/zabbix/bin/zabbix_suckerd
          zabbix 18814 0.0 0.4 7924 2184 ? S 00:39 0:00 /usr/local/zabbix/bin/zabbix_suckerd
          zabbix 18815 0.0 0.4 7924 2176 ? S 00:39 0:00 /usr/local/zabbix/bin/zabbix_suckerd
          zabbix 18816 0.0 0.4 7924 2188 ? S 00:39 0:00 /usr/local/zabbix/bin/zabbix_suckerd
          zabbix 18817 0.8 0.4 7924 2336 ? S 00:39 0:12 /usr/local/zabbix/bin/zabbix_suckerd
          root 19174 0.0 0.1 3896 672 pts/1 S+ 01:03 0:00 grep zabbix


          • lorell
            Junior Member
            • Oct 2005
            • 14

            #6
            BTW I am running with MySQL on a CentOS 4.1 machine.


            • elkor
              Senior Member
              • Jul 2005
              • 299

              #7
              OK, you don't state what you added the hosts as, what items they are watching, or what version of zabbix you are running.

              I'm going to assume you are running version 1.0 because you mention seeing port 10000 communications. This was the 1.0 default (although old items still use it on the beta versions). The presence of port 10000 communications suggests that you are checking items of a "zabbix agent" type; furthermore, some of your other logs suggest that you are attempting to connect and retrieve data via SNMP.

              Assuming your initial post was correct and all you are looking for is ICMP ping, then you don't need any of these items listed under the host. You can deactivate them, remove them, or re-create the host using either no template or the "standalone" one.

              If all you are looking for is ping, then you only need that one item under each host to start with, and it should be of the "simple check" type. The syntax for it is located here: http://www.zabbix.com/manual/v1.1/config_items.php under the "simple checks" section. You also need the fping binary installed on the system, and the path to it should be set in the server's configuration file (default is /usr/bin/fping).
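Before relying on the simple check, it may be worth confirming that fping itself can reach the devices from the server. The sketch below is one way to do that, assuming fping's default per-host output of the form `<host> is alive` or `<host> is unreachable`. The helper names `parse_fping` and `ping_hosts` are made up for illustration, and the path is the default mentioned above.

```python
import subprocess


def parse_fping(output):
    """Turn fping's default output into a {host: is_alive} mapping."""
    status = {}
    for line in output.splitlines():
        host, sep, state = line.strip().partition(" is ")
        if sep:  # ignore any line that is not of the form "<host> is <state>"
            status[host] = (state == "alive")
    return status


def ping_hosts(hosts, fping_path="/usr/bin/fping"):
    """Run fping against the given hosts and return {host: is_alive}.

    fping exits non-zero when some hosts are unreachable, so the exit
    status is deliberately not treated as an error here.
    """
    proc = subprocess.run(
        [fping_path, *hosts], capture_output=True, text=True
    )
    return parse_fping(proc.stdout)


if __name__ == "__main__":
    # Addresses taken from the tcpdump earlier in the thread
    for host, alive in ping_hosts(["192.168.1.14", "192.168.1.12"]).items():
        print(host, "alive" if alive else "unreachable")
```

If this reports the devices as alive but Zabbix still shows them as unreachable, the problem is more likely in the item configuration than in the network.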

              hope this helps


              • lorell
                Junior Member
                • Oct 2005
                • 14

                #8
                Thanks for taking the time to help me out.

                Yes, I am using Zabbix 1.0.

                I did not intentionally turn on the SNMP stuff, but I do notice that I get a lot of the snmpget timeout errors. Even when I use snmpget from the command line, I get those snmpget timeout errors. But, as I say, I'm not looking for snmp from these devices. (I will want to receive snmp traps from these devices later.)

                Also, I added the hosts as Host.standalone.

                Here is what is in my hosts section of zabbix:

                10016 Decoder 10000 Unreachable Change
                10017 Encoder 10000 Unreachable Change
                10018 Gateway 10000 Unreachable Change
                10015 MPC 10000 Unreachable Change
                10020 Wireless_AP 10000 Unreachable Change
                10019 Wireless_SM 10000 Unreachable Change
                10004 ZZ-Host-Template 10000 Template Change


                I modified the Host.standalone template down to just the following:

                ZZ-Host-Template

                Id Key Description Update interval History Shortname Type Status Actions
                17219 Defaults 30 365 ZZ-Host-Template: Zabbix agent Active Change
                18316 icmpping Ping (ICMP Echo) 30 365 ZZ-Host-Template:icmpping Zabbix agent Active Change
                18317 icmppingsec Ping (ICMP Echo) timed 30 365 ZZ-Host-Template:icmppingsec Zabbix agent Active Change
                18314 ping Ping to the server (TCP) 30 365 ZZ-Host-Template:ping Zabbix agent Active Change

                The URL is here: http://66.143.101.105/zabbix


                • elkor
                  Senior Member
                  • Jul 2005
                  • 299

                  #9
                  OK, the "MPC" host looks to be the one that is configured for SNMP and is the reason for all the error messages in that respect.

                  It's a little hard to read, but it looks like you have the correct items configured; they are just set to the "zabbix agent active" type instead of "simple check". Try clicking on the "Change" link and modifying the type from the drop-down box.
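The mismatch described here (ICMP items whose type is not "simple check") is easy to spot mechanically. The following is only a sketch: the item records are a hypothetical in-memory copy of the listing from post #8, not something read from Zabbix, and `find_misconfigured` is an illustrative name.

```python
# Keys that, per this thread, should be checked by the server itself
SIMPLE_CHECK_KEYS = {"icmpping", "icmppingsec"}


def find_misconfigured(items):
    """Return the keys of ICMP items whose type is not 'Simple check'."""
    return [
        item["key"]
        for item in items
        if item["key"] in SIMPLE_CHECK_KEYS
        and item["type"].lower() != "simple check"
    ]


# Hypothetical copy of the ZZ-Host-Template listing from post #8
ITEMS = [
    {"id": 18316, "key": "icmpping", "type": "Zabbix agent"},
    {"id": 18317, "key": "icmppingsec", "type": "Zabbix agent"},
    {"id": 18314, "key": "ping", "type": "Zabbix agent"},
]

if __name__ == "__main__":
    for key in find_misconfigured(ITEMS):
        print(f"{key}: change type to 'Simple check'")
```

The `ping` item is left alone because it is an agent check; only the two ICMP keys need their type changed.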


                  • lorell
                    Junior Member
                    • Oct 2005
                    • 14

                    #10
                    Thanks for the help!

                    That was exactly the push that I needed. It is collecting stats now and seems to be doing all of the things that I would expect it to do.

                    Thanks again!

                    Lorell
