Ad Widget

Collapse

Simple Checks & Time Out's

Collapse
X
 
  • Time
  • Show
Clear All
new posts
  • entorno
    Junior Member
    • Dec 2006
    • 21

    #1

    Simple Checks & Time Out's

    Hello all...

    We've had Zabbix running a number of days now, and have moved from test box to production, and we are VERY impressed with it. So respect to all the developers that have contributed to the project. Your work is appreciated. (Credit where credit is due)

    The setup is roughly like this. We have created a template that is all simple checks for the likes of plesk, cpanel, http, https, pop3, smtp, ftp, etc, etc. The triggers notify us if any of those services goes down on any of the hosts.

    Now, the problem mainly is with smtp and pop, that when using the likes of:

    {Linux_Simpleop.prev(0)}#1

    (NB: Was an 64bit num.int.)

    that as so many people are checking their mail, and so much spam coming in via smtp, that Zabbix times out the connection and gives a non 1 value, resulting in an alert... so I tried this:

    {Linux_Simpleop_perf.prev(0)}>7

    (NB: Was a num.float)

    This did help, ... saying if after 7 seconds the host did not respond, trigger an alert. But sometimes it still tiggered.

    Ideally, we need to say this:

    "Is the previous value > 7 and the current value > 7 ?? If so, trigger an alert"

    This was we can base the alert happening that twice, the host has timed out after 7 seconds.

    What interests us is what is the default time out of a simple test? Is it editable also? So if just using pop and not pop_perf, how does Zabbix decide that the hosts service has not responded..

    Ok, thats it.. I await your answers

    Regards to you all...

    Entorno Digital EspaƱa
  • entorno
    Junior Member
    • Dec 2006
    • 21

    #2
    Mmmm my trying the following:

    ({Linux_Simpleop_perf.last(0)}>7)&({Linux_Simpleop_perf.prev(0)}>7)

    But it's not triggering when a host's pop service is disabled...

    :-((

    I also notice the values returned are 0

    Any ideas?
    Last edited by entorno; 22-12-2006, 13:30.

    Comment

    • Calimero
      Senior Member
      • Nov 2006
      • 481

      #3
      Return values for simple checks (an ICMP Ping, IIRC):
      1 = OK
      0 = rejected/port closed
      2 = timeout

      As far as I know you'll never have 7.

      Comment

      • entorno
        Junior Member
        • Dec 2006
        • 21

        #4
        Thanks for the reply, and your quite correct, however...

        I'm using xyz_perf , which returns the number of milliseconds it to the host to respond... however, being so stupid, I forgot that the reason it is returning 0 is because it failed to connect, thus the correct line is:

        ({Linux_Simpleop_perf.last(0)}=0)&({Linux_Simpleop_perf.prev(0)}=0)

        Thus if last = 0.002 and prev = 0.013, no problems... server is running..

        This all seems to work now..

        Ciao y saludos
        Last edited by entorno; 22-12-2006, 14:32.

        Comment

        • Calimero
          Senior Member
          • Nov 2006
          • 481

          #5
          Oops ! Read you post too fast and I didn't see you were using _perf

          Comment

          Working...