Server is unreachable [only after 5 min]

  • ittec
    Member
    • Mar 2008
    • 73

    #1

    Server is unreachable [only after 5 min]

    Hi

    We are experiencing a problem with the "Server is unreachable" trigger. When a host is very busy, Zabbix Server for some reason can't establish a connection with Zabbix Agentd for a few seconds; the trigger then fires and the action starts, sending SMS and email.

    I'm looking for a way to keep monitoring the failed host, but I only want alerts once a reasonable amount of time has passed.

    Maybe by adding more conditions to the standard trigger?

    {Template_Linux:status.last(0)}=2&{Template_Linux:status.min(120)}=2

    Any ideas, guys?

    Thanks
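
One way to read the combined expression proposed above: the alert should fire only when the most recent status value is 2 and the lowest status value seen over the last 120 seconds is also 2, i.e. every sample in the window reported the host as unreachable. The Python sketch below models that reading outside Zabbix. The helper names (`values_in_window`, `sustained_unreachable`), the sample layout and the inclusive window bounds are assumptions made for illustration, not Zabbix internals; the value 2 is simply the one the expression in the post compares against.

```python
from typing import List, Tuple

# (timestamp in seconds, status value); hypothetical layout for this sketch
Sample = Tuple[float, int]

UNREACHABLE = 2  # value the expression in the post compares against


def values_in_window(samples: List[Sample], now: float, seconds: float) -> List[int]:
    """Values whose timestamp falls within the last `seconds` seconds (inclusive)."""
    return [value for ts, value in samples if now - seconds <= ts <= now]


def sustained_unreachable(samples: List[Sample], now: float, seconds: float = 120) -> bool:
    """Model of: status.last(0) = 2  AND  status.min(seconds) = 2."""
    if not samples:
        return False  # nothing collected yet, so nothing to alert on in this model
    last_value = max(samples, key=lambda s: s[0])[1]
    window = values_in_window(samples, now, seconds)
    return last_value == UNREACHABLE and bool(window) and min(window) == UNREACHABLE


# A brief blip: only the newest sample is 2, so the window minimum is still 0.
blip = [(0, 0), (30, 0), (60, 0), (90, 0), (120, 2)]
# A sustained outage: every sample in the last 120 s is 2.
outage = [(0, 2), (30, 2), (60, 2), (90, 2), (120, 2)]

print(sustained_unreachable(blip, now=120))    # False
print(sustained_unreachable(outage, now=120))  # True
```

In this model a single failed check does not alert, because older healthy samples are still inside the 120-second window.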
  • MrKen
    Senior Member
    • Oct 2008
    • 652

    #2
    This is what I use:
    ({ZabbixServer:agent.ping.last(0)}#1)|({ZabbixServer:agent.ping.nodata(180)}=1)

    It seems to work OK; however, I wonder whether I should replace the OR operator with an AND operator. But, as I say, it works. You can change 180 to 300 (5 minutes) or whatever.

    MrKen
    Last edited by MrKen; 27-10-2009, 10:47. Reason: remove link from cut n paste
    Disclaimer: All of the above is pure speculation.
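
The OR-versus-AND question raised above can be explored with a small truth-table style model. The Python sketch below treats `last(0)` as the newest stored value and `nodata(180)` as 1 when no value arrived in the last 180 seconds. The function names, the sample layout and the handling of an empty history are assumptions for illustration only, not a description of how Zabbix evaluates triggers internally.

```python
from typing import List, Optional, Tuple

Sample = Tuple[float, int]  # (timestamp in seconds, agent.ping value); hypothetical layout


def last(samples: List[Sample]) -> Optional[int]:
    """Newest stored value, or None if nothing has been stored."""
    return max(samples, key=lambda s: s[0])[1] if samples else None


def nodata(samples: List[Sample], now: float, seconds: float) -> int:
    """1 if no value arrived during the last `seconds` seconds, else 0."""
    return 0 if any(now - seconds < ts <= now for ts, _ in samples) else 1


def fires(samples: List[Sample], now: float, seconds: float = 180,
          use_and: bool = False) -> bool:
    """Model of (last(0) # 1) | (nodata(seconds) = 1), or the & variant."""
    newest = last(samples)
    not_one = newest is not None and newest != 1
    silent = nodata(samples, now, seconds) == 1
    return (not_one and silent) if use_and else (not_one or silent)


healthy = [(0, 1), (60, 1), (120, 1)]   # agent answering every minute
went_quiet = [(0, 1), (60, 1)]          # agent stopped reporting after t=60

print(fires(healthy, now=130))                    # False
print(fires(went_quiet, now=300))                 # True  (OR: nodata alone is enough)
print(fires(went_quiet, now=300, use_and=True))   # False (AND: last value is still 1)
```

In this simplified model the AND variant stays silent when the agent stops reporting while the last stored value is still 1, which suggests OR is the operator that matches the stated intent. Real Zabbix behaviour may differ in edge cases.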


    • ittec
      Member
      • Mar 2008
      • 73

      #3
      Hi MrKen

      ({ZabbixServer:agent.ping.last(0)}#1)|({ZabbixServer:agent.ping.nodata(180)}=1)
      Thanks for your reply. One question:

      1. Why do you use "#" in the first part of the trigger? Is it a mistake, or does it have a real meaning?

      So if your expression works, maybe I can use it for other triggers with the same intention (avoiding false warnings):

      {Template_Linux:proc.num[httpd].last(0)}<1&{Template_Linux:proc.num[httpd].delta(180)}<1
      2. Can it be changed to...?

      {Template_Linux:proc.num[httpd].last(0)}<1&{Template_Linux:proc.num[httpd].nodata(180)}<1

      Thanks

      PS: I'm testing your rule and I think it is working well.
      Last edited by ittec; 27-10-2009, 16:53.
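
The two variants in question 2 do not appear to express the same condition. As far as the trigger functions go, `delta(180)` is the difference between the largest and smallest value in the last 180 seconds, while `nodata(180)` only reports whether any value arrived at all. The Python sketch below contrasts them on made-up `proc.num[httpd]` samples. The helper names, the sample layout and the inclusive window bounds are assumptions for illustration.

```python
from typing import List, Tuple

Sample = Tuple[float, int]  # (timestamp in seconds, process count); hypothetical layout


def window(samples: List[Sample], now: float, seconds: float) -> List[int]:
    """Values collected within the last `seconds` seconds (inclusive bounds assumed)."""
    return [value for ts, value in samples if now - seconds <= ts <= now]


def fires_with_delta(samples: List[Sample], now: float, seconds: float = 180) -> bool:
    """Model of: last(0) < 1  AND  delta(seconds) < 1, with delta = max - min."""
    values = window(samples, now, seconds)
    if not values:
        return False  # no data in the window: treated as "cannot evaluate" here
    newest = max(samples, key=lambda s: s[0])[1]
    return newest < 1 and (max(values) - min(values)) < 1


def fires_with_nodata(samples: List[Sample], now: float, seconds: float = 180) -> bool:
    """Model of: last(0) < 1  AND  nodata(seconds) < 1 (i.e. data did arrive)."""
    if not samples:
        return False
    newest = max(samples, key=lambda s: s[0])[1]
    nodata = 0 if window(samples, now, seconds) else 1
    return newest < 1 and nodata < 1


just_died = [(0, 3), (60, 3), (120, 3), (180, 0)]   # httpd vanished at the newest check
still_dead = [(0, 0), (60, 0), (120, 0), (180, 0)]  # httpd absent for the whole window

print(fires_with_delta(just_died, now=180))    # False: values changed inside the window
print(fires_with_nodata(just_died, now=180))   # True: fires on the first 0
print(fires_with_delta(still_dead, now=180))   # True: 0 and unchanged for 180 s
```

Under these assumptions the `nodata` variant adds no delay, since its second condition is true whenever data keeps arriving; the `delta` variant is the one that waits for the value to stay at 0 across the window.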


      • MrKen
        Senior Member
        • Oct 2008
        • 652

        #4
        Originally posted by ittec

        1. Why do you use "#" in the first part of the trigger? Is it a mistake, or does it have a real meaning?
        # means "not equal to".

        See the latest manual p.123 Section 4.13.1 Expressions for Triggers.

        MrKen
        Disclaimer: All of the above is pure speculation.


        • subba5678
          Senior Member
          • May 2010
          • 132

          #5
          Hi MrKen,
          Can you please explain the exact difference between the two templates below? I want to monitor the Zabbix agent. Please tell me which one I need to use.

          {Template_Windows:status.min(300)}=2

          {ZabbixServer:agent.ping.last(0)}



          Thanks,
          Subbu


          • MrKen
            Senior Member
            • Oct 2008
            • 652

            #6
            I shall answer in reverse order.

            {ZabbixServer:agent.ping.last(0)}
            If ping is successful it will return a value of 1, if ping fails the value is 0. So, if the last value is 0 (zero), the trigger will fire.

            {Template_Windows:status.min(300)}=2
            The use of 'status' for host availability is not recommended, says Alexei, so I don't use it at all! See here: http://www.zabbix.com/forum/showpost...59&postcount=4

            For availability of zabbix_agent, I monitor 'agent.ping' and also 'icmpping'. I set the trigger on icmpping like this:
            ({Host:icmpping.max(300)}=0)|({Host:icmpping.nodata(300)}=1)

            This means that if the maximum value is 0 (ping fail) for 5 minutes OR there is nodata for 5 minutes, alert me.

            MrKen
            Disclaimer: All of the above is pure speculation.
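
The effect of the five-minute window in the icmpping trigger above can be illustrated with a small simulation. The Python sketch below walks forward one minute at a time and reports the first moment the condition "maximum over 300 seconds is 0, OR no data for 300 seconds" becomes true. The function names (`alert`, `first_alert`), the one-minute sampling, the inclusive window bounds and the sample data are all assumptions for illustration.

```python
from typing import List, Optional, Tuple

Sample = Tuple[int, int]  # (seconds since start, icmpping value: 1 = reply, 0 = no reply)


def alert(samples: List[Sample], now: int, seconds: int = 300) -> bool:
    """Model of (max(seconds) = 0) | (nodata(seconds) = 1)."""
    values = [value for ts, value in samples if now - seconds <= ts <= now]
    if not values:
        return True           # nodata branch: nothing arrived in the window
    return max(values) == 0   # max branch: every ping in the window failed


def first_alert(samples: List[Sample], seconds: int = 300, step: int = 60,
                horizon: int = 900) -> Optional[int]:
    """Walk forward in time and return the first moment the model alerts, if any."""
    for now in range(0, horizon + 1, step):
        seen = [s for s in samples if s[0] <= now]  # only data known at `now`
        if seen and alert(seen, now, seconds):
            return now
    return None


ticks = range(0, 901, 60)
blip = [(t, 0 if t in (120, 180) else 1) for t in ticks]   # two failed pings
outage = [(t, 1 if t < 120 else 0) for t in ticks]          # fails from t=120 onward
silence = [(0, 1), (60, 1)]                                 # data stops after t=60

print(first_alert(blip))     # None
print(first_alert(outage))   # 420
print(first_alert(silence))  # 420
```

In this model a two-ping blip never alerts, while a sustained outage or total silence alerts only once no successful ping remains inside the five-minute window.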


            • EnigmA-X
              Senior Member
              Zabbix Certified Specialist
              • Oct 2010
              • 116

              #7
              Originally posted by subba5678
              Hi MrKen,
              Thanks for the reply. I want to use the max expression to monitor any service or availability. For that I am using the template below, but I am getting false alerts from it. Please help me out.

              {ATL640:perf_counter["\Processor(_Total)\% Processor Time"].max(3600)}>90


              Thanks,
              Subbu
              I would like to kindly suggest that you start reading here and stop hijacking forum topics from other users, as this is really not polite.

              Please do not expect that other people will troubleshoot all your issues, without trying to solve them yourself. For what it's worth, the syntax of your trigger (not template) looks wrong to me.


              • subba5678
                Senior Member
                • May 2010
                • 132

                #8
                Hi EnigmA-X,

                Before posting on the forum I tried different ways to solve the issue myself. When someone asks for help, it means they have already tried everything they could and were not able to fix the issue; only then do people turn to the forum. The forum is a place where we can find solutions to all kinds of problems; if it were not, why would we need Google, search engines, etc.? I am not hijacking any threads. I am trying to see what issues users are facing while using Zabbix. I have already gone through the Zabbix manual. In my opinion Zabbix is a very good monitoring tool. We have already put Zabbix into production, and my intention is to avoid any downtime. If you are interested and ready to help, please help me. Thanks for understanding.

                Thanks,
                Subbu


                • EnigmA-X
                  Senior Member
                  Zabbix Certified Specialist
                  • Oct 2010
                  • 116

                  #9
                  Subbu,

                  OK, for a simple start: by hijacking forum topics, I mean that you are putting *your* questions in *someone else's* topic.

                  If you have a problem or a question, please start a new forum topic. This keeps the forum clean, readable, but even more important: searchable for other people with the same questions/problems.

                  So...please put your question in a new topic and I'm sure there are plenty of nice people willing to help you!


                  • subba5678
                    Senior Member
                    • May 2010
                    • 132

                    #10
                    Hi EnigmA-X,

                    I am posting my questions in other people's threads because I had already tried every procedure described in that thread and still could not work it out; that is why I posted there. I agree with your point about starting a new topic. However, if someone has already posted on the same subject in one thread, there is a chance that other people won't respond to a new one, since the solution was already posted. If there are no posts related to my question, I will start a new thread. I posted here because I found my issue to be closely related to this post. If you have a solution for the issue below, please help me.


                    {TAL720:perf_counter["\Processor(_Total)\% Processor Time"].max(3600)}>90

                    Thanks,
                    Subbu


                    • drumspirit
                      Junior Member
                      • Mar 2014
                      • 13

                      #11
                      For availability of zabbix_agent, I monitor 'agent.ping' and also 'icmpping'. I set the trigger on icmpping like this:
                      ({Host:icmpping.max(300)}=0)|({Host:icmpping.nodata(300)}=1)

                      For me, the nodata(300)=1 trigger doesn't work.
                      If I set it to "1", the trigger doesn't fire at all. If I set it to "0", the trigger fires after 30 seconds.


                      • ingus.vilnis
                        Senior Member
                        Zabbix Certified Trainer
                        Zabbix Certified Specialist
                        Zabbix Certified Professional
                        • Mar 2014
                        • 908

                        #12
                        Hi,

                        To check the availability of zabbix_agentd you can also use a Zabbix internal item with the key:
                        Code:
                        zabbix[host,agent,available]
                        This item returns 1 when the agent is available and 0 when it is unavailable, which also makes the trigger pretty simple:
                        Code:
                        {<host or template>:zabbix[host,agent,available].last(300)}=0
                        You can read more on internal items here: https://www.zabbix.com/documentation...types/internal

                        Best Regards,
                        Ingus
