flap detection

  • fab
    Junior Member
    • Jun 2005
    • 22

    #1

    flap detection

    Hi,

    I played around a bit with triggers to give zabbix some kind of flap detection. It basically works, but there is a flea that I don't understand.

    How it works
    Let's suppose there is an item for host flap called flap[test] with the following trigger:

    ({flap:flap[test].prev(0)}+{flap:flap[test].abschange(0)})>74

    Values for flap[test] are being delivered through zabbix_sender for testing with the following script:

    Code:
    for rst in 67 77 46 79 64 77 57 76 61 76 64 78 73 76; do sleep 28 ; zabbix_sender server 10001 "flap:flap[test]" "$rst" ; done
    The values represent the percentage of CPU in use on flap, and I want to be emailed if 75% or more of the CPU is being used.

    What it does
    The trigger stays on as long as the next but one value doesn't trigger. So we give the service some time to stabilize before assuming everything is okay. You can mitigate the situation by appending '& {flap:flap[test].last(0)}>70' to the trigger, so that flapping is only detected for values above 70.

    In our example that would mean that the trigger stays off until the first 77 and then stays on for the whole test series, because the trigger assumes the service is flapping.
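
To follow the behaviour described above by hand, here is a minimal Python sketch that models the trigger arithmetic over the test series. It is only a model, not how Zabbix evaluates triggers internally: it assumes prev(0) is the previous value and abschange(0) is the absolute difference between the last and the previous value. The names `flap_trigger`, `threshold` and `floor` are made up for this illustration.

```python
from typing import List, Optional

# Test values sent with zabbix_sender in the post above.
SERIES = [67, 77, 46, 79, 64, 77, 57, 76, 61, 76, 64, 78, 73, 76]


def flap_trigger(values: List[int], threshold: int = 74,
                 floor: Optional[int] = None) -> List[Optional[bool]]:
    """Model 'prev + abschange > threshold' for each incoming value.

    floor models the optional '& last(0) > 70' clause; None disables it.
    The first sample has no predecessor, so its state is None (undecided).
    """
    states: List[Optional[bool]] = [None]
    for prev, last in zip(values, values[1:]):
        # Assumed semantics: abschange = |last - prev|
        fired = prev + abs(last - prev) > threshold
        if floor is not None:
            fired = fired and last > floor
        states.append(fired)
    return states


if __name__ == "__main__":
    for value, state in zip(SERIES, flap_trigger(SERIES)):
        print(value, state)
```

Under these assumptions the state is undecided for the first 67 and on from the first 77 onward. Calling `flap_trigger(SERIES, floor=70)` shows what the extra clause does in this model: the state drops whenever the latest value is 70 or below, for example at 46.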

    Fleas
    As I mentioned before, the trigger basically works, but every time I follow the test via latestalarms.php, the trigger status switches to unknown from time to time, and I can't figure out why. My environment is zabbix 1.0 on Solaris 10.

    Regards,
    Frank.
  • fab
    Junior Member
    • Jun 2005
    • 22

    #2
    Hi,

    recently I found another flea in the above trigger statement. Whenever the service sees a big change (in our case CPU load going from 13% up to 64% and then back to 13%), the trigger turns on. Not quite what we wanted. So I changed the statement to

    {flap:flap[test].max(180)}>74

    Which is not a real flap detection but reliable. We're taking measurements every 60 seconds so we give the service the chance to stabilize within 3 measurements. The trigger is just used for alarming. For the 'high business view' we use an actionless .last(0)>74 trigger.
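
As a rough check of the two statements, the sketch below compares them on the 13 -> 64 -> 13 case. It is a simplified model with made-up names (`flap_state`, `max_state`), and it assumes that max(180) with one measurement every 60 seconds covers about the last three values.

```python
from typing import List


def flap_state(prev: int, last: int, threshold: int = 74) -> bool:
    """Old statement: prev + abschange > threshold (abschange = |last - prev|)."""
    return prev + abs(last - prev) > threshold


def max_state(values: List[int], samples: int = 3, threshold: int = 74) -> bool:
    """New statement: maximum over the last `samples` values > threshold.

    samples=3 is an assumption standing in for max(180) at a 60 second interval.
    """
    return max(values[-samples:]) > threshold


spike = [13, 64, 13]

if __name__ == "__main__":
    # Rising edge 13 -> 64: 13 + 51 = 64, not above 74.
    print(flap_state(spike[0], spike[1]))
    # Falling edge 64 -> 13: 64 + 51 = 115, above 74 although CPU never reached 75%.
    print(flap_state(spike[1], spike[2]))
    # The maximum of the window is 64, so the new statement stays off.
    print(max_state(spike))
```

In this model the old statement fires on the way back down from 64 to 13, while the max-based one only fires when a value in the window really exceeds 74, and clears once three consecutive values are at or below it.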

    Regards,
    Frank.

    PS: If somebody has implemented flap detection with zabbix 1.0, I would be glad to hear about it.


    • klavs
      Junior Member
      • Apr 2005
      • 18

      #3
      Anybody know if there's automatic flap detection (or just a way to get it) in zabbix 1.2beta? I'm considering zabbix or Nagios (which has automatic flap detection).


      • James Wells
        Senior Member
        • Jun 2005
        • 664

        #4
        Greetings,

        Originally posted by klavs
        Anybody know if there's automatic flap detection (or just a way to get it) in zabbix 1.2beta? I'm considering zabbix or Nagios (which has automatic flap detection).
        At present, no, though this is a feature that people have asked for quite a few times. As shown earlier in the thread, there are ways of doing a kind of pseudo flap detection, but it's not true flap detection just yet.
        Unofficial Zabbix Developer
