Importance of Recovery messages

  • dcampos
    Junior Member
    • May 2015
    • 4

    #1

    Importance of Recovery messages

    Hello!

    I have been using Zabbix for the last 6 years to monitor most of our production (public-facing) environments. Recently I was moved to a new department that has a separate Zabbix server, which monitors another set of internal systems.

    I was surprised to learn that this new team has eliminated ALL recovery messages, on the grounds that doing so cuts the number of notifications in half. This approach does reduce the number of outgoing notifications, but I think it also takes away functionality. The other reason they give is that it forces the on-call to take action every time, which is certainly a great formula to INCREASE alert fatigue.

    I personally find it very important to get recovery messages. It seems kind of obvious: if I get alerts about a system going into a bad condition, I would also like to know whether the system was able to recover or someone fixed it, without having to go and look into Zabbix and the system every time.

    Every organization is different, so I would like to hear your opinions on the relevance of recovery messages in your monitoring.
  • dcampos
    Junior Member
    • May 2015
    • 4

    #2
    Seriously no replies?!?!

    Has anybody disabled recovery messages to lower the amount of notifications for the on-call?

    As mentioned above, I don't like the idea. That's why I would love to hear more opinions...


    • Atsushi
      Senior Member
      • Aug 2013
      • 2028

      #3
      I decide whether to send a recovery message depending on the nature of the problem.

      For example, when I monitor the state of resources such as CPU, I want to know when a resource shortage has improved, so I send a recovery message.

      On the other hand, in the case of log monitoring, the trigger returns to a normal status as soon as a log line that does not match the conditional expression is output, which does not necessarily mean that the situation has improved. So I configure log monitoring not to send a recovery message.
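
      The rule described in this post (recovery messages for resource monitoring, none for log monitoring) could be sketched roughly as follows. This is only an illustration of the decision logic, not Zabbix configuration or API code: the `Alert` class, the category names, and `should_notify` are all hypothetical names made up for this example.

```python
from dataclasses import dataclass

# Hypothetical categories for illustration; these are not Zabbix identifiers.
STATEFUL_CATEGORIES = {"cpu", "memory", "disk"}  # a recovery means the condition really improved
EVENT_CATEGORIES = {"log"}                       # "OK" only means a non-matching line arrived


@dataclass
class Alert:
    category: str    # what kind of check produced the event
    recovered: bool  # False for a problem event, True for a recovery event


def should_notify(alert: Alert) -> bool:
    """Return True if a notification should be sent for this event."""
    if not alert.recovered:
        # Problem events are always sent.
        return True
    # Recovery events are sent only where "back to normal" is meaningful.
    return alert.category in STATEFUL_CATEGORIES


if __name__ == "__main__":
    for alert in (Alert("cpu", False), Alert("cpu", True), Alert("log", True)):
        print(alert, should_notify(alert))
```

      With this split, the on-call still hears about every problem, but recovery notifications are limited to checks where the recovery actually carries information.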


      • dcampos
        Junior Member
        • May 2015
        • 4

        #4
        Atsushi,

        What you described makes total sense... I like it!

        In my case, my new team simply removed ALL the recovery messages, which is very impractical.

        Thanks for sharing!
