Hi
My SNMP host check is flapping - CPU goes over the alert threshold and sends alert notification. Then CPU usage drops, sends OK notification. Then after some time it is activated again, and remediated, and so on.
I want to create a trigger, that will activate on two conditions:
1) the CPU load is over the threshold (that's simple)
AND
2) the CPU load trigger was not activated in the last 10 hours.
This way, I would get only 1/2 alerts per day. The other thing I could use it for is to create another trigger with higher severity status, which would count the number of this triggers and alert if they were more than 10 of them in the past 24 hours.
I tried aggregate functions to count the triggers, but I cannot get them working. Also, there is no such a condition, which would count the trigger activations in the last T seconds.
My SNMP host check is flapping - CPU goes over the alert threshold and sends alert notification. Then CPU usage drops, sends OK notification. Then after some time it is activated again, and remediated, and so on.
I want to create a trigger, that will activate on two conditions:
1) the CPU load is over the threshold (that's simple)
AND
2) the CPU load trigger was not activated in the last 10 hours.
This way, I would get only 1/2 alerts per day. The other thing I could use it for is to create another trigger with higher severity status, which would count the number of this triggers and alert if they were more than 10 of them in the past 24 hours.
I tried aggregate functions to count the triggers, but I cannot get them working. Also, there is no such a condition, which would count the trigger activations in the last T seconds.
Comment