I have an item that performs a ping check and returns 1 if the ping was successful and 0 if the ping failed. I want to know if the state changed more than 5 times in a hour meaning that the device is sometimes not responding.
Here is an example graph of what the problem looks like.

I tried to use the avg() trigger to say if the average up time is less than 90% over 1 hour. This works but the problem is if a device goes actually goes offline during this period it will falsely trigger because the up time is less than 90%.
I thought if I could count the times it changed from a 1 to a 0 or vice versa I would be able to capture the problem. My other thinking is if the average over 1 hour is <90% and this problem persists the whole day then maybe trigger again?
Does anyone have an idea of how to do this? I've looked at the notes for the count() trigger but it doesn't seem to do what I want.
Here is an example graph of what the problem looks like.
I tried to use the avg() trigger to say if the average up time is less than 90% over 1 hour. This works but the problem is if a device goes actually goes offline during this period it will falsely trigger because the up time is less than 90%.
I thought if I could count the times it changed from a 1 to a 0 or vice versa I would be able to capture the problem. My other thinking is if the average over 1 hour is <90% and this problem persists the whole day then maybe trigger again?
Does anyone have an idea of how to do this? I've looked at the notes for the count() trigger but it doesn't seem to do what I want.
Comment