Hi all,
We have periodic (daily) planned reboots on some servers, and I noticed strange, seemingly random behavior of actions:
Day 1:
1) Host goes down (unreachable) --> (the first step of) the action is executed as expected (an external script in this case)
2) Host comes up again (.status AND trigger/event both report "OK") --> the action escalation still runs through all steps as defined in the action
So internally the action thinks the server is down, but everything else tells me that the machine is up.
Day 2 (next reboot):
3) Host goes down again (unreachable) --> (the first step of) the action is executed as expected again (the same external script)
4) Host comes up again (.status AND trigger/event both report "OK") --> the action immediately releases the unreachable state and sends a recovery message to all the recipients who got the 'wrong' escalation in step 2.
Can anybody confirm this?
We are on v1.6.5...