I apologize if this has been fully addressed in another post, but I've spent a significant amount of time searching the Zabbix forums, the Zabbix documentation, and even the book Zabbix 4 Network Monitoring - Third Edition for answers. Unfortunately, I haven't found anything that really answers my questions and, at best, the information I have found is conflicting.
Disclaimer: We're in the process of setting up Zabbix for the first time in our environment so we're rather new to the system.
Server and Agent versions:
- Zabbix Server Version: 4.0.11
- Zabbix Agent Version: 4.0.12
We've successfully delayed problem alert messages by 10 minutes using escalation in our Action. The goal is to account for issues that arise and then resolve themselves within that 10-minute window, so that we only receive alerts for things we deem urgent. Unfortunately, Zabbix still sends recovery messages for problems that resolve within the window, even though no problem message was ever sent.
We would like to only send a recovery message after a problem message is actually sent. I understand that Recovery Operations does not have an escalation option, but it doesn't make much sense to send recovery messages when associated problem messages were never sent.
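To make the behavior we're after concrete, here is a minimal Python sketch. It is not Zabbix configuration and says nothing about how Zabbix works internally; the function name `wanted_messages` and its parameters are made up purely for illustration, and the 10-minute delay matches our escalation step.

```python
from typing import List

DELAY_SECONDS = 10 * 60  # our escalation delay before the first problem message


def wanted_messages(problem_at: int, recovered_at: int) -> List[str]:
    """Messages we would like to receive for one problem that has since recovered.

    Both arguments are timestamps in seconds (hypothetical, for illustration).
    """
    if recovered_at - problem_at < DELAY_SECONDS:
        # Resolved inside the delay window: no problem message was sent,
        # so we do not want a recovery message either.
        return []
    # The problem outlasted the delay, so a problem message went out
    # and the matching recovery message is welcome.
    return ["problem", "recovery"]
```

In other words, a problem that clears after 5 minutes should produce no messages at all, while one that clears after 15 minutes should produce both.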
I've tried using functions like min(10m) in our trigger expression, but the trigger flips to a problem state well before 10 minutes have passed, and both problem and recovery messages are sent immediately. It's possible our trigger expressions aren't set up correctly and/or our testing methodology is flawed.
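For reference, this is roughly what we expected min(10m) compared against a threshold to do, written as a small Python sketch. It models our expectation only, not Zabbix's actual evaluation; the helper names, the sample format, and the threshold are assumptions for illustration.

```python
from typing import List, Optional, Tuple

WINDOW_SECONDS = 10 * 60  # corresponds to the 10m in min(10m)


def min_over_window(samples: List[Tuple[int, float]], now: int) -> Optional[float]:
    """Smallest value among samples timestamped within the last 10 minutes.

    samples is a list of (timestamp_seconds, value) pairs.
    Returns None when no sample falls inside the window.
    """
    recent = [value for ts, value in samples if now - WINDOW_SECONDS < ts <= now]
    return min(recent) if recent else None


def expected_problem(samples: List[Tuple[int, float]], now: int, threshold: float) -> bool:
    """True when every sample in the window is above the threshold."""
    lowest = min_over_window(samples, now)
    return lowest is not None and lowest > threshold
```

One thing the sketch makes visible: if the window holds only a few samples and all of them are above the threshold, the check is already true even though the condition has not lasted a full 10 minutes. I don't know whether that is what is happening in our tests, but it might be related.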
I'm intentionally leaving details out to keep this post from being a million pages long, but I'm happy to provide additional details if needed.
Any assistance is greatly appreciated!
Thank you,
Jason
