Good afternoon,
I am new to Zabbix (3.2.11, working from the browser admin panel) and I am trying to support something I did not set up, so I am not quite grasping how it works.
I have 3 triggers set up for a service that runs on our servers, including an alert that checks whether something is running/listening on a specific port.
If this service and program are not running, I get emails and text messages letting me know it is down. There is also an action attached to the triggers that says: if this trigger is true, restart the service and program.
So then a few mins later I get an OK email and text.
This is happening far too often lately, so I want to change the existing triggers so that they do not email/text. However, I want to create 2 new triggers that will email/text me if the service or port has been down for a longer interval than the current triggers use.
I was hoping someone could help me decipher how these work, how to create a new trigger, and what expressions I should use for longer intervals. I do not understand the timing on these: how often Zabbix checks in, or how long these have to be "down" before Zabbix sends out the alert.
*DISASTER {Template App 2016 DBSync Service:net.tcp.service[tcp,,2507].sum(#5)}=0
*WARNING {Template App 2016 DBSync Service:net.tcp.service[tcp,,2507].sum(#3)}=0
*DISASTER {Template App 2016 DBSync Service:service.info[TTDBSyncServer, state].sum(#3)}>12
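For readers following along, here is a rough Python sketch of how I understand `sum(#N)` to behave, not how Zabbix implements it. It assumes `net.tcp.service` returns 1 when the port accepts a connection and 0 when it does not, and that `sum(#N)` adds up the last N collected values. The check history and the 60-second update interval are made-up values for illustration; the real interval is configured on the item and is not shown in this post.

```python
import math
from typing import Sequence


def sum_last(values: Sequence[int], count: int) -> int:
    """Sum of the most recent `count` values, mimicking sum(#count)."""
    return sum(values[-count:])


def checks_needed(down_seconds: int, update_interval_seconds: int) -> int:
    """How many consecutive checks cover `down_seconds` at a given interval."""
    return math.ceil(down_seconds / update_interval_seconds)


# Hypothetical history, oldest first: 1 = port answered, 0 = port did not answer.
port_checks = [1, 1, 1, 0, 0, 0]

# WARNING expression: sum(#3)=0, true when the last 3 checks all failed.
warning_fires = sum_last(port_checks, 3) == 0

# DISASTER expression: sum(#5)=0, true only when the last 5 checks all failed.
disaster_fires = sum_last(port_checks, 5) == 0

print(warning_fires)   # True
print(disaster_fires)  # False

# With an assumed 60-second update interval, 6 minutes spans 6 checks,
# which would correspond to sum(#6)=0 in the same style of expression.
print(checks_needed(360, 60))  # 6
```

If that reading is right, how long something must be "down" before a trigger fires is N multiplied by the item's update interval, so the interval on the item is the number to look up first.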
I want to keep these triggers as they are, since they are set up to automatically restart the service and application when triggered.
However, I want to create new triggers that send out an email and text only after 6 minutes of being "down" (or, if it is based on intervals, should I make the interval range longer?).
I tried .sum(300)}=0 for a 5-minute interval, but as soon as I did this I got flooded with alerts saying nothing was working when in fact everything was.
Thanks for any help you can provide.