Other than using a more reliable zabbix server network, what can we do to stop getting hundreds of false alerts when the zabbix server network is having problems?
Any network issue that causes zabbix to think the monitored servers are down causes this. If zabbix can't send out emails because of the network issue then the email server keeps trying and we are flooded with emails once it starts working again. Also we get emails when zabbix thinks the servers are ok so then we get flooded with the "ok" emails. We set up a macro that monitors multiple websites and suppresses alerts when it can't reach the websites. That has reduced but not eliminated the problem. There are still scenarios where the false positives occur even though the monitored websites are still accessible.
We also get SMS messages when server down problems are detected, and also when servers come back up. So we get hundreds of false alert SMS messages
One way around this would be to delay major (server down) messages by about a minute so all server down messages in that time can be combined. If it's not a feature or easy to implement with Zabbix macros it probably should be.
Any network issue that causes zabbix to think the monitored servers are down causes this. If zabbix can't send out emails because of the network issue then the email server keeps trying and we are flooded with emails once it starts working again. Also we get emails when zabbix thinks the servers are ok so then we get flooded with the "ok" emails. We set up a macro that monitors multiple websites and suppresses alerts when it can't reach the websites. That has reduced but not eliminated the problem. There are still scenarios where the false positives occur even though the monitored websites are still accessible.
We also get SMS messages when server down problems are detected, and also when servers come back up. So we get hundreds of false alert SMS messages

One way around this would be to delay major (server down) messages by about a minute so all server down messages in that time can be combined. If it's not a feature or easy to implement with Zabbix macros it probably should be.
Comment