Hi,
we're monitoring some service availability from 3 different locations (specifically from 3 VPS around the world). Each location has it's own Zabbix Proxy to cache data in case of Zabbix Server unavailability.
There is one central Trigger defined with expression as follows - it fires only when service is unavailable from all three locations:
({location1:status.regexp(^UP$)}<>1 or {location1:status.nodata(5m)}=1) and ({location2:status.regexp(^UP$)}<>1 or {location2:status.nodata(5m)}=1) and ({location3:status.regexp(^UP$)}<>1 or {location3:status.nodata(5m)}=1)
Everything works fine while there is working internet connection between Zabbix Server and Proxies in their locations. Last week we had some ISP problems, causing trigger to fire and being active for about 7 hours. After internet connection was restored all missing data from 7 hours has been received from proxies (so the monitored services actually worked fine), but when I look at Monitoring - Services there is an SLA degradation for a 7 hours - the data wasn't updated!
This is not what I expected creating separate Zabbix Proxy for each location to prevent internet connection downtimes affecting SLA.
Is this normal behavior? How to avoid it?
we're monitoring some service availability from 3 different locations (specifically from 3 VPS around the world). Each location has it's own Zabbix Proxy to cache data in case of Zabbix Server unavailability.
There is one central Trigger defined with expression as follows - it fires only when service is unavailable from all three locations:
({location1:status.regexp(^UP$)}<>1 or {location1:status.nodata(5m)}=1) and ({location2:status.regexp(^UP$)}<>1 or {location2:status.nodata(5m)}=1) and ({location3:status.regexp(^UP$)}<>1 or {location3:status.nodata(5m)}=1)
Everything works fine while there is working internet connection between Zabbix Server and Proxies in their locations. Last week we had some ISP problems, causing trigger to fire and being active for about 7 hours. After internet connection was restored all missing data from 7 hours has been received from proxies (so the monitored services actually worked fine), but when I look at Monitoring - Services there is an SLA degradation for a 7 hours - the data wasn't updated!
This is not what I expected creating separate Zabbix Proxy for each location to prevent internet connection downtimes affecting SLA.
Is this normal behavior? How to avoid it?
Comment