Surely we must be missing something simple here...
If a server goes down, all our triggers go into the "unknown" state, a state which we cannot alert on.
For example, we have an item that checks whether the Zabbix agent is running, proc.num[zabbix_agentd].
And a trigger {MM_Template_Zabbix_with_zagent:proc.num[zabbix_agentd].last(0)}<1
But if the service is stopped, the trigger comes back unknown and doesn't fire an alert.
Likewise, we have a ping to the host IP, but if the host is down the trigger goes into an unknown status and again doesn't fire an alert.
So how do you actually monitor if a host is down if the items/triggers go into "UNKNOWN" status and you cannot alert on that?