I've added the following trigger to the default Template_Windows template:
nodata(/Template_Windows/agent.ping,180)<>0
The goal is to know when/if the agent becomes unavailable and send a Notification so we can investigate further.
However, despite knowing that the agent isn't becoming unavailable, I'm seeing Problems generated on this trigger for multiple hosts.
The hosts in question are in two groups - each group monitored by a different proxy, and the Queue will often show many items in the 1minute column, but rarely in the 5 minute column. I could assume that these checks are just building up in the queue and triggering the event because the Server is waiting for the latest data, but I'm not sure how to check if that's actually the case... or if it's how the nodata trigger works. I'm also not sure why there's so much build up of items on the proxy - the processes on the proxy all seem to be well under-utilised.
Sample utilisation graph from one proxy:
Is there anything I need to do to tune for nodata to be able to report more accurately?
nodata(/Template_Windows/agent.ping,180)<>0
The goal is to know when/if the agent becomes unavailable and send a Notification so we can investigate further.
However, despite knowing that the agent isn't becoming unavailable, I'm seeing Problems generated on this trigger for multiple hosts.
The hosts in question are in two groups - each group monitored by a different proxy, and the Queue will often show many items in the 1minute column, but rarely in the 5 minute column. I could assume that these checks are just building up in the queue and triggering the event because the Server is waiting for the latest data, but I'm not sure how to check if that's actually the case... or if it's how the nodata trigger works. I'm also not sure why there's so much build up of items on the proxy - the processes on the proxy all seem to be well under-utilised.
Sample utilisation graph from one proxy:
Is there anything I need to do to tune for nodata to be able to report more accurately?
Comment