I have many aggregate checks, like this example:
grpavg["linux","system.cpu.load[,avg1]","last","0"]
This item is on a pseudo host called "Aggregate". It successfully displays an average of every host in the group. However, if a host becomes unavailable or goes offline, that hosts last value will be continue to be displayed in the aggregate check. This is particularly bad because usually a host goes offline because its load was very high.
Since one of the host's last values was high and that host is no longer online, the average for the aggregate is still artificially high.
Is there a way to not include stale data from hosts that are offline? Unfortunately because of a limitation with Discovery Rules I can't add a rule to automatically remove the host since it's not on a subnet I can scan.
grpavg["linux","system.cpu.load[,avg1]","last","0"]
This item is on a pseudo host called "Aggregate". It successfully displays an average of every host in the group. However, if a host becomes unavailable or goes offline, that hosts last value will be continue to be displayed in the aggregate check. This is particularly bad because usually a host goes offline because its load was very high.
Since one of the host's last values was high and that host is no longer online, the average for the aggregate is still artificially high.
Is there a way to not include stale data from hosts that are offline? Unfortunately because of a limitation with Discovery Rules I can't add a rule to automatically remove the host since it's not on a subnet I can scan.


Comment