Hello,
Some services on my network are HA (Highly Available), meaning that a few hosts down within a same cluster is not a problem, as long as at least one host stays up.
As a consequence, I would like to get alerted with a Disaster severity when all hosts in a given cluster (i.e. host group, probably?) become unreachable. How can I do that?
Ideally, I would even be alerted gradually, as availability decreases. For example:
I tried to define an item as such:
but unfortunately, when a host becomes unavailable, there's no value (nodata), as opposed to a value of 0... which means "last" is still 1 for all hosts, and so the sum stays constant.
It sounded easy at first, but it's turning out to be quite tricky! Any idea?
Some services on my network are HA (Highly Available), meaning that a few hosts down within a same cluster is not a problem, as long as at least one host stays up.
As a consequence, I would like to get alerted with a Disaster severity when all hosts in a given cluster (i.e. host group, probably?) become unreachable. How can I do that?
Ideally, I would even be alerted gradually, as availability decreases. For example:
- 50% availability: Average
- 10% availability: High
- 0% availability: Disaster
I tried to define an item as such:
Code:
grpsum["My HA service","agent.ping",last,0]
It sounded easy at first, but it's turning out to be quite tricky! Any idea?
Comment