Ad Widget

**steveboyson** · 08-01-2014, 19:16

As we are having a comparable situation (via DRBD replicated LVM volume groups with either attached iSCSI-daemons or "switching" NFS/Samba resources used as datastores for VMware)

we solved it that way:

- created a new, third IP address for our cluster resources
- created a template "cluster" which checks disk usage etc. on our cluster/shared resources
- assigned that template to our "all-time-on" cluster IP

Then:
- created a template "cluster node"
- defined checks in the template which check heartbeat status, cluster status and all other needed metrics (e.g. when a cluster switch is performed, one cluster goes to "Secondary" while the other goes to "Primary") like cluster counters (read, write, ...), diskstate and what else is needed
- assigned that tempate to the cluster nodes

**gregmurphy** · 08-01-2014, 19:46

Thanks for the suggestion. This is the way I ordinarily would have approached the problem, but as I mentioned in my post my environment is deployed in MS Azure which doesn't allow more than one IP on each host. (Our HTTP connection failover is managed outside of the VMs by the Azure load balancer)

As I'm typing this reply though, its made me think of a way I might be able to implement that approach.

I could maybe configure a Zabbix agent endpoint on this load-balancer and make sure this endpoint fails over along with the HTTP one. Its a bit ugly, but might work, so I'll give it a try.

It would still be great if Zabbix had the concept of a cluster to avoid workarounds like this!

**steveboyson** · 08-01-2014, 23:54

Glad you've found a way. But on the other hand: what is the real relevant part of the whole story?

We decided that it is a running service as seen by our users - so we keep an eye "from the outside" what means our checks behave as if they would live outside of the cluster, not knowing anything about switched resources at all.

Of course we want to know when a cluster node switches. That is why we placed additional items on the cluster nodes to get their metrics as well.

**gregmurphy** · 09-01-2014, 14:58

I quite agree about monitoring the user experience. I think Zabbix is generally an excellent piece of software, but where it falls down (in my opinion) is that it is still too focussed on the server rather than the service.

The workaround of creating a dummy host against a virtual IP works to an extent, but doesn't cover all use cases - for example, true "outside-in" web monitoring of a load-balanced web site couldn't be done against the public IP address of the site without a Zabbix agent being available on that IP - not something you'd want to expose on a public IP address.

Ad Widget

Monitoring Clustered Items

Monitoring Clustered Items

Comment

Comment

Comment

Comment