Hi all,
I am running Zabbix 2.2.3 on my server, proxies, and agents.
I was curious as to what people consider to be a recommended strategy for implementing a failover/cutover strategy for a distributed setup with one server and multiple proxies. In our situation, the proxies are handling most/all of the monitoring and the node itself just collects the monitoring results from the proxies.
In the event that a proxy fails or becomes unresponsive, the hosts that are monitored by that proxy will no longer be monitored (at least, that's my take on it from experimenting around with this in a test setup). I imagine their may be a slight data loss associated with this (any data stored in the proxy's DB will not be sent to the server until it becomes responsive again). Ideally, though, those hosts that are monitored by that unresponsive proxy would be switched dynamically to a standby proxy that will continue to monitor until the first proxy is available again (at which time, perhaps manually, we would move the hosts back over to the original master).
We are currently taking a passive approach (the proxies poll the various agents, no ServerActive setting is being used) so hosts can be easily assigned or switched from one proxy to another. I imagine it might be doable with a proxy heartbeat trigger that activates a script that does the host-to-proxy reassignment, but I wanted to get people's input on this first.
I've seen some details on HA with regard to Zabbix, but nothing indicating HA with regard to proxies (just the server itself and the database).
Thanks in advance,
Ryan Younce
I am running Zabbix 2.2.3 on my server, proxies, and agents.
I was curious as to what people consider to be a recommended strategy for implementing a failover/cutover strategy for a distributed setup with one server and multiple proxies. In our situation, the proxies are handling most/all of the monitoring and the node itself just collects the monitoring results from the proxies.
In the event that a proxy fails or becomes unresponsive, the hosts that are monitored by that proxy will no longer be monitored (at least, that's my take on it from experimenting around with this in a test setup). I imagine their may be a slight data loss associated with this (any data stored in the proxy's DB will not be sent to the server until it becomes responsive again). Ideally, though, those hosts that are monitored by that unresponsive proxy would be switched dynamically to a standby proxy that will continue to monitor until the first proxy is available again (at which time, perhaps manually, we would move the hosts back over to the original master).
We are currently taking a passive approach (the proxies poll the various agents, no ServerActive setting is being used) so hosts can be easily assigned or switched from one proxy to another. I imagine it might be doable with a proxy heartbeat trigger that activates a script that does the host-to-proxy reassignment, but I wanted to get people's input on this first.
I've seen some details on HA with regard to Zabbix, but nothing indicating HA with regard to proxies (just the server itself and the database).
Thanks in advance,
Ryan Younce
Comment