I did a search of the forums and saw that somebody had been running some development for Zabbix HA but this required the 2 ZABBIX servers to be able to share an IP address.
My setup is basically provided by a hosting company with 2 CentOS servers set up physically identical (just with different IP addresses.) I have no control over this network, so I cannot transfer a shared IP between these servers, each server has to keep its own.
Before I migrated to ZABBIX I was going to try and get this running in Nagios but never got around to it. Nagios's way of thinking was to disable active checks on the slave, set up a script to copy each check result over to the slave as they came in (Nagios doesn't use a centralised database
so the slave was always up-to-date.)
If the Nagios slave detects the master has failed, it can then enable active checks on itself and as soon as it sees the master, disable them again.
I'm thinking I can set up an agent check on my ZABBIX slave server to monitor the master. If the master goes down, the slave can then run a client-side script through the agent to start the ZABBIX server.
Similarly as soon as the master comes back up, it can stop the ZABBIX server again.
My question is, does this sound feasible? Or is there an easier way of doing this? Can two instances of the ZABBIX server happily co-exist in the same database. I'm talking 2 stand-alone instances (i.e. both node 0) - the idea being that 99% of the time only one will be running at the same time, but there may be a small window when they both run together until the failover detection kicks in.
My other question is, if the above is feasible, is it possible to utilise the slave server while it's lying dormant to run some of the master's checks on its behalf? (Perhaps using the proxy?)
Any thoughts?
Many thanks,
Andy
My setup is basically provided by a hosting company with 2 CentOS servers set up physically identical (just with different IP addresses.) I have no control over this network, so I cannot transfer a shared IP between these servers, each server has to keep its own.
Before I migrated to ZABBIX I was going to try and get this running in Nagios but never got around to it. Nagios's way of thinking was to disable active checks on the slave, set up a script to copy each check result over to the slave as they came in (Nagios doesn't use a centralised database
so the slave was always up-to-date.) If the Nagios slave detects the master has failed, it can then enable active checks on itself and as soon as it sees the master, disable them again.
I'm thinking I can set up an agent check on my ZABBIX slave server to monitor the master. If the master goes down, the slave can then run a client-side script through the agent to start the ZABBIX server.
Similarly as soon as the master comes back up, it can stop the ZABBIX server again.
My question is, does this sound feasible? Or is there an easier way of doing this? Can two instances of the ZABBIX server happily co-exist in the same database. I'm talking 2 stand-alone instances (i.e. both node 0) - the idea being that 99% of the time only one will be running at the same time, but there may be a small window when they both run together until the failover detection kicks in.
My other question is, if the above is feasible, is it possible to utilise the slave server while it's lying dormant to run some of the master's checks on its behalf? (Perhaps using the proxy?)
Any thoughts?
Many thanks,
Andy

Comment