Hey, did you think about making some kind of HA model for zabbix?
It's important thing when you manage hundreds of machines and i am sure nobody wants zabbix server to be single point of failure.
RAID/double electricity/double switches and so on is nice but..it's not it...
Right now i am thinking of HA configuration
Because of the fact that my servers are spread all over the globe, i am implementing the following WAN and LAN solutions.
WAN:
- rsync for zabbix website per x time. (just in case)
- mysql master - slave model through ssh tunnel
- Once a while to run (crontab ) script on slave server to disable all actions and check availability of master server.
So once master server is N/A (ping/telnet to zabbix server port, whatever)
it switches ON all the actions on slave mysql.
The only disadvantage of this is the master IP where all agents report. going to solve this with LVS ip-to-ip probably.
For LAN i may suggest the following:
- rsync web site
- master - slave model for mysql + IP managed by UCARP
ucarp will switch between mysql servers
- heartbeat for zabbix server IP switch
the advantage of ucarp over lvs is that it supports failover to 1 side only.
So in case master is down and then restored, it won't switch all traffic back to master. It should be done manually. (after you sync data)
Probably my solutions are not perfect..but this is what came to my mind right now when i started to think about zabbix HA over LAN and WAN.
Your suggestions are welcome
Also would be nice to know that zabbix team is planning sometime introduce HA model too.
thanks
It's important thing when you manage hundreds of machines and i am sure nobody wants zabbix server to be single point of failure.
RAID/double electricity/double switches and so on is nice but..it's not it...
Right now i am thinking of HA configuration
Because of the fact that my servers are spread all over the globe, i am implementing the following WAN and LAN solutions.
WAN:
- rsync for zabbix website per x time. (just in case)
- mysql master - slave model through ssh tunnel
- Once a while to run (crontab ) script on slave server to disable all actions and check availability of master server.
So once master server is N/A (ping/telnet to zabbix server port, whatever)
it switches ON all the actions on slave mysql.
The only disadvantage of this is the master IP where all agents report. going to solve this with LVS ip-to-ip probably.
For LAN i may suggest the following:
- rsync web site
- master - slave model for mysql + IP managed by UCARP
ucarp will switch between mysql servers
- heartbeat for zabbix server IP switch
the advantage of ucarp over lvs is that it supports failover to 1 side only.
So in case master is down and then restored, it won't switch all traffic back to master. It should be done manually. (after you sync data)
Probably my solutions are not perfect..but this is what came to my mind right now when i started to think about zabbix HA over LAN and WAN.
Your suggestions are welcome

Also would be nice to know that zabbix team is planning sometime introduce HA model too.
thanks


Comment