I inherited a Zabbix 5 environment that supports 7000 nodes. There are two Zabbix servers (one in each datacenter). The database is backed up daily, transferred to the standby server, and imported, which takes about 12 hours.
If the Zabbix server loses connectivity to the network or the database becomes extremely busy, it generates thousands of tickets.
I need to remove single points of failure and make this HA, so I was leaning towards rebuilding the environment using Zabbix 6.x with redundant DB, Zabbix, and frontend servers.
First question: is this doable across datacenters (DR), or am I tempting fate by risking a split brain if the datacenters lose connectivity?
Or am I better off building out in the main datacenter and just putting a proxy in the smaller datacenter?
Does anyone know of a best-practice architectural document that might cover this?
Thanks