Hello all,
I'm trying to figure out how I can use a distributed monitoring setup to ensure redundancy.
We have 2 data centers and both locations need to get their own Zabbix server.
I've been planning to use distributed monitoring for this and I think I know how to set this up.
One server will me the master node and the other will be the slave node, thus easing the administration by using the master node.
This way, if the connection between the locations gets severed for some reason, monitoring will still continue to work.
What I would like to do, is set up a mechanism so I can use either server as a fall back for the other in case of an hardware failure (let's assume we don't run into both problems at once
).
The best solution I can come up with is to set up MySQL replication of the databases. The database on the master node gets replicated to the slave node and vice versa.
I think I will need to use a different MySQL process for this. So I was thinking of using a process running on port 3306 for the master node and port 3307 for the slave node.
Then I will need to setup 2 zabbix config files on both servers, one with node ID 1 and the database running on port 3306 and one with node ID 2 and the database running on port 3307.
In the init scripts I would start both MySQL instances and just the primary zabbix process for the specific machine. If there is a failure on one of the zabbix servers or it needs to be brought down for maintenance, I can just activate the second zabbix process on the remaining machine.
When the original machine comes back up, I need to replicate the SQL data back to this machine and restore the state of all the processes.
Will this setup work? Is there an easier way, perhaps master-master replication of just the master node database? (saves a lot of space)
Is there anybody on the forum that has experience with a fail over scenario for Zabbix?
Any hints and tips on how I could achieve this would be most welcome!
Thanks in advance,
I'm trying to figure out how I can use a distributed monitoring setup to ensure redundancy.
We have 2 data centers and both locations need to get their own Zabbix server.
I've been planning to use distributed monitoring for this and I think I know how to set this up.
One server will me the master node and the other will be the slave node, thus easing the administration by using the master node.
This way, if the connection between the locations gets severed for some reason, monitoring will still continue to work.
What I would like to do, is set up a mechanism so I can use either server as a fall back for the other in case of an hardware failure (let's assume we don't run into both problems at once
).The best solution I can come up with is to set up MySQL replication of the databases. The database on the master node gets replicated to the slave node and vice versa.
I think I will need to use a different MySQL process for this. So I was thinking of using a process running on port 3306 for the master node and port 3307 for the slave node.
Then I will need to setup 2 zabbix config files on both servers, one with node ID 1 and the database running on port 3306 and one with node ID 2 and the database running on port 3307.
In the init scripts I would start both MySQL instances and just the primary zabbix process for the specific machine. If there is a failure on one of the zabbix servers or it needs to be brought down for maintenance, I can just activate the second zabbix process on the remaining machine.
When the original machine comes back up, I need to replicate the SQL data back to this machine and restore the state of all the processes.
Will this setup work? Is there an easier way, perhaps master-master replication of just the master node database? (saves a lot of space)
Is there anybody on the forum that has experience with a fail over scenario for Zabbix?
Any hints and tips on how I could achieve this would be most welcome!
Thanks in advance,


Comment