Hi,
I am finding a simple way to do the DR, assuming there is a solution like below:
1. Setup two zabbix server, one in product environment, one in backup environment, there is a firewall between two environments, the firewall deny the communication between agent and the backup zabbix server in normal time.
2. Configure both 2 server IP in agent.
3. Do the backup for product zabbix regularly.
4. when product zabbix breaks down, restore the backup data to backup zabbix
5. Open the ports on firewall.
6. the zabbix service comes back.
7. when product zabbix is repaired, close the firewall, back/restore data from backup zabbix to product zabbix.
The solution is easy to implement(the cost is increase RTO/RPO), but there are some questions:
1. As one of the servers is disconnected at anytime, the agent will always retry in the 'timeout' period. Then will it increase the agent process load and effect the host's performance? even effect the agent normal functions?
2. The firewall will be forced to handle a large number of requests, will it be under attack like DDOS
?
Considering these problems, it seems that use FQDN in ServerActive is a better way? When the product zabbix breaks down, only to change the IP from product server to backup server in DNS server.
- Jim
I am finding a simple way to do the DR, assuming there is a solution like below:
1. Setup two zabbix server, one in product environment, one in backup environment, there is a firewall between two environments, the firewall deny the communication between agent and the backup zabbix server in normal time.
2. Configure both 2 server IP in agent.
3. Do the backup for product zabbix regularly.
4. when product zabbix breaks down, restore the backup data to backup zabbix
5. Open the ports on firewall.
6. the zabbix service comes back.
7. when product zabbix is repaired, close the firewall, back/restore data from backup zabbix to product zabbix.
The solution is easy to implement(the cost is increase RTO/RPO), but there are some questions:
1. As one of the servers is disconnected at anytime, the agent will always retry in the 'timeout' period. Then will it increase the agent process load and effect the host's performance? even effect the agent normal functions?
2. The firewall will be forced to handle a large number of requests, will it be under attack like DDOS
?Considering these problems, it seems that use FQDN in ServerActive is a better way? When the product zabbix breaks down, only to change the IP from product server to backup server in DNS server.
- Jim
Comment