Hi! I wanted to share my current setup for a two-node active/active Zabbix installation. It's not perfect, but for now it's the best I was able to do; maybe some of you will be able to help out.
Important note: this setup does not let you use active checks on both servers, only on the first one mentioned in your Zabbix agent config file. (That doesn't bother me, as I don't really use that feature.)
So we have 2 Zabbix servers running Zabbix 1.8.2 with a MySQL DB.
We have about 500 hosts.
We wanted both servers running at the same time since they're at different sites; that way we get better monitoring (something could be seen as down from one site but not from the other).
What I did was enable MySQL replication of the zabbix database from zabbix1 to zabbix2. I limited the replication to the following tables:
applications
autoreg_host
conditions
config
dcchecks
dhosts
drules
dservices
escalations
expressions
functions
globalmacro
graphs
graphs_items
graphs_theme
groups
hosts
hosts_groups
hosts_profiles
hosts_profiles_ext
hosts_templates
hostmacro
httpstep
httpstepitem
httptest
images
items
items_applications
maintenances
maintenances_groups
maintenances_hosts
maintenances_windows
media
opconditions
operations
opmediatypes
profiles
regexps
rights
screens
screens_items
scripts
service_alarms
services
services_links
services_times
sysmaps
sysmaps_elements
sysmaps_link_triggers
sysmaps_links
timeperiods
trigger_depends
triggers
users
users_groups
usrgrp
So basically these are the "configuration" tables.
The other tables were not included because they contain the monitoring data.
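For anyone who wants to reproduce the table filter, here is a minimal sketch that prints one `replicate-do-table` line per table for the slave's MySQL config. It is only an illustration, not necessarily how the filter was set up here: the schema name `zabbix` and the helper name `replicate_do_table_lines` are assumptions.

```python
# Sketch: generate slave-side replication filter lines for my.cnf.
# Assumption: the schema is named "zabbix"; adjust to your own database name.

CONFIG_TABLES = """
applications autoreg_host conditions config dcchecks dhosts drules dservices
escalations expressions functions globalmacro graphs graphs_items graphs_theme
groups hosts hosts_groups hosts_profiles hosts_profiles_ext hosts_templates
hostmacro httpstep httpstepitem httptest images items items_applications
maintenances maintenances_groups maintenances_hosts maintenances_windows media
opconditions operations opmediatypes profiles regexps rights screens
screens_items scripts service_alarms services services_links services_times
sysmaps sysmaps_elements sysmaps_link_triggers sysmaps_links timeperiods
trigger_depends triggers users users_groups usrgrp
""".split()


def replicate_do_table_lines(database, tables):
    """Return one 'replicate-do-table=db.table' option line per table."""
    return ["replicate-do-table=%s.%s" % (database, table) for table in tables]


if __name__ == "__main__":
    # Paste the output under the [mysqld] section on the slave (zabbix2).
    print("[mysqld]")
    for line in replicate_do_table_lines("zabbix", CONFIG_TABLES):
        print(line)
```

Generating the lines from a single list keeps the filter and the table list above in sync when a table is added or dropped.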
So if I change a setting on zabbix1, it gets replicated to zabbix2.
However, I had to limit the replication to a 2-hour window during the night, because running replication all the time renders zabbix2 (the slave) useless.
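The nightly window could be enforced along these lines. This is a rough sketch only: the 01:00 to 03:00 window and the function names are assumptions (the actual hours are not stated above), and it only decides which MySQL statement to issue on the slave; running it from cron and sending the statement to MySQL is left out.

```python
from datetime import time

# Assumed window; the real start and end hours are not given in this post.
WINDOW_START = time(1, 0)
WINDOW_END = time(3, 0)


def in_window(now, start=WINDOW_START, end=WINDOW_END):
    """Return True if 'now' falls inside [start, end), handling midnight wrap."""
    if start <= end:
        return start <= now < end
    # Window crosses midnight, e.g. 23:00 to 01:00.
    return now >= start or now < end


def slave_statement(now):
    """Pick the MySQL statement to run on the slave (zabbix2) at time 'now'."""
    return "START SLAVE;" if in_window(now) else "STOP SLAVE;"


if __name__ == "__main__":
    from datetime import datetime
    print(slave_statement(datetime.now().time()))
```

Both `START SLAVE` and `STOP SLAVE` are safe to repeat (MySQL only issues a warning if the slave is already in that state), so the script can simply be called every few minutes.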
All of this is (I believe) because of the items table.
You see, this table contains columns called lastclock, lastvalue, prevvalue and so on, which are basically monitoring data. When this data is replicated to zabbix2 it causes various problems, like hosts being polled more often than they are supposed to be (because the lastclock value is being replicated), alerts not being sent, etc.
Running it only during the night also has its disadvantages. One of them is that the slave server will check the hosts a lot more often, and any time limit in an item will not work (since the server will think it's a different hour because of lastclock). Any alert that uses avg won't work either: if the server was supposed to check the data once every 5 minutes, it might now do it once every 30 (again because of lastclock).
What I would like is to be able to leave replication running all the time, so that if I change the configuration on my main server (zabbix1) it is automatically replicated to zabbix2.
Anyway, this is just a first post to see what people have for active/active setups, and maybe someone can point me in the right direction.
Also, if anyone is interested, I can post my config in detail.