Hi everybody..
I've recently proposed my colleagues to update Zabbix from 1.8.8 to 1.8.9, but we didn't have time for that until last week.
So.. the thing is: we backuped all data and reinstalled Zabbix with the new version, but when we tried to restore the data, the backup was corrupted.
I've tried a lot of stuff.. nothing worked, but its was not a big problem (I guess) because it was only the Master Node data, which we didn't have much configurations.
I've configured the server again and the hosts are taking data, but the Slave Nodes are not syncing anything with the Master Node.
Logs are showing that the data has been sent from slave and was received from the master, but there is nothing on the master frontend, and we do have some errors (NOT OK) in the logs...
SLAVE:
MASTER:
Am I missing something after the Master update?
I've found some workarounds in the forum, switching the NodeID have fixed the problem for some, but it will take so much time to test here, and we can't get our Slave Nodes in those tests right now, they have such a huge database, using another vital services that can't be paused.
I've recently proposed my colleagues to update Zabbix from 1.8.8 to 1.8.9, but we didn't have time for that until last week.
So.. the thing is: we backuped all data and reinstalled Zabbix with the new version, but when we tried to restore the data, the backup was corrupted.
I've tried a lot of stuff.. nothing worked, but its was not a big problem (I guess) because it was only the Master Node data, which we didn't have much configurations.
I've configured the server again and the hosts are taking data, but the Slave Nodes are not syncing anything with the Master Node.
Logs are showing that the data has been sent from slave and was received from the master, but there is nothing on the master frontend, and we do have some errors (NOT OK) in the logs...
SLAVE:
Code:
16423:20111213:134009.080 NODE 2: Sending history_sync of node 2 to node 1 datalen 2102 16423:20111213:134009.185 NODE 2: Sending history_uint_sync of node 2 to node 1 datalen 536 16423:20111213:134009.515 NODE 2: Sending events of node 2 to node 1 datalen 510018 16423:20111213:134017.950 NOT OK 16423:20111213:134019.344 NODE 2: Sending history_sync of node 2 to node 1 datalen 2625 16423:20111213:134019.466 NODE 2: Sending history_uint_sync of node 2 to node 1 datalen 1037 16423:20111213:134019.766 NODE 2: Sending events of node 2 to node 1 datalen 510018 16423:20111213:134028.175 NOT OK 16423:20111213:134029.524 NODE 2: Sending history_sync of node 2 to node 1 datalen 2495 16423:20111213:134029.651 NODE 2: Sending history_uint_sync of node 2 to node 1 datalen 656 16423:20111213:134029.993 NODE 2: Sending events of node 2 to node 1 datalen 510018 16423:20111213:134038.390 NOT OK 16423:20111213:134039.806 NODE 2: Sending history_sync of node 2 to node 1 datalen 2071 16423:20111213:134039.945 NODE 2: Sending history_uint_sync of node 2 to node 1 datalen 708 16423:20111213:134040.310 NODE 2: Sending events of node 2 to node 1 datalen 510018 16423:20111213:134048.841 NOT OK 16423:20111213:134049.383 NODE 2: Sending history_sync of node 2 to node 1 datalen 2889 16423:20111213:134049.570 NODE 2: Sending history_uint_sync of node 2 to node 1 datalen 623 16423:20111213:134049.987 NODE 2: Sending events of node 2 to node 1 datalen 510018
Code:
12574:20111213:134000.342 NODE 1: Received history from node 2 for node 2 datalen 2102 12574:20111213:134000.437 NODE 1: Received history_uint from node 2 for node 2 datalen 536 12573:20111213:134009.077 NODE 1: Received events from node 2 for node 2 datalen 510018 12574:20111213:134010.620 NODE 1: Received history from node 2 for node 2 datalen 2625 12574:20111213:134010.728 NODE 1: Received history_uint from node 2 for node 2 datalen 1037 12573:20111213:134019.327 NODE 1: Received events from node 2 for node 2 datalen 510018 12574:20111213:134020.809 NODE 1: Received history from node 2 for node 2 datalen 2495 12575:20111213:134020.933 NODE 1: Received history_uint from node 2 for node 2 datalen 656 12574:20111213:134029.542 NODE 1: Received events from node 2 for node 2 datalen 510018 12573:20111213:134031.068 NODE 1: Received history from node 2 for node 2 datalen 2071 12575:20111213:134031.200 NODE 1: Received history_uint from node 2 for node 2 datalen 708 12573:20111213:134039.975 NODE 1: Received events from node 2 for node 2 datalen 510018 12575:20111213:134040.690 NODE 1: Received history from node 2 for node 2 datalen 2889 12574:20111213:134040.824 NODE 1: Received history_uint from node 2 for node 2 datalen 623 12575:20111213:134049.817 NODE 1: Received events from node 2 for node 2 datalen 510018 12573:20111213:134050.393 NODE 1: Received history from node 2 for node 2 datalen 2274 12574:20111213:134050.513 NODE 1: Received history_uint from node 2 for node 2 datalen 551
I've found some workarounds in the forum, switching the NodeID have fixed the problem for some, but it will take so much time to test here, and we can't get our Slave Nodes in those tests right now, they have such a huge database, using another vital services that can't be paused.

Comment