Ad Widget

**chrisf** · 24-07-2009, 01:20

Additional info

Looks like this is an issue with the configs getting out of sync.
But I have no idea how as I make all config changes on the master node.

I've dropped the Debug level to 3 and what I see is:

16820:20090723:192157 NODE 1: Received data from slave node 2 for node 2 datalen 3416571

Here's some dupe info:
15078:20090723:185542 [Z3005] Query failed: [1062] Duplicate entry '200200000010073-vfs.fs.inode[/,free]' for key 2 [insert into items (itemid,type,snmp_community,snmp_oid,snmp_port,hos tid,description,key_,delay,history,trends,status,v alue_type,trapper_hosts,units,multiplier,delta,snm pv3_securityname,snmpv3_securitylevel,snmpv3_authp assphrase,snmpv3_privpassphrase,formula,error,logt imefmt,templateid,valuemapid,delay_flex,params,ipm i_sensor) values (200200000026608,0,'','',161,200200000010073,'Free number of inodes on $1','vfs.fs.inode[/,free]',60,7,365,0,3,'','',0,0,'',0,'','','0','','',2001 00000000030,0,'','','')]

15078:20090723:185522 [Z3005] Query failed: [1062] Duplicate entry '200100000000056-200100000000001' for key 2 [insert into hosts_templates (hosttemplateid,hostid,templateid) values (200200000000090,200100000000056,200100000000001)]

Now I tried deleting everything from these tables and restarting both systems to see if the sync would restore the data from the slave to the master... then I could restore the dump I made and ignore the failed inserts.

But after a restart the tables remained empty. Which makes me believe the Master is no longer attempting to insert the data from the Slave.

I verified this by restoring the items and hosts_templates tables and restart the master and slave again.

We see the familiar:
16820:20090723:192157 NODE 1: Received data from slave node 2 for node 2 datalen 3416571

But I no longer see the duplicate insert errors, which would mean the master is ignoring the config from the slave.

Can someone point me in the right direction here? Is there a procedure to delete the child and "resync"?

**xaeth** · 24-07-2009, 17:48

we just started having this issue this week as well. any responses would be great.

Another we were getting with it is:

NODE 1: Error while receiving answer from Node [2] error: ZBX_TCP_READ() failed [Interrupted system call]

and

12475:20090724:114721 NODE 2: Unable to connect to Node [1] error: Cannot connect to [156.132.82.26:10051] [Connection refused]
12475:20090724:115037 NODE 2: Error while receiving answer from Node [1] error: ZBX_TCP_READ() failed [Connection reset by peer]
12475:20090724:115107 NODE 2: Error while sending data to Node [1] error: ZBX_TCP_WRITE() failed [Connection reset by peer]
12475:20090724:115107 NODE 2: Unable to connect to Node [1] error: Cannot connect to [156.132.82.26:10051] [Connection refused]

The network connectivity between the 2 nodes looks fine, so this is very confusing.

Ad Widget

Node 1 and Node 2 fail to sync history

Node 1 and Node 2 fail to sync history

Comment

Comment