Ad Widget

Collapse

version 1.6.3 Distributed Node

Collapse
X
 
  • Time
  • Show
Clear All
new posts
  • ataylo13
    Senior Member
    • Feb 2007
    • 122

    #1

    version 1.6.3 Distributed Node

    I am seeing an issue when the remote node fires it sends config changes

    Remote:
    30345:20090327:180325 NODE 4: Sending configuration changes to master node 1 for node 4 datalen 11504217

    Master
    21645:20090327:180326 NODE 1: Received data from slave node 4 for node 4 datalen 11504216


    After that i see no updates between nodes... If I restart the remote node I see:

    Remote:
    31160:20090327:181939 NODE 4: Sending configuration changes to master node 1 for node 4 datalen 11506590

    Master:
    Nothing

    If I restart the master and then the remote node I will see similar logs, then nothing.


    I was seeing regular traffic between the two nodes before I added a bunch of servers to the remote node.

    Any/Ideas?
    Version : 1.8.8
    Current Configuration 1 Master and 3 Child Nodes
  • ataylo13
    Senior Member
    • Feb 2007
    • 122

    #2
    When I restarted the master this came up in the logs on the slave:

    31160:20090327:182702 NODE 4: Error while receiving answer from Node [1] error: ZBX_TCP_READ() failed [Connection reset by peer]
    31160:20090327:182702 NODE 4: Sending history_sync of node 4 to node 1 datalen 345912
    31160:20090327:182704 NODE 4: Error while sending data to Node [1] error: ZBX_TCP_WRITE() failed [Connection reset by peer]
    31160:20090327:182704 NODE 4: Sending history_uint_sync of node 4 to node 1 datalen 345126
    31160:20090327:182704 NODE 4: Unable to connect to Node [1] error: Cannot connect to [10.34.66.195:10051] [Connection refused]
    31160:20090327:182704 NODE 4: Sending history_str_sync of node 4 to node 1 datalen 13628
    31160:20090327:182704 NODE 4: Unable to connect to Node [1] error: Cannot connect to [10.34.66.195:10051] [Connection refused]


    I put the Master in debug and the only thing that jumps out at me is every 10s this pops up in the logs:

    Starting sync with nodes
    Last edited by ataylo13; 28-03-2009, 01:31.
    Version : 1.8.8
    Current Configuration 1 Master and 3 Child Nodes

    Comment

    • borsss
      Junior Member
      • Nov 2008
      • 3

      #3
      same problem here

      hi,

      very same problem here, slave is 164 upgraded from 163, master is 164 (native)

      on slave:
      29472:20090506:123559 NODE 2: Sending configuration changes to master node 1 for node 2 datalen 13267145

      on master:
      22828:20090506:123559 NODE 1: Received data from slave node 2 for node 2 datalen 13267144

      and then nothing.

      Any ideas?...

      Comment

      • xs-
        Senior Member
        Zabbix Certified Specialist
        • Dec 2007
        • 393

        #4
        similar here.

        Hoping for a fix from devs soon.
        I have a child node out of sync for a month now =\

        Comment

        • borsss
          Junior Member
          • Nov 2008
          • 3

          #5
          fugured it out.

          initial config sync was taking very long (I have 600 hosts).

          I was misreading the debug info..

          Comment

          Working...