Zabbix agent item on host failed: network error

  • Wasabi-RSM
    Junior Member
    • Feb 2014
    • 11

    #1

    Zabbix agent item on host failed: network error

    Hello Zabbix Community,

    I have a problem, which has been posted multiple times before, but none of those solutions worked for my scenario.

    My Zabbix server, hosted by a "Cloud Provider", is running fine, but a lot of the above-mentioned errors keep popping up in the /var/log/zabbix/zabbix_server.log file. Everything is working (apart from my service discovery), so it's not as if something is "broken", but the error keeps appearing, and a few seconds later a message indicating that the connection has been restored pops up.

    Would you please help me with this??
  • aib
    Senior Member
    • Jan 2014
    • 1615

    #2
    Try to play with Timeout

    Code:
    vi /etc/zabbix/zabbix_server.conf
    
    
    ### Option: Timeout
    #       Specifies how long we wait for agent, SNMP device or external check (in seconds).
    #
    # Mandatory: no
    # Range: 1-30
    # Default:
    # Timeout=3
    Sincerely yours,
    Aleksey

    • Wasabi-RSM
      Junior Member
      • Feb 2014
      • 11

      #3
      I've done what you suggested and changed the Timeout to 30 seconds. I've also removed all the hosts and had them rediscover themselves. (It's a new server, so I can afford to lose the logging data for now...)

      Only time will tell if that worked - will update if not :-)

      Thanks for your help buddy!

      Originally posted by aib
      Try to play with Timeout

      Code:
      vi /etc/zabbix/zabbix_server.conf
      
      
      ### Option: Timeout
      #       Specifies how long we wait for agent, SNMP device or external check (in seconds).
      #
      # Mandatory: no
      # Range: 1-30
      # Default:
      # Timeout=3

      • tchjts1
        Senior Member
        • May 2008
        • 1605

        #4
        Originally posted by Wasabi-RSM
        I've done what you suggested and changed the Timeout to 30 seconds.
        Just make sure that the new value of 30 is not preceded by a comment character "#". Anything commented out is ignored. I leave the defaults as they are, then I add a new line for anything I modify so I can keep track of what the defaults were. So you might want to have it look like this:

        (And don't forget to restart Zabbix server process after you make any conf changes)


        ### Option: Timeout
        # Specifies how long we wait for agent, SNMP device or external check (in seconds).
        #
        # Mandatory: no
        # Range: 1-30
        # Default:
        # Timeout=3
        Timeout=30
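
To double-check that the new value is really active and not commented out, something like the following minimal Python sketch could help. The function name `active_values` and the sample text are made up for illustration; it only assumes the `Name=value` layout shown above and does not claim to mirror how Zabbix itself parses the file.

```python
def active_values(conf_text, name):
    """Return every uncommented value assigned to `name`, in file order."""
    values = []
    for line in conf_text.splitlines():
        stripped = line.strip()
        # Blank lines and lines starting with '#' are comments and are skipped.
        if not stripped or stripped.startswith("#"):
            continue
        key, sep, value = stripped.partition("=")
        if sep and key.strip() == name:
            values.append(value.strip())
    return values


# Hypothetical sample mirroring the snippet in this post.
SAMPLE_CONF = """\
### Option: Timeout
# Mandatory: no
# Range: 1-30
# Default:
# Timeout=3
Timeout=30
"""

if __name__ == "__main__":
    print(active_values(SAMPLE_CONF, "Timeout"))
```

An empty list would mean the setting is still commented out, so the server keeps using its default. To check the real file, read /etc/zabbix/zabbix_server.conf and pass its contents in instead of the sample.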

        • Wasabi-RSM
          Junior Member
          • Feb 2014
          • 11

          #5
          Originally posted by tchjts1
          Just make sure that the new value of 30 is not preceded by a comment character "#". Anything commented out is ignored. I leave the defaults as they are, then I add a new line for anything I modify so I can keep track of what the defaults were. So you might want to have it look like this:

          (And don't forget to restart Zabbix server process after you make any conf changes)


          ### Option: Timeout
          # Specifies how long we wait for agent, SNMP device or external check (in seconds).
          #
          # Mandatory: no
          # Range: 1-30
          # Default:
          # Timeout=3
          Timeout=30
          I've done what was suggested, and it seemed to work for a bit, but then the errors started again:

          Code:
          Zabbix agent item "net.if.out[vmxnet3 Ethernet Adapter-WFP LightWeight Filter-0000]" on host "xxxxx" failed: first network error, wait for 15 seconds
          resuming Zabbix agent checks on host "xxxxx": connection restored

          • tchjts1
            Senior Member
            • May 2008
            • 1605

            #6
            Take a look at the last paragraph and the graphs in this post:


            If you look at yours (or post them here), it may identify any additional adjustments that could help you out.

            • Wasabi-RSM
              Junior Member
              • Feb 2014
              • 11

              #7
              Hello,

              As requested, here are my graphs... My data gathering processes were very busy, running at 100%, but that was resolved after I followed some instructions on how to start more pollers and increase max_connections for MySQL.

              (PS: I'm running Zabbix Server 2.2.2 and Agents 2.2.1)



              Originally posted by tchjts1
              Take a look at the last paragraph and the graphs in this post:


              If you look at yours (or post them here), it may identify any additional adjustments that could help you out.
              Attached Files

              • Wasabi-RSM
                Junior Member
                • Feb 2014
                • 11

                #8
                Here is a larger section of the server log, with time intervals etc.:

                (They contain the same type of error, for 3 different hosts - the names have been substituted with "a", "b" and "c".)

                Code:
                12111:20140221:064759.374 housekeeper [deleted 0 hist/trends, 0 items, 0 events, 0 sessions, 0 alarms, 0 audit items in 0.008322 sec, idle 1 hour(s)]
                12077:20140221:065940.685 Zabbix agent item "vfs.fs.size[H:,used]" on host "aaaaa" failed: first network error, wait for 15 seconds
                12100:20140221:065955.617 resuming Zabbix agent checks on host "aaaaa": connection restored
                12084:20140221:070005.183 Zabbix agent item "perf_counter["\230(_Total)\952"]" on host "aaaaa" failed: first network error, wait for 15 seconds
                12050:20140221:070005.710 Zabbix agent item "perf_counter["\234(_Total)\1400"]" on host "bbbbb" failed: first network error, wait for 15 seconds
                12095:20140221:070007.211 Zabbix agent item "perf_counter["\234(_Total)\210"]" on host "ccccc" failed: first network error, wait for 15 seconds
                12100:20140221:070021.543 resuming Zabbix agent checks on host "aaaaa": connection restored
                12100:20140221:070022.341 resuming Zabbix agent checks on host "bbbbb": connection restored
                12100:20140221:070023.146 resuming Zabbix agent checks on host "ccccc": connection restored
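
To see how long each host stays flagged before the connection is restored, the log lines above could be paired up with a rough Python sketch like this one. The regular expressions only assume the message wording and the `PID:YYYYMMDD:HHMMSS.mmm` prefix visible in this excerpt; the names `outages`, `FAIL`, `RESTORE`, and `SAMPLE_LOG` are hypothetical helpers, not part of Zabbix.

```python
import re
from datetime import datetime

# Patterns based only on the message wording shown in the excerpt above.
FAIL = re.compile(
    r'^\s*\d+:(\d{8}:\d{6}\.\d{3}) Zabbix agent item ".*" on host "([^"]+)" '
    r'failed: first network error'
)
RESTORE = re.compile(
    r'^\s*\d+:(\d{8}:\d{6}\.\d{3}) resuming Zabbix agent checks on host '
    r'"([^"]+)": connection restored'
)


def parse_ts(stamp):
    """Parse a 'YYYYMMDD:HHMMSS.mmm' timestamp from the server log."""
    return datetime.strptime(stamp, "%Y%m%d:%H%M%S.%f")


def outages(log_text):
    """Pair each first network error with the next restore for the same host.

    Returns a list of (host, seconds_between_error_and_restore) tuples.
    """
    pending = {}  # host -> time of the first unresolved error
    result = []
    for line in log_text.splitlines():
        match = FAIL.match(line)
        if match:
            pending.setdefault(match.group(2), parse_ts(match.group(1)))
            continue
        match = RESTORE.match(line)
        if match and match.group(2) in pending:
            started = pending.pop(match.group(2))
            gap = (parse_ts(match.group(1)) - started).total_seconds()
            result.append((match.group(2), gap))
    return result


# Four lines copied from the excerpt above (host names already anonymized).
SAMPLE_LOG = r'''
12077:20140221:065940.685 Zabbix agent item "vfs.fs.size[H:,used]" on host "aaaaa" failed: first network error, wait for 15 seconds
12100:20140221:065955.617 resuming Zabbix agent checks on host "aaaaa": connection restored
12050:20140221:070005.710 Zabbix agent item "perf_counter["\234(_Total)\1400"]" on host "bbbbb" failed: first network error, wait for 15 seconds
12100:20140221:070022.341 resuming Zabbix agent checks on host "bbbbb": connection restored
'''

if __name__ == "__main__":
    for host, seconds in outages(SAMPLE_LOG):
        print(f"{host}: restored after {seconds:.3f} s")
```

If the gaps all sit close to the 15-second wait mentioned in the error message, the agent is probably answering again on the very next retry, which would point at brief timeouts rather than hosts that are really down.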
