Ad Widget

Collapse

Zabbix randomly stops collecting data

Collapse
X
 
  • Time
  • Show
Clear All
new posts
  • tsit
    Junior Member
    • Aug 2023
    • 4

    #1

    Zabbix randomly stops collecting data

    Hello,

    we have the following problem: we recently started using Zabbix to monitor our network and servers. However, at some point no data is collected, like today at 03:00 at night. At some point the server starts collecting data again, but not completely either. This error is gone after a reboot of the machine on which Zabbix is running. This problem occurs every night, at different times. We first suspected that this is due to the VMware snapshot that Veeam creates to back up the virtual machine. However, we were able to rule this out when we excluded the monitoring VM from the backup as a test. ICMP pings are also not affected, they continue to work normally.



    Best regards​
  • MADxHAWK
    Member
    • Apr 2011
    • 41

    #2
    Are you maybe monitoring a Cisco "Small Buisiness" switch?
    With those it seems to be a common Problem as you can see below
    So far i never saw this behavior on any other Cisco devices or Linux Server im monitoring

    You can try to increase the timeouts in /etc/zabbix/zabbix_server.conf

    Code:
    ### Option: Timeout
    #    Specifies how long we wait for agent, SNMP device or external check (in seconds).
    #
    # Mandatory: no
    # Range: 1-30
    # Default:
    # Timeout=3
    
    Timeout=10
    ​​
    There are also timeout settings for VM-Ware and Trapper.

    This might help (well it did not for Cisco Small Buisiness though, but i think thats a Cisco Small Buisiness firmware Problem not a problem with Zabbix)

    ​​Click image for larger version

Name:	image.png
Views:	1535
Size:	40.1 KB
ID:	468986

    Comment

    • tsit
      Junior Member
      • Aug 2023
      • 4

      #3
      Thank you for the quick response.

      If my problem occurs, it affects all sensors across all hosts except for icmp pings for multiple hours.

      We are not using any Cisco hardware at all.

      Comment

      • MADxHAWK
        Member
        • Apr 2011
        • 41

        #4
        Thats strange. It looks like zabbix get no response or a not expected response from the item monitored and disable the monitoring:
        Can you check the logfiles ( /var/log/zabbix/zabbix_server.log on linux) for entries like

        Code:
        [timestamp] item "Zabbix - Replication Slave - MySQL:mysql.seconds_behind_master["xx.xx.xx.xx"]" became not supported: Value "" of type "string" is not suitable for value type "Numeric (unsigned)"
        after some time you will see

        Code:
        [timestamp] item "Zabbix - Replication Slave - MySQL:mysql.seconds_behind_master["xx.xx.xx.xx"]" became supported


        In this case Zabbix received for what reason ever a string or empty value instead of a numeric value.

        This might give a clue why the data is not received or stored for some time.
        Last edited by MADxHAWK; 21-08-2023, 09:09.

        Comment

        • tsit
          Junior Member
          • Aug 2023
          • 4

          #5
          it affects all sensors across all hosts. zabbix just stops collecting data for everything except for ICMP.

          only messages i have in log are the following
          Code:
             667:20230821:082354.408 cannot send list of active checks to "127.0.0.1": host [Zabbix server] not found
          but these doesnt seem to affect the missing data of the whole monitoring

          Comment

        Working...