Update interval is fluctuating a lot

  • aib
    Senior Member
    • Jan 2014
    • 1615

    #1

    Hello!
    Our Zabbix infrastructure has ~10 proxies monitoring >4,000 hosts / >1 million items.

    We are using "SNMP agent" items to pull information from Cisco devices every minute.
    Quite often, the actual polling interval deviates dramatically from the configured one (the shortest interval I noticed was 6 seconds, the longest was 1 minute 59 seconds).
    Unfortunately, some Cisco devices have an SNMP cache that updates only once every 30 seconds.
    Therefore, if Zabbix requests the value less than 30 seconds after the previous poll, the device returns the same number, and after preprocessing (multiply by 8 and store the change per second) the wrong speed (zero bps) is saved in the database.
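The zero-bps effect described above can be reproduced with a short simulation. This is only a sketch under stated assumptions: the 30-second cache, the traffic rate of 1,000,000 octets per second, the poll times, and the names `cached_counter` and `change_per_second` are all made up for illustration, and the rate formula is the usual change-per-second definition (value delta divided by time delta), not code taken from Zabbix.

```python
CACHE_SECONDS = 30            # assumed SNMP cache refresh period on the device
OCTETS_PER_SECOND = 1_000_000  # assumed constant traffic, for illustration only


def cached_counter(t):
    """Counter value the device reports at time t (seconds).

    The device only refreshes its cache every CACHE_SECONDS, so the
    reported value is the true counter as of the last refresh.
    """
    last_refresh = (t // CACHE_SECONDS) * CACHE_SECONDS
    return OCTETS_PER_SECOND * last_refresh


def change_per_second(samples, multiplier=8):
    """Turn (timestamp, raw_octets) samples into (timestamp, bits_per_second).

    Mirrors the preprocessing described in the post: multiply by 8,
    then divide the value delta by the time delta.
    """
    rates = []
    for (t_prev, v_prev), (t_cur, v_cur) in zip(samples, samples[1:]):
        elapsed = t_cur - t_prev
        if elapsed <= 0:
            continue  # skip duplicate or out-of-order timestamps
        rates.append((t_cur, (v_cur - v_prev) * multiplier / elapsed))
    return rates


# Hypothetical poll times: one normal 60 s gap, then a 6 s gap, then a 119 s gap.
poll_times = [0, 60, 66, 185]
samples = [(t, cached_counter(t)) for t in poll_times]
rates = change_per_second(samples)

for timestamp, bps in rates:
    print(f"t={timestamp:>4}s  {bps:,.0f} bps")
```

In this run the poll at t=66 lands inside the same cache window as the poll at t=60, so the raw counter has not moved and the computed rate is zero. The following sample then spreads the missing traffic over its own interval, which makes that value slightly too high.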

    I would like to know how to narrow the range within which the update interval can vary, or even disable that behavior entirely, so that the configured update interval (1 minute) is always used without being increased or decreased.
    Sincerely yours,
    Aleksey
  • aib
    Senior Member
    • Jan 2014
    • 1615

    #2
    Here are a couple of screenshots that support my initial post.
    1. Update interval configured for interface HundredGigE0/0/0/30
    [Screenshot: image (47).png]
    2. A small piece of history showing two items together: "Change-per-second" and the "RAW" data from the SNMP poll.
    [Screenshot: image (46).png]
    Sincerely yours,
    Aleksey

    • Markku
      Senior Member
      Zabbix Certified Specialist, Zabbix Certified Professional, Zabbix Certified Expert
      • Sep 2018
      • 1781

      #3
      How is your StartPollers configured?

      What does the poller utilization look like?

      My understanding is that if the pollers are too busy, they have to flex the polling intervals for these passive items.

      Markku

      • aib
        Senior Member
        • Jan 2014
        • 1615

        #4

        The utilization graph for the proxy over the last two days is attached.
        Average load = 82%
        Min load = 4%
        Max load = 100%
        [Attached graph: image.png]
        Sincerely yours,
        Aleksey

        • Markku
          Senior Member
          Zabbix Certified Specialist, Zabbix Certified Professional, Zabbix Certified Expert
          • Sep 2018
          • 1781

          #5
          I would add some more pollers. How is your StartPollers configured at the moment?

          Markku

          • aib
            Senior Member
            • Jan 2014
            • 1615

            #6
            StartPollers=400
            StartPollersUnreachable=20
            Sincerely yours,
            Aleksey

            • LenR
              Senior Member
              • Sep 2009
              • 1005

              #7
              What version of Zabbix? What database? Are you partitioned? Get the graphs of the Zabbix processes too. What is the Zabbix NVPS?

              Are those load graphs from the Zabbix server? How is the load on the proxies?

              • aib
                Senior Member
                • Jan 2014
                • 1615

                #8
                Zabbix version is 5.0.17
                We are using PostgreSQL
                Yes, we are using partitions
                Zabbix NVPS ~13,000
                Those graphs are from the Zabbix proxy that monitors the host with the polling interval issue
                Sincerely yours,
                Aleksey

                • Markku
                  Senior Member
                  Zabbix Certified Specialist, Zabbix Certified Professional, Zabbix Certified Expert
                  • Sep 2018
                  • 1781

                  #9
                  So, my understanding is that the pollers (as configured with StartPollers) are responsible for doing the SNMP polling and assigning the timestamps to the results. The poller utilization graph from the proxy showed that the pollers are very heavily utilized. Thus, the number of pollers should be increased, provided that the CPU capacity is sufficient, too.
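As a rough way to put a number on "more pollers", the figures posted in this thread (StartPollers=400, 82% average utilization) can be fed into a simple proportional estimate. This is a back-of-the-envelope sketch, not a Zabbix formula: it assumes the busy percentage scales linearly with the number of pollers, the 50% target is an arbitrary example, `pollers_for_target` is a made-up helper name, and the cap of 1000 is an assumed upper limit for StartPollers that should be verified against the documentation for your version.

```python
import math


def pollers_for_target(current_pollers, avg_busy_pct, target_busy_pct,
                       max_pollers=1000):
    """Estimate a StartPollers value that would bring average busy time
    down to target_busy_pct, assuming load scales linearly with pollers.

    max_pollers is an assumed configuration limit, not a verified one.
    """
    if not 0 < target_busy_pct <= 100:
        raise ValueError("target_busy_pct must be in (0, 100]")
    if current_pollers <= 0 or avg_busy_pct < 0:
        raise ValueError("current_pollers must be > 0 and avg_busy_pct >= 0")
    estimate = math.ceil(current_pollers * avg_busy_pct / target_busy_pct)
    return min(estimate, max_pollers)


# Values taken from the thread; the 50% target is only an example.
suggested = pollers_for_target(current_pollers=400,
                               avg_busy_pct=82,
                               target_busy_pct=50)
print(f"StartPollers={suggested}")
```

The estimate only covers average load. Since the graph also shows peaks at 100%, some extra headroom and a check of CPU capacity on the proxy would still be needed.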

                  Anyway, in a system this large, all the Zabbix internal metrics should be fully monitored and understood in order to keep it running properly.

                  Markku

                  • LenR
                    Senior Member
                    • Sep 2009
                    • 1005

                    #10
                    What are the server specs for the problem proxy? Can you add more processors? See what processes are using the CPU on that proxy.
