Ad Widget

Collapse

Large queue, do not understand why

Collapse
X
 
  • Time
  • Show
Clear All
new posts
  • misch
    Junior Member
    • Jan 2010
    • 28

    #1

    Large queue, do not understand why

    Hi,

    I have a distributed Zabbix setup. Basically it is a very moderate setup wiht only 10 vps. So nothing big. But the queue is piling up and I see more than 1000 checks waiting for more than 10 minutes.

    Every 5 minutes or 1 hour Zabbix fires up the checks and it seems the proxies cannot cope the load and slowly start working on the items. Many checks "bubbles" up the queue and finaly end up in the "more than 10 minutes" bin. Within the one hour the queue is worked at an after one hour the queue is nearly empty again. In the graphs I get a very nice saw-tooth like picture.

    Of course I googled and tried to improve the things. I started a lot of pollers and the percantage of pollers occipied is less than 10%. Also ntp is installed on all systems. Our system is Zabbix 2.2.2.

    Any ideas what else is going wrong?
    Why does Zabbix not distribute the checks evenly accross the one hour time frame?

    Thanks for any hint.

    Michael Schwartzkopff.
  • ingus.vilnis
    Senior Member
    Zabbix Certified Trainer
    Zabbix Certified SpecialistZabbix Certified Professional
    • Mar 2014
    • 908

    #2
    Hi Michael,

    Would you mind to share your zabbix_server.conf and zabbix_proxy.conf files?

    Might be something with Data sender frequencies but need to make sure.

    Best Regards,
    Ingus

    Comment

    • Vlaad
      Junior Member
      • Mar 2014
      • 4

      #3
      Hi Michael,

      Check in your Zabbix log file records like:
      Code:
        5897:20150219:031600.383 value cache is fully used: please increase ValueCacheSize configuration parameter

      Comment

      • misch
        Junior Member
        • Jan 2010
        • 28

        #4
        hi,

        nothing about "cache" in the logs.

        The config of the server is:
        LogFile=/var/log/zabbix/zabbix_server.log LogFileSize=128 PidFile=/run/zabbix/zabbix_server.pid DBHost=db.example.de
        DBName=zabbix
        DBUser=zabbix
        DBPassword=XXXXXXXXX
        StartPollersUnreachable=10
        StartPingers = 10
        HousekeepingFrequency=4
        Timeout=30
        AlertScriptsPath=/var/lib/zabbix/alertscripts
        ExternalScripts=/var/lib/zabbix/externalscripts

        The config of the proxy is:
        Server=server.example.de
        ServerPort=10051
        Hostname=zabbixproxy
        LogFile=/var/log/zabbix-proxy/zabbix_proxy.log
        LogFileSize=100
        DebugLevel=3
        PidFile=/var/run/zabbix/zabbix_proxy.pid
        DBHost=127.0.0.1
        DBName=zabbix
        DBUser=zabbix
        DBPassword=XXXXXXXX
        ConfigFrequency=300
        StartPollers=64
        StartPingers=10
        HousekeepingFrequency=3
        Timeout=30
        FpingLocation=/usr/bin/fping
        Fping6Location=/usr/bin/fping6

        I have version 2.2.2 (wheezy backports) installed.

        Michael Schwartzkopff

        Comment

        • dirckcopeland
          Member
          • Oct 2013
          • 50

          #5
          Large queue, do not understand why

          Michael ,
          I am basically seeing the same thing. I'm also running 2.2.2. It just started yesterday for me and I'm not sure why. Did you ever get a resolution to your large queue? Here is a copy of my configuration for reference:
          DBName=zabbix
          DBUser=zabbix
          StartPollers=75
          StartPollersUnreachable=20
          StartPingers=10
          StartDiscoverers=50
          JavaGateway=127.0.0.1
          JavaGatewayPort=10052
          StartJavaPollers=5
          StartSNMPTrapper=1
          SNMPTrapperFile=/tmp/zabbix_traps.tmp
          CacheSize=128M
          HistoryCacheSize=512M
          TrendCacheSize=128M
          Timeout=30
          UnreachablePeriod=120
          AlertScriptsPath=/etc/zabbix/alert.d/
          ExternalScripts=/etc/zabbix/externalscripts
          FpingLocation=/usr/bin/fping
          Fping6Location=/usr/bin/fping6
          LogSlowQueries=5000
          ProxyConfigFrequency=600
          StartVMwareCollectors = 1
          VMwareCacheSize = 8M
          VMwareFrequency = 60
          Last edited by dirckcopeland; 05-03-2015, 20:25.

          Comment

          • dirckcopeland
            Member
            • Oct 2013
            • 50

            #6
            Large queue, do not understand why

            sorry folks, false alarm for my issue with large queue was due to our systems being migrated to another data center.

            Comment

            • misch
              Junior Member
              • Jan 2010
              • 28

              #7
              Solved by Upgrade to 2.4.3

              Solved by Upgrade to 2.4.3

              Comment

              Working...