Ad Widget

Collapse

Housekeeper & alerter more than 75% busy

Collapse
X
 
  • Time
  • Show
Clear All
new posts
  • akarim
    Member
    • Aug 2008
    • 33

    #1

    Housekeeper & alerter more than 75% busy

    Hi.

    I'm lost. On latest days, get a several problems. The symptoms are Housekeeper & alerter more than 75% busy, but result a lot of

    host unavailable, first network error, wait for 15 seconds, connection restored, host became available, and false positives.

    Zabbix 2.0.6

    conf on agents-->
    Code:
    LogFile=/tmp/zabbix_agentd.log
    LogFileSize=10
    EnableRemoteCommands=0
    LogRemoteCommands=0
    Server=144.76.XXX.XXX
    ListenPort=10050
    StartAgents=3
    ServerActive=144.76.XXX.XXX
    Hostname=XXXX.XXXX.XXXX
    RefreshActiveChecks=120
    BufferSend=30
    BufferSize=500
    MaxLinesPerSecond=100
    Timeout=30
    On server.conf
    Code:
    NodeID=0
    ListenPort=10051
    LogFile=/tmp/zabbix_server.log
    LogFileSize=10
    DBHost=localhost
    DBName=XXXX
    DBUser=XXX
    DBPassword=XXXXXXXX
    StartPollers=35
    StartIPMIPollers=0
    StartPollersUnreachable=30
    StartTrappers=30
    StartPingers=10
    StartDiscoverers=3
    StartHTTPPollers=1
    ListenIP=0.0.0.0
    HousekeepingFrequency=24
    MaxHousekeeperDelete=100000
    DisableHousekeeping=0
    CacheSize=128M
    StartDBSyncers=8
    my,cnf
    Code:
    [client]
    port		= 3306
    socket		= /var/run/mysqld/mysqld.sock
    [mysqld_safe]
    socket		= /var/run/mysqld/mysqld.sock
    nice		= 0
    [mysqld]
    user		= mysql
    pid-file	= /var/run/mysqld/mysqld.pid
    socket		= /var/run/mysqld/mysqld.sock
    port		= 3306
    basedir		= /usr
    datadir		= /var/lib/mysql
    tmpdir		= /tmp
    lc-messages-dir	= /usr/share/mysql
    skip-external-locking
    log-error = /var/log/mysqld.error.log
    max_connections = 300
    key_buffer = 384M
    max_allowed_packet = 64M
    thread_cache_size = 8
    myisam-recover         = BACKUP
    query_cache_limit	= 1M
    query_cache_size = 64M
    join_buffer_size = 1M
    expire_logs_days	= 10
    max_binlog_size         = 100M
    innodb_file_per_table = 1
    thread_concurrency = 8
    myisam_sort_buffer_size = 64M
    read_rnd_buffer_size = 8M
    read_buffer_size = 2M
    sort_buffer_size = 2M
    table_cache = 512
    [mysqldump]
    quick
    quote-names
    max_allowed_packet	= 16M
    [mysql]
    [isamchk]
    key_buffer = 256M
    write_buffer = 2M
    read_buffer = 2M
    sort_buffer_size = 256M
    !includedir /etc/mysql/conf.d/






    Apreciate help.
  • akarim
    Member
    • Aug 2008
    • 33

    #2

    Comment

    • tchjts1
      Senior Member
      • May 2008
      • 1605

      #3
      How many hosts do you have reporting in? Your screenshots are showing only an hour of time. 6 hours or 12 hours would be better to show a pattern.

      In your zabbix_server.conf file, do you have a Timeout= setting? If not, put it in and try setting it at 10. Default is 3.

      Something that strikes me as odd is that you have MaxHousekeeperDelete set that high. When housekeeper runs, it does usually hit 100% for awhile. I believe the setting default is 500? I have mine set to 300.

      With mine set like that, it runs about 10 minutes every hour. I think maybe you have housekeeper running so much that the Zabbix server can't do anything else. I have Zabbix DB and Zabbix APP on 2 different VM's and still see performance degradation in Zabbix screens and graphs when housekeeper runs.
      Attached Files

      Comment

      • tchjts1
        Senior Member
        • May 2008
        • 1605

        #4
        It would also be helpful if you showed a screenshot of your memory and swap usage (total and free) just like this from the server that has your Zabbix DB on it
        Attached Files

        Comment

        • akarim
          Member
          • Aug 2008
          • 33

          #5
          Originally posted by tchjts1
          How many hosts do you have reporting in? Your screenshots are showing only an hour of time. 6 hours or 12 hours would be better to show a pattern.





          In your zabbix_server.conf file, do you have a Timeout= setting? If not, put it in and try setting it at 10. Default is 3.
          Put 10...

          Something that strikes me as odd is that you have MaxHousekeeperDelete set that high. When housekeeper runs, it does usually hit 100% for awhile. I believe the setting default is 500? I have mine set to 300.
          10000 move to 300
          [/QUOTE]

          Thanks..

          Comment

          • akarim
            Member
            • Aug 2008
            • 33

            #6


            Comment

            • tchjts1
              Senior Member
              • May 2008
              • 1605

              #7
              So... now you're good?

              Comment

              • akarim
                Member
                • Aug 2008
                • 33

                #8
                Yes.. only

                Zabbix alerter processes more than 75% busy


                A lot of thanks.

                Comment

                • pc99096
                  Senior Member
                  • Oct 2011
                  • 193

                  #9
                  any clue what to do when the alerter process is more than 75% busy?
                  which parameter to increase? unfortunately there is no google help

                  Comment

                  • mdk
                    Junior Member
                    • Jul 2014
                    • 9

                    #10
                    Originally posted by pc99096
                    any clue what to do when the alerter process is more than 75% busy?
                    which parameter to increase? unfortunately there is no google help
                    hi,pc99096,you can set Default operation step duration on zabbix web to deal with alerter process is more than 75% busy, web->configuration->Action->Operations->Default operation step duration

                    Comment

                    • mdk
                      Junior Member
                      • Jul 2014
                      • 9

                      #11
                      how to deal with Zabbix history syncer processes more than 75% busy? I set StartDBSyncers=50

                      Comment

                      • ingus.vilnis
                        Senior Member
                        Zabbix Certified Trainer
                        Zabbix Certified SpecialistZabbix Certified Professional
                        • Mar 2014
                        • 908

                        #12
                        Hi,

                        Please don't mess with StartDBSyncers and set it back to what it was before. (4 by default)

                        Take a look at other performance issues - poller amounts and cache usages and database performance tuning.

                        Best Regards,
                        Ingus

                        Comment

                        • implode
                          Junior Member
                          • Jan 2015
                          • 1

                          #13
                          I have the same problem. My housekeeper and alerter processes are at 100% and from what I can tell, housekeeper has been over 75% for 1h 11m 19s and the alerter 1h 1m 17s. It seems that when the housekeeper kicks in, a lot of triggers may timeout and this causes an alerts snowstorm on false alerts. Even though housekeeper now is at 100%, the zabbix server CPU is still 85-90% idle. Can housekeeper and alerter processes be parallelized such that there can be more than one of each such processes.

                          Comment

                          • Slash
                            Member
                            • May 2011
                            • 64

                            #14
                            Housekeeping is a typical problem when your database begin to grow.

                            The common advice is to completely disable it and use database partitioning instead, which does the same but with little (if any) overhead.

                            Currently what the housekeeper does is that it parse your history/trend and remove items that are exceeding their maximum age (the "History storage period (in days)" and the "Trend storage period (in days)" from items configuration). As you can guest, this process become more and more heavy as your database grow... and since the housekeeper run every hour, this can become a big issue.

                            To give you an idea, I have seen some housekeeping taking more than 1 hour, at this point you are always in housekeeping and zabbix is ultra slow at all time...

                            When you use data partitionning, your table are separated in different child table, per period (one per day, one per month), at this point, you just have to drop a child table when you don't need the data any more and that's all.

                            Personally, I'm using this method with postgresql: https://www.zabbix.com/forum/showthread.php?t=33583 and there are other method on the wiki.
                            Last edited by Slash; 26-01-2015, 16:50.

                            Comment

                            Working...