Ad Widget

Collapse

Zabbix 6.4.12 - frontend slowing down and proxy disconnecting every 20 minutes or so

Collapse
X
 
  • Time
  • Show
Clear All
new posts
  • intense
    Junior Member
    • Nov 2023
    • 18

    #1

    Zabbix 6.4.12 - frontend slowing down and proxy disconnecting every 20 minutes or so

    I have zabbix version 6.4.12 and I have a problem that about every 20 minutes "something" causes the zabbix frontend to slow down for a while and freeze for 1-2 minutes.
    Then disk usage increases significantly and disconnects all proxies for a while.
    Has anyone had this problem?
    Maybe it's a matter of some configuration setting?
    I'm asking for help because I've been struggling with this for several days. There is nothing special in the zabbix server logs at this point. The same in the database logs.
    I have MariaDB version 10.5.
    I have InnoDB optimizations configured.

    During this trim Queue temporarily increases significantly... and during these 20 minutes there are no problems with delayed queries...

    [mariadb-10.5]

    max_connections = 250

    innodb_buffer_pool_size=22G
    innodb_flush_method=O_DIRECT
    innodb_file_per_table=1
    innodb_flush_log_at_trx_commit=1
    innodb_log_file_size=5G
    innodb_log_buffer_size=256M
    innodb_stats_on_metadata=off
    tmp_table_size=96M
    max_heap_table_size=96M
    open_files_limit=65535
    max_connect_errors=1000000
    connect_timeout=60
    wait_timeout=28800
    skip-name-resolve=ON
    performance_schema=ON
    join_buffer_size=256K
    My virtual machine has 4 CPus (ESXi) and 32GB RAM.

    Click image for larger version  Name:	zbx1.png Views:	0 Size:	78.8 KB ID:	481341

    Click image for larger version  Name:	zbx3.png Views:	0 Size:	150.3 KB ID:	481345

    Click image for larger version

Name:	zbx4.png
Views:	738
Size:	15.0 KB
ID:	481346

    Click image for larger version  Name:	zbx2.png Views:	0 Size:	151.4 KB ID:	481344

    This is what it looks like on the chart and in the proxy log.
    What can you do every 20 minutes or so?




    Attached Files
    Last edited by intense; 25-03-2024, 22:56.
  • LenR
    Senior Member
    • Sep 2009
    • 1005

    #2
    What is the config cache reload time for the server and proxy? Mine was 20 min, but I don't remember if that is the default or if I changed it.

    Comment

    • intense
      Junior Member
      • Nov 2023
      • 18

      #3
      Originally posted by LenR
      What is the config cache reload time for the server and proxy? Mine was 20 min, but I don't remember if that is the default or if I changed it.
      I disconnected all proxies for a test and the consumption still increases every 20 minutes and it's the same...

      On server I have # this options:
      Is it wrong?

      Are you talking about another parameter?


      Click image for larger version  Name:	zbx5.png Views:	0 Size:	37.6 KB ID:	481350
      Last edited by intense; 25-03-2024, 23:52.

      Comment

      • intense
        Junior Member
        • Nov 2023
        • 18

        #4
        I did ProxyConfigFrequency=120 but it didn't help

        Is this parameter CacheUpdateFrequency?
        Last edited by intense; 25-03-2024, 23:58.

        Comment

        • intense
          Junior Member
          • Nov 2023
          • 18

          #5
          It's not the configuration's fault because I changed it to the default one and the problem still exists. Could this be the reason that my Zabbix Server is based on MariaDB 10.5 database, and all proxies are on version 5.5?
          Queue Size and others were jumping like that overnight... and then the entire frontend blocked for a moment and the proxies cut off the connections

          Click image for larger version

Name:	zbx6.png
Views:	748
Size:	143.3 KB
ID:	481359

          Comment

          • intense
            Junior Member
            • Nov 2023
            • 18

            #6
            Zabbix approximately every 20 minutes seems to lag for a moment and then Queue Size and Utilization temporarily increases and the frontend does not work. After a minute or two, everything returns to normal and runs smoothly. Something seems to be turning on every 20 minutes... maybe someone has an idea? I have no problems with Queue and they all pass nicely.

            Click image for larger version

Name:	zbx7.png
Views:	721
Size:	78.3 KB
ID:	481367
            Attached Files

            Comment

            • intense
              Junior Member
              • Nov 2023
              • 18

              #7
              I reinstalled everything, the server and all proxies. I configured Zabbix from scratch and ZABBIX still cuts out every 20 minutes for 1 minute... the interface doesn't work and utilization increases. Disconnects all proxies and reconnects them after a while. I have no strength left. What could be the reason? I have Zabbix and proxy version 6.4.13. MariaDB 10.5.22 database everywhere. InnoDB optimization done. I use everything in passive mode. Proxies are ACTIVE. I also have Zabbix agents installed on proxy machines and I have # Server Active in the agent... could this be some reason?

              utilization does not increase significantly and it still jams the interface​

              Click image for larger version

Name:	obraz.png
Views:	753
Size:	97.7 KB
ID:	481474

              Comment

              • intense
                Junior Member
                • Nov 2023
                • 18

                #8
                What could be the problem?
                I'm out of ideas​ ...

                Click image for larger version

Name:	obraz.png
Views:	717
Size:	141.5 KB
ID:	481501

                Comment

                • intense
                  Junior Member
                  • Nov 2023
                  • 18

                  #9
                  My configuration in attachment file.

                  I have VM machine 4 CPUs and 16GB RAM.

                  My optimize for MariaDB 10.5.22 InnoDB:

                  [mariadb-10.5]

                  max_connections=250

                  innodb_buffer_pool_size=8G
                  innodb_flush_method=O_DIRECT
                  innodb_log_file_size=2G
                  innodb_log_buffer_size=128M
                  performance_schema=ON
                  join_buffer_size=4M
                  skip-name-resolve=ON
                  tmp-table-size = 128M
                  max-heap-table-size = 96M
                  open_files_limit = 65535
                  max_connect_errors = 1000000
                  connect_timeout = 60
                  Help me because I have no ideas what could cause these jumps and Zabbix freezes. I turned off Housekeeper for a test and still the same thing.
                  Jamming occurs almost every 20 minutes.
                  Attached Files

                  Comment

                  • LenR
                    Senior Member
                    • Sep 2009
                    • 1005

                    #10
                    Does the monitoring.rvn.com.pl host collect data at other times? What is the UnreachablePeriod setting? I'm wondering if it could be unreachable items retrying at 20 minutes.

                    What are your housekeeping intervals? Is housekeeping active when the problem happens?

                    Comment

                    • intense
                      Junior Member
                      • Nov 2023
                      • 18

                      #11
                      Originally posted by LenR
                      Does the monitoring.rvn.com.pl host collect data at other times? What is the UnreachablePeriod setting? I'm wondering if it could be unreachable items retrying at 20 minutes.

                      What are your housekeeping intervals? Is housekeeping active when the problem happens?
                      But UnreachablePeriod I have #

                      Comment

                      • tim.mooney
                        Senior Member
                        • Dec 2012
                        • 1427

                        #12
                        Originally posted by intense
                        I have VM machine 4 CPUs and 16GB RAM.
                        Your Zabbix server, your Zabbix web front end, and your Zabbix database are all in a VM environment. That typically means multiple VMs accessing the same group of disks (or flash storage if you're lucky), so if some other VM is running an I/O intensive job every 20 minutes, it could be drastically impacting the I/O on your VM. You need to check with your vSphere administrators and have them check the VMWare built-in monitors for the storage your system is using. Make certain there is plenty of I/O availability during the period when Zabbix is not functioning correctly. It could easily be that some other VM has a scheduled task or cron job that runs every 20 minutes that is very I/O intensive. Even if it only runs for a minute, it may be impacting your VM.

                        Next, your initial VM install was 4 CPUs and 32 GB RAM, but your reinstall is 4 CPUs and 16 GB RAM. Giving the VM less RAM is not going to improve performance. At least for the database, the more RAM you can allocate to it, the better.

                        I have thoughts about several of your MariaDB tuning settings, but I think you should first verify (or rule out) whether the performance problem you're seeing is being caused by high I/O on some storage your VM shares.
                        Last edited by tim.mooney; 02-04-2024, 04:28.

                        Comment

                        Working...