**Markku** · 06-01-2023, 11:18

These are always interesting cases for learning. What is your Zabbix version?

The documentation (https://www.zabbix.com/documentation.../zabbix_server) for StartDBSyncers says the "default value (4) should be enough to handle up to 4000 NVPS". Have you tried increasing it to 5 to relieve the history syncers?

How is your database performing, do you have statistics for it? (I'm not a professional DBA but maybe someone can comment on those.)

There is also some contention on the unreachable pollers; try increasing their number moderately as well.

How are your items actually configured? You have active proxies, yet the pollers on the server are still highly utilized. How does increasing the pollers further (but moderately) affect your case?

Markku

**medl** · 06-01-2023, 11:51

Poller utilization goes to 100%. My interpretation would be that there are either additional checks running or a latency or timing issue.
What Timeout do you have configured?

I observed that when Zabbix is using SNMP bulk requests and they fail (timeout or similar), Zabbix may fall back to standard SNMP requests.
This puts a lot of pressure on the pollers, and after a restart it does bulk requests again. (Observed behaviour in version 6.0.8.)
Your problem sounds similar, but here I am just assuming that you have an SNMP workload. The unreachable poller spike fits into that picture.

Once I had identified the unreliable devices, I deployed a special proxy for those so they don't clog up regular processing.
Updating the SNMP information on an unreliable device also helps (sometimes). Standard requests also put a lot of pressure on the target devices, which in turn slows down processing even more.
So far I have not found a way to force Zabbix to keep doing bulk requests.
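To illustrate why a fallback from bulk to standard requests hurts so much, here is a rough sketch of the request count per polling cycle for one device. The item count is hypothetical, and the figure of 128 values per bulk request is an assumption for illustration; check the limit that applies to your version and devices.

```python
import math

# Assumption: upper bound on values fetched in one bulk request.
ASSUMED_VALUES_PER_BULK_REQUEST = 128


def requests_per_cycle(item_count: int, values_per_request: int) -> int:
    """Number of SNMP requests needed to collect item_count values once."""
    if values_per_request < 1:
        raise ValueError("values_per_request must be at least 1")
    return math.ceil(item_count / values_per_request)


# Hypothetical device with 500 SNMP items.
items_on_device = 500

bulk = requests_per_cycle(items_on_device, ASSUMED_VALUES_PER_BULK_REQUEST)
single = requests_per_cycle(items_on_device, 1)

print(f"bulk requests per cycle:     {bulk}")
print(f"standard requests per cycle: {single}")
print(f"increase factor:             {single / bulk:.0f}x")
```

Under these assumptions the same device goes from a handful of requests to hundreds per cycle, which loads both the pollers and the device itself.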

You can identify those devices by checking the Zabbix queue.
Side note: at the moment I suspect that my installation will not survive a larger network outage because of this.

Back to your case: you said you have 6 proxies, but the graphs show that the server is also polling (and maybe runs into the issue I described).
Personally I'd recommend offloading as much as possible to the proxies; that keeps the server and frontend "clean and working" even if there are problems in processing.

Do you also have utilization graphs from your proxies?

**ChenAvi** · 08-01-2023, 11:22

Hi, thank you for your answers, I'll try to implement your suggestions.
Our version is zabbix_server (Zabbix) 5.0.26.
Regarding the DB, we haven't noticed any performance issues with it. If I'm not mistaken, the timeout we have is 30 seconds, and we don't have graphs for the proxies.

Chen
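A back-of-the-envelope sketch of why a 30 second timeout matters when devices stop answering. It assumes, as a simplification, that every failing check holds one poller for the full timeout; the rate of failing checks is a hypothetical number, and real Zabbix behaviour (retries, hand-over to unreachable pollers) will differ.

```python
def pollers_tied_up(failing_checks_per_second: float,
                    timeout_seconds: float) -> float:
    """Average number of pollers blocked waiting on timeouts.

    Little's law: concurrency = arrival rate * time spent per request.
    Simplifying assumption: each failing check blocks for the whole timeout.
    """
    return failing_checks_per_second * timeout_seconds


# Hypothetical: 2 checks per second hit devices that do not respond.
failing_rate = 2.0

for timeout in (30, 10, 3):
    busy = pollers_tied_up(failing_rate, timeout)
    print(f"Timeout={timeout:>2}s -> about {busy:.0f} pollers busy waiting")
```

With these made-up numbers, a 30 second timeout keeps roughly ten times as many pollers waiting as a 3 second one, so the timeout is worth reviewing together with the poller counts.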

**cyber** · 09-01-2023, 10:38

Considering the number of hosts (just below 2k) and an NVPS of 5117, you do A LOT of polling... and it shows: pollers are busy, config and history syncers are busy. But as suggested, offload ALL polling from the server. I would also recheck all the check intervals. Do you really need to check as often as you do (and I don't even know how often you check)?
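To make the interval point concrete, here is a small sketch of how update intervals translate into NVPS. The host count and NVPS are the figures from this thread; the item mix is entirely hypothetical and only shows the arithmetic.

```python
def estimate_nvps(item_groups):
    """Estimate new values per second.

    item_groups: iterable of (item_count, update_interval_seconds) pairs.
    Each item contributes 1/interval values per second.
    """
    return sum(count / interval for count, interval in item_groups)


# Figures mentioned in the thread.
reported_nvps = 5117
hosts = 2000  # "just below 2k"
print(f"about {reported_nvps / hosts:.2f} values per second per host")

# Hypothetical item mix: (number of items, update interval in seconds).
item_mix = [(100_000, 30), (50_000, 60), (20_000, 300)]
current = estimate_nvps(item_mix)

# Same items with every interval doubled.
relaxed = estimate_nvps([(count, interval * 2) for count, interval in item_mix])

print(f"hypothetical mix:  {current:.0f} NVPS")
print(f"intervals doubled: {relaxed:.0f} NVPS")
```

Doubling every interval halves the load, so reviewing the shortest intervals on the largest item groups is usually where the biggest reduction comes from.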

Zabbix deployment scaling issues advise