Ad Widget

**kloczek** · 03-04-2018, 19:03

1)Number of monitored hosts has nothing to do with zabbix performance. Only relevant factor is NVPS or effective bandwidth of the monitoring data. You may have 1000 hosts with one metric per host or one host with 1000 metrics and from performance point of view those two cases will be equal. 2) Using horizontally scaled active-active DB backend has its own performance impact and using such setup will not scale bond some flow to/into DB backennd 3) look on your DB backend hosts IO statistics.

**bjornskau** · 10-04-2018, 09:26

Originally posted by kloczek

1)Number of monitored hosts has nothing to do with zabbix performance. Only relevant factor is NVPS or effective bandwidth of the monitoring data. You may have 1000 hosts with one metric per host or one host with 1000 metrics and from performance point of view those two cases will be equal. 2) Using horizontally scaled active-active DB backend has its own performance impact and using such setup will not scale bond some flow to/into DB backennd 3) look on your DB backend hosts IO statistics.

My NVPS is 28827.97

Here is output of pt-diskstats for
db02 (which acts as primary at time of testing)
pt-diskstats --interval=20 --iterations=3

#ts device rd_s rd_avkb rd_mb_s rd_mrg rd_cnc rd_rt wr_s wr_avkb wr_mb_s wr_mrg wr_cnc wr_rt busy in_prg io_s qtime stime
14.6 sda 0.0 0.0 0.0 0% 0.0 0.0 108.0 6.7 0.7 6% 0.0 0.3 3% 0 108.0 0.0 0.2
14.6 sda3 0.0 0.0 0.0 0% 0.0 0.0 47.9 15.1 0.7 13% 0.0 0.4 1% 0 47.9 0.1 0.3
14.6 dm-0 0.0 0.0 0.0 0% 0.0 0.0 112.2 6.4 0.7 0% 0.0 0.3 3% 0 112.2 0.1 0.2

20.0 sda 0.0 0.0 0.0 0% 0.0 0.0 167.2 7.2 1.2 4% 0.1 0.3 4% 0 167.2 0.1 0.2
20.0 sda3 0.0 0.0 0.0 0% 0.0 0.0 74.6 16.0 1.2 9% 0.0 0.4 2% 0 74.6 0.1 0.3
20.0 dm-0 0.0 0.0 0.0 0% 0.0 0.0 171.8 7.0 1.2 0% 0.1 0.3 4% 0 171.8 0.1 0.2

20.0 sda 0.4 512.0 0.2 0% 0.0 19.4 493.8 3.7 1.8 2% 0.1 0.3 11% 0 494.2 0.0 0.2
20.0 sda3 0.4 512.0 0.2 0% 0.0 19.4 192.1 9.6 1.8 5% 0.1 0.3 6% 0 192.5 0.1 0.3
20.0 dm-0 0.4 512.0 0.2 0% 0.0 19.5 500.4 3.7 1.8 0% 0.1 0.3 11% 0 500.8 0.1 0.2

db01 (which acts as secondary):
#ts device rd_s rd_avkb rd_mb_s rd_mrg rd_cnc rd_rt wr_s wr_avkb wr_mb_s wr_mrg wr_cnc wr_rt busy in_prg io_s qtime stime
11.9 sda 0.1 16.0 0.0 0% 0.0 8.0 748.6 2.4 1.8 0% 0.3 0.3 17% 0 748.7 0.1 0.2
11.9 sda2 0.0 0.0 0.0 0% 0.0 0.0 0.2 4.0 0.0 0% 0.0 0.5 0% 0 0.2 0.0 0.5
11.9 sda3 0.1 16.0 0.0 0% 0.0 8.0 179.2 10.2 1.8 1% 0.1 0.4 5% 0 179.3 0.1 0.3
11.9 dm-0 0.1 16.0 0.0 0% 0.0 8.0 462.0 4.0 1.8 0% 0.2 0.4 16% 0 462.1 0.1 0.4

20.0 sda 0.0 0.0 0.0 0% 0.0 0.0 744.9 2.2 1.6 0% 0.2 0.3 16% 0 744.9 0.1 0.2
20.0 sda2 0.0 0.0 0.0 0% 0.0 0.0 0.0 0.0 0.0 0% 0.0 0.0 0% 0 0.0 0.0 0.0
20.0 sda3 0.0 0.0 0.0 0% 0.0 0.0 175.7 9.2 1.6 0% 0.1 0.4 5% 0 175.7 0.1 0.3
20.0 dm-0 0.0 0.0 0.0 0% 0.0 0.0 456.8 3.5 1.6 0% 0.2 0.4 16% 0 456.8 0.0 0.3

20.0 sda 0.0 0.0 0.0 0% 0.0 0.0 696.3 2.8 1.9 0% 0.2 0.3 15% 0 696.3 0.1 0.2
20.0 sda2 0.0 0.0 0.0 0% 0.0 0.0 0.0 0.0 0.0 0% 0.0 0.0 0% 0 0.0 0.0 0.0
20.0 sda3 0.0 0.0 0.0 0% 0.0 0.0 169.2 11.5 1.9 0% 0.1 0.4 5% 0 169.2 0.1 0.3
20.0 dm-0 0.0 0.0 0.0 0% 0.0 0.0 429.3 4.5 1.9 0% 0.2 0.4 15% 0 429.3 0.1

**kloczek** · 12-04-2018, 15:01

"IO statistics"-> number of IO/s (read and write)
Do you have disks IO monitoring on these hosts?

**bjornskau** · 12-04-2018, 15:03

Originally posted by kloczek

"IO statistics"-> number of IO/s (read and write)
Do you have disks IO monitoring on these hosts?

Actually, no.(

**kloczek** · 13-04-2018, 10:19

Originally posted by bjornskau

Actually, no.(

Usually I'm calling this kind of situations as "shoemaker without shoes"

If you are using standard OOTB zabbix templates and your DB backend is working under Linux it is not your (kind of) fault as those templates does not provide IO statistics.
If you want you can use mine "OS Linux" template
It has additional "DSK:" LLD which adds 4 items prototypes:

{#DISK}::read::bytes vfs.dev.read[{#DISK},sectors]
{#DISK}::read::IOs vfs.dev.read[{#DISK},operations]
{#DISK}::write::bytes vfs.dev.write[{#DISK},sectors]
{#DISK}::write::IOs vfs.dev.write[{#DISK},operations]

two graphs prototypes: DSK::{#DISK}::bytes, DSK::{#DISK}::IOs.
And one screen "DSK" which presents all graphs together.
Feel free to use it/criticise and/or contribute

Ad Widget

Zabbix 3.2 is extremely slow with 20000 monitored hosts.

Zabbix 3.2 is extremely slow with 20000 monitored hosts.

Comment

Comment

Comment

Comment

Comment