**kernbug** · 14-06-2018, 10:48

Hello

Do you use partitioning?
Size of the DB?

Could you provide the following graphs?

Zabbix server performance
Zabbix internal process busy
Zabbix data gathering process busy
Zabbix cache usage

**zabbixfk** · 18-06-2018, 12:52

Thank you for the reply.
No partitioning done on mysql.
Size of the DB is ~ 176GB

Zabbix cache usage, 7 days

Zabbix server performance, 7 days

Zabbix internal process busy, 7 days

Zabbix data gathering process busy, 7 days

Thanks

**kernbug** · 19-06-2018, 06:29

Thank you for the additional information.

Could you gather information from strace (for about 5-10 minutes):

Code:

strace -s 100 -T -tt -fp PID -e trace=write

where PID is the 'history syncer' process id in the system.

**zabbixfk** · 19-06-2018, 12:42

Here is the file.

**kernbug** · 19-06-2018, 13:54

Thank you. Nothing alarming, except:

Code:

12:24:43.122005 write(6, " 24297:20180619:122443.121 __zbx_zbx_setproctitle() title:'history syncer #1 [synced 80 items in 0.2"..., 124) = 124 <0.000008>
12:24:44.122502 write(6, " 24297:20180619:122444.122 __zbx_zbx_setproctitle() title:'history syncer #1 [synced 80 items in 0.2"..., 129) = 129 <0.000014>
12:24:44.122715 write(6, " 24297:20180619:122444.122 In DCsync_history() history_num:21\n", 62) = 62 <0.000008>
12:25:21.681305 write(6, " 24297:20180619:122521.681 __zbx_zbx_setproctitle() title:'history syncer #1 [synced 18 items in 0.9"..., 124) = 124 <0.000008>
12:25:22.681851 write(6, " 24297:20180619:122522.681 __zbx_zbx_setproctitle() title:'history syncer #1 [synced 18 items in 0.9"..., 129) = 129 <0.000009>
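
One thing that stands out in this excerpt is the jump from 12:24:44 to 12:25:21 between two consecutive writes. A minimal Python sketch for spotting such gaps in a saved strace log is given here. It assumes the `-tt` timestamp format shown in the excerpt, with no `[pid N]` prefix and no midnight rollover; the function name `find_gaps` and the 5-second threshold are hypothetical choices for illustration, not part of strace or Zabbix.

```python
import re
from datetime import datetime

# Matches the wall-clock prefix that `strace -tt` puts on each line,
# e.g. "12:24:43.122005 write(...)". Assumes no "[pid N]" prefix.
TIMESTAMP = re.compile(r"^(\d{2}:\d{2}:\d{2}\.\d{6}) ")


def find_gaps(lines, threshold_seconds=5.0):
    """Return (previous_time, current_time, gap_seconds) for each gap above the threshold."""
    gaps = []
    previous = None
    for line in lines:
        match = TIMESTAMP.match(line)
        if not match:
            continue  # skip lines without a leading timestamp
        current = datetime.strptime(match.group(1), "%H:%M:%S.%f")
        if previous is not None:
            gap = (current - previous).total_seconds()
            if gap > threshold_seconds:
                gaps.append((previous.time(), current.time(), gap))
        previous = current
    return gaps


if __name__ == "__main__":
    # Timestamps taken from the excerpt; the write() payloads are shortened.
    sample = [
        '12:24:43.122005 write(6, "...", 124) = 124 <0.000008>',
        '12:24:44.122502 write(6, "...", 129) = 129 <0.000014>',
        '12:24:44.122715 write(6, "...", 62) = 62 <0.000008>',
        '12:25:21.681305 write(6, "...", 124) = 124 <0.000008>',
        '12:25:22.681851 write(6, "...", 129) = 129 <0.000009>',
    ]
    for start, end, gap in find_gaps(sample):
        print(f"{start} -> {end}: {gap:.2f} s without a write")
```

On a real capture, the lines would be read from the file that strace wrote, for example with `open(path).read().splitlines()`.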

Please, reduce:

Code:

 
StartDBSyncers=50 -> 4
StartPollers=275 -> 150
StartTrappers=130 -> 30

And, if possible, apply partitioning, but take a backup first.

**zabbixfk** · 19-06-2018, 14:46

Thanks, I really appreciate your reply. I wanted to ask how you came up with these numbers. Can you help me figure out the same?
And how do I set up partitioning? Only on the server, or on all the proxies?
I am new to MySQL and tuned all the parameters with help from the internet, so it would be great if you could share some pointers.
Doesn't decreasing the pollers affect the incoming data? Another thread mentioned increasing the pollers whenever the number of hosts/items on that server/proxy increases, so I kept it at this number. I'm not sure if I am doing it right. Can you shed some light here, please?

Thanks

**kernbug** · 20-06-2018, 10:40

Hello

Originally posted by zabbixfk

I wanted to ask how you came up with these numbers. Can you help me figure out the same?

Sleepless nights with ~40000 hosts and ~10k nvps, and I'm still far from Zabbix expert level.

Originally posted by zabbixfk

And how do I set up partitioning? Only on the server, or on all the proxies?

In most cases, partitioning the Zabbix server DB is enough. If you want, the Zabbix proxy DB can also be partitioned (only the proxy_history table).

Setup instructions are here: https://zabbix.org/wiki/Docs/howto/mysql_partition
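
To give a rough idea of what range partitioning by day looks like, here is a minimal Python sketch that only prints an `ALTER TABLE` statement; it does not connect to any database. It assumes the table has an integer `clock` column holding Unix time, as the Zabbix history tables do. The helper names, the `pYYYY_MM_DD` partition naming, and the use of UTC midnight boundaries are illustrative assumptions; the linked instructions remain the reference for the real procedure. Repartitioning a large existing table is likely to take a long time, so try it on a copy first.

```python
from datetime import date, datetime, timedelta, timezone


def daily_partitions(first_day, days):
    """Yield (partition_name, upper_bound_epoch) for consecutive daily partitions."""
    for offset in range(days):
        day = first_day + timedelta(days=offset)
        # Upper bound is midnight UTC at the start of the following day.
        next_midnight = datetime.combine(
            day + timedelta(days=1), datetime.min.time(), tzinfo=timezone.utc
        )
        yield f"p{day:%Y_%m_%d}", int(next_midnight.timestamp())


def partition_ddl(table, first_day, days):
    """Build an ALTER TABLE statement that range-partitions `table` on `clock`."""
    parts = ",\n".join(
        f"  PARTITION {name} VALUES LESS THAN ({bound})"
        for name, bound in daily_partitions(first_day, days)
    )
    return f"ALTER TABLE {table} PARTITION BY RANGE (clock) (\n{parts}\n);"


if __name__ == "__main__":
    # Print the statement for review; nothing is executed against MySQL.
    print(partition_ddl("history", date(2018, 6, 19), 3))
```

Rows older than the first boundary land in the first partition, while rows newer than the last boundary are rejected, so new partitions have to be added ahead of time.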

Originally posted by zabbixfk

I am new to MySQL and tuned all the parameters with help from the internet, so it would be great if you could share some pointers.
Doesn't decreasing the pollers affect the incoming data? Another thread mentioned increasing the pollers whenever the number of hosts/items on that server/proxy increases, so I kept it at this number. I'm not sure if I am doing it right. Can you shed some light here, please?

Thanks

Zabbix components are well optimized and performance is great, but sometimes there is a bottleneck you need to take care of. Simply increasing the number of processes increases lock contention between them, StartDBSyncers for example. I have seen only a few DB server configs that can survive with StartDBSyncers>30. A simple rule: start small, monitor your load, increase parameters (one by one, not all at once), and find the bottlenecks.

**zabbixfk** · 20-06-2018, 11:49

Thanks for the reply.
I am trying the changes you suggested (decreasing pollers etc.) on one of the proxies, and I see the queue going down in all the columns. But then it suddenly increases, stays there for some time, and decreases again. Should I be worried? (P.S. There are still >90 items in the "more than 10 minutes" column on that proxy.)

I could restart the master only once, and the queue in all columns decreased.

Again the same case: it increases on every refresh I do, not consistently. I believe that to apply config changes I have to restart zbx_server.
What is the best way to measure these things? Are there any documents you would suggest I read to understand what works better for my environment, e.g. how many pollers are needed for how many hosts/items? I am asking because you said that increasing the number of processes won't help. Just a request, though. Any books or articles? The manuals don't cover this part.
- I can't partition the server DB right now; that will need downtime. Let me see if that's possible. For a 170+ GB database I don't know how long it will take.
BTW, I am running all in one: Zabbix server, frontend and DB on one machine. Should I think about moving the frontend or the DB to another machine? Will it help? What kind of connectivity is required between the DB, the server and httpd? Is 1G enough? Physical or virtual? Any preferred combinations you can think of?
- Any ideas on taking backups? I have set up master-slave replication to one of the boxes, so that is the only DB backup I have. I hope that's enough.

OK, I am asking too many things. I am really thankful to you for patiently pointing me in the right direction.

Thanks

**kernbug** · 20-06-2018, 13:24

Originally posted by zabbixfk

Thanks for the reply.
I am trying the changes you suggested (decreasing pollers etc.) on one of the proxies, and I see the queue going down in all the columns. But then it suddenly increases, stays there for some time, and decreases again. Should I be worried?

The overall queue mostly depends on the Zabbix server process: the proxy pushes new values to the server, and the server (history syncer) must write them to the database.

Are there any documents you would suggest I read to understand what works better for my environment, e.g. how many pollers are needed for how many hosts/items? I am asking because you said that increasing the number of processes won't help. Just a request, though.

Increase pollers as you grow: if the load on the performance graphs is about 75%, add 10 pollers, restart, and check. One DB syncer writes at most 1000 values to the database in a single operation; if that operation completes in under 500 ms, your database performance is sufficient. It's also better to have a test site running the same version as your main setup.
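
The two rules of thumb above can be turned into quick arithmetic. The following Python sketch is a back-of-the-envelope illustration only: the function names are hypothetical, and the throughput figure is an estimate for the case where every batch is full and takes the whole 500 ms, not a measured or guaranteed value.

```python
def syncer_throughput(values_per_batch=1000, batch_seconds=0.5):
    """Values per second one history syncer could write if every batch is full."""
    return values_per_batch / batch_seconds


def next_poller_count(current_pollers, busy_percent, threshold=75.0, step=10):
    """Apply the 'about 75% busy, add 10 pollers' rule once."""
    if busy_percent >= threshold:
        return current_pollers + step
    return current_pollers


if __name__ == "__main__":
    per_syncer = syncer_throughput()
    print(f"one syncer: about {per_syncer:.0f} values/s")
    print(f"four syncers: about {4 * per_syncer:.0f} values/s")
    print(f"pollers after one step at 80% busy: {next_poller_count(150, 80.0)}")
```

By this estimate a single syncer at the 500 ms limit handles about 2000 values per second, which suggests why a handful of syncers can be enough when the database keeps up.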

BTW, I am running all in one: Zabbix server, frontend and DB on one machine. Should I think about moving the frontend or the DB to another machine? Will it help? What kind of connectivity is required between the DB, the server and httpd? Is 1G enough? Physical or virtual? Any preferred combinations you can think of?

A 2-node cluster is enough: replicated database, 1 server with fencing, web frontend on each node.

- Any ideas on taking backups? I have set up master-slave replication to one of the boxes, so that is the only DB backup I have. I hope that's enough.

Percona XtraBackup is a must-have option.

**zabbixfk** · 20-06-2018, 13:29

Thanks for the reply. I am fairly new to this sysadmin scene, so apologies, but can you elaborate on this part?

A 2-node cluster is enough: replicated database, 1 server with fencing, web frontend on each node.

What tech should I be using? I mean, how do I set up this whole thing? Any pointers?

Thanks

Disconnected graphs snmp as well as zabbix-agent , no data received alerts
