**insider** · 14-06-2013, 16:43

2x120GB SSD

That will not be sufficient for a big environment, unless you keep no history at all.
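As a rough illustration of why small SSDs fill up quickly, here is a back-of-the-envelope sketch of history table growth. The 90 bytes per stored value is an assumed ballpark rather than a measured figure, and the NVPS and retention values are only examples; measure your own database before sizing anything.

```python
SECONDS_PER_DAY = 86_400


def history_size_gb(nvps: float, retention_days: float,
                    bytes_per_value: float = 90.0) -> float:
    """Rough size of the history tables in GB (10^9 bytes).

    nvps            -- new values per second written by Zabbix
    retention_days  -- how long history is kept
    bytes_per_value -- ASSUMED average row cost including indexes
    """
    values_stored = nvps * SECONDS_PER_DAY * retention_days
    return values_stored * bytes_per_value / 1e9


def max_retention_days(nvps: float, capacity_gb: float,
                       bytes_per_value: float = 90.0) -> float:
    """How many days of history fit into capacity_gb under the same assumption."""
    bytes_per_day = nvps * SECONDS_PER_DAY * bytes_per_value
    return capacity_gb * 1e9 / bytes_per_day


if __name__ == "__main__":
    # Example values only; none of these come from a real installation.
    for nvps in (200, 1000, 5000):
        size = history_size_gb(nvps, retention_days=90)
        days = max_retention_days(nvps, capacity_gb=120)
        print(f"{nvps} NVPS: ~{size:.0f} GB for 90 days, "
              f"~{days:.0f} days fit in 120 GB")
```

This ignores trends, events, and filesystem overhead, so real usage will be higher.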

MySQL replication based on DRBD

In my opinion, DRBD is too slow to cope with high I/O.
I think it's better to use master-master replication with active/passive load balancing. I'm looking at Pacemaker + Corosync.

**trikke76** · 29-07-2013, 09:40

SOFT RAID1

In my opinion, the only reason to use software RAID is for compatibility afterwards.

Switch to hardware RAID; it performs much better.

**mushero** · 22-08-2013, 17:51

Agreed on both:

- Use hardware RAID, always, with battery-backed cache, like Dell PERC

- Use fast disks like 15K SAS; SSDs are nice but small if you want to keep data, as noted

- DRBD is too slow in most configs; we have a customer now with 25-50ms update latency, which is deadly for DBs; use master-slave instead

- RAM is cheap, buy more, at least 64GB in 8x8 or 4x16GB so you can add more.
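
To put the 25-50ms update latency in perspective, here is a small sketch of the ceiling it puts on write throughput. It assumes the worst case where every commit waits for one synchronously replicated write and commits are fully serialized; real databases overlap and group commits, so treat this as a pessimistic bound and not a benchmark.

```python
def max_serial_commits_per_sec(latency_ms: float) -> float:
    """Upper bound on commits/s when each commit blocks for latency_ms.

    ASSUMPTION: one synchronous replicated write per commit, no overlap
    between commits (single writer, no group commit).
    """
    if latency_ms <= 0:
        raise ValueError("latency_ms must be positive")
    return 1000.0 / latency_ms


if __name__ == "__main__":
    for latency in (25, 50):
        rate = max_serial_commits_per_sec(latency)
        print(f"{latency} ms per update -> at most {rate:.0f} serialized commits/s")
```

Under those assumptions the ceiling is a few dozen commits per second, which is why latency in that range hurts a write-heavy database so much.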

A Dell R420 with 64GB of RAM, a PERC, and 4x600GB 15K SAS in RAID10 for 1.2TB of fast space is a nice starting point for a DB, with 2x6-core CPUs. Or go bigger: an R720 with 128GB and 6-8 x 600GB disks in RAID10.
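
The usable space for those layouts follows from RAID10 mirroring every disk, so half the raw capacity remains. A minimal sketch of that arithmetic (the function name is made up for illustration, and formatted capacity will be somewhat lower in practice):

```python
def raid10_usable_gb(disks: int, disk_gb: float) -> float:
    """Usable capacity of a RAID10 array: half the raw capacity.

    RAID10 needs an even number of disks, at least four.
    """
    if disks < 4 or disks % 2 != 0:
        raise ValueError("RAID10 needs an even number of disks, at least 4")
    return disks * disk_gb / 2


if __name__ == "__main__":
    for disks in (4, 6, 8):
        usable = raid10_usable_gb(disks, 600)
        print(f"{disks} x 600GB in RAID10 -> {usable / 1000:.1f} TB usable")
```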

**Vaku** · 20-09-2019, 13:38

Originally posted by insider

That will not be sufficient for a big environment, unless you keep no history at all.

In my opinion, DRBD is too slow to cope with high I/O.
I think it's better to use master-master replication with active/passive load balancing. I'm looking at Pacemaker + Corosync.

DRBD is much faster than MySQL.
1) DRBD works in kernel space and syncs only the data blocks that were changed. This data has already been processed by MySQL, so you are synchronizing the result.
2) MySQL does a lot of things in userspace, which is already much slower, and it uses transactions, hence data processing and double work, which adds overhead and latency.

The I/O impact is mostly caused by using InnoDB, which is not suitable for the Zabbix type of workload.
Consider switching from InnoDB to NoSQL for the history and trend tables. This will significantly reduce I/O overhead.
DRBD is most efficient with direct replication: join the two servers with a cross-over cable on a dedicated network interface to avoid traffic routing and interference.
Then this should be good to go.

**Vaku** · 20-09-2019, 13:41

Originally posted by insider

That will not be sufficient for a big environment, unless you keep no history at all.

In my opinion, DRBD is too slow to cope with high I/O.
I think it's better to use master-master replication with active/passive load balancing. I'm looking at Pacemaker + Corosync.

Master-master replication is not stable for a Zabbix workload.
You will eventually experience cluster-wide deadlocks and serious downtime, which defeats the whole purpose of HA.

**Markku** · 25-01-2020, 13:58

Originally posted by Vaku

Master-master replication is not stable for a Zabbix workload.
You will eventually experience cluster-wide deadlocks and serious downtime, which defeats the whole purpose of HA.

Hi Vaku, can you tell me more about this? I've run a 40+GB Zabbix database with 200+ NVPS in MariaDB master-master replication with no problems. Are there specific scale or usage scenarios that you are thinking about? I don't have any clients connecting to the other master, however.

Markku

**Vaku** · 17-02-2020, 11:49

Originally posted by Markku

Hi Vaku, can you tell me more about this? I've run a 40+GB Zabbix database with 200+ NVPS in MariaDB master-master replication with no problems. Are there specific scale or usage scenarios that you are thinking about? I don't have any clients connecting to the other master, however.

Markku

Hi, that may depend on the environment. It may well be OK to run on bare metal.
But a default installation of MariaDB with Galera master-master replication on a cluster of two VMware VMs cannot handle a load of 200 NVPS without special configuration and DB tweaking.
In production it often results in unexpected, nasty cluster-wide deadlocks, which eliminate HA. This can be seen in many threads on the Percona forum, where people are frustrated by sudden cluster-wide deadlocks and get no useful response from Percona.
We experienced the same problems. We had an HA master-master Galera replication setup with ProxySQL separating reads and writes, and it was a nightmare having to fix the Zabbix "HA" cluster a couple of times every month.
The default InnoDB storage engine is extremely inefficient for storing and accessing Zabbix-type data, which is time-based.
So for it to run stably under a high load such as 200+ NVPS without extra hardware resources, one option is to use the RocksDB storage engine within MariaDB, or to move to ClickHouse.
That is the reason Zabbix has finally enabled support for ClickHouse.

HA, scalability & security: best practices