Scaling zabbix storage engine beyond SQL with Handlersocket

fmrapid

Member

Joined: Aug 2010

Posts: 43
#1

Scaling zabbix storage engine beyond SQL with Handlersocket

22-04-2011, 05:31

Scaling zabbix performance

There are many topics to be considered when scaling a time-series database beyond current possibilities. Zabbix currently faces issues with three points, and the current post will focus on one of these, back-end scalability.

Issue 1 : By reducing the number of NVPS from agents
Sending agent data back only on exception with a dead band. See ZBXNEXT-113 and references to Ganglia.

Issue - 2 : By increasing back-end scalability
Using NoSQL methods for storing and reading time-series data.

Issue - 3 : By increasing efficiency of the storage algorithm
See references
Swinging Door
Derivatime time-series Segment Approximation(DSA). "Effective and efficient similarity search in time series"
Comparison of compression algorithms http://www.castdiv.org/archive/data_compression.pdf
Use a single efficient algorithm for compression for the trend tables. Instead of the gross approximation that is one hour min/max/avg.

=======

Todays topic: Increasing scalability

Using a NosQL method:

Handlersocket is a MySQL plugin that permits the use of NoSQL read and write methods against the mature InnoDB storage engine to multiply the number of queries that can actual be run against a non I/O bound database. If the database does not fit in memory, Handlersocket offers no benefit, which is a bummer for Zabbix, as the History and Trends databases grow easily in the hundreds of Gigabytes.

References:

HandlerSocket: The NoSQL MySQL & Ruby - igvita.com

http://www.igvita.com/2011/01/14/handlersocket-the-nosql-mysql-ruby/

Using MySQL as a NoSQL - A story for exceeding 750,000 qps on a commodity server

http://yoshinorimatsunobu.blogspot.com/2010/10/using-mysql-as-nosql-story-for.html

UPDATE: Oracle officially released memcached daemon plugin that talks with InnoDB. I'm glad to see that NoSQL+MySQL has become an official ...

Use of a NoSQL database for storage and retrieval of time-series data, high number of queries per second and also benefits from inherent HA and replication for massive data sets. Now we are talking.
References:
TokioCabinet, Hbase and other NoSQL databases that are geared to Time-Series data and that do not require extensive programming modifications.

Strategy for Zabbix:

Using NoSQL database for the History and Trends table; The trends table could now make use of a more efficient compression algorithm that can be allowed to take up more space but provide much better data accuracy.

Keep the SQL engine for all other tables.

What do you think Alexei.

You mentioned that Zabbix 2.x would look into NoSQL, is that still in the cards?

Cheers

Tomatos-for-Dollars

Last edited by fmrapid; 22-04-2011, 14:34.
Tags: dsa, handlersocket, nosql, sql
nelsonab

Senior Member

Joined: Sep 2006

Posts: 1233
#2

28-04-2011, 05:54

I think a fork is the only way we might see some community development on this. If this is a goal internally for Zabbix SIA, I don't think we'll hear about it for a while.

If someone is interested in heading up this fork, I'd be interested in helping.

I would however want the fork to be merged back into the Zabbix trunk at some point in the future though.

RHCE, author of zbxapi
Ansible, the missing piece (Zabconf 2017): https://www.youtube.com/watch?v=R5T9NidjjDE
Zabbix and SNMP on Linux (Zabconf 2015): https://www.youtube.com/watch?v=98PEHpLFVHM
Comment

Ad Widget

Scaling zabbix storage engine beyond SQL with Handlersocket

Scaling zabbix storage engine beyond SQL with Handlersocket

Comment