We've been using Zabbix for a while to monitor all sorts of things, and generally it works really well. Lately, though, about once a day it starts falling way behind: the values in the Administration -> Queue screen climb into the hundreds, and all of our hosts start showing up as dead.
We are monitoring ~150 hosts and ~10k items, and the required server performance value on the dashboard is ~130 new values per second. Most of the time everything works great.
I do not believe it is a hardware issue, as I have two reasonably specced boxes in the mix: the Zabbix server (v1.8.5) runs on one and the database (PostgreSQL 9.0) on the other. Each is a bare-metal box with 8 cores, 12 GB RAM, and a 4-disk 15k RPM RAID 10, running Ubuntu 10.04 LTS.
Generally these boxes run a load average of only 1-2, use a couple of GB of RAM, and are certainly not taxing the disks (going by vmstat and iostat -dx output). I rarely see iowait above 3-4% and it is usually 0.
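In case it matters, this is roughly how I'm sampling disk activity during an incident (just a couple of terminals left running, nothing fancy):

iostat -dx 5
vmstat 5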
When these queue backlogs happen, nothing looks different on the servers: no jump in CPU or memory, and the DB looks perfectly healthy, with no long-running queries present. The backlog does eventually clear on its own, but in the meantime we go 20-30 minutes with no monitoring or, even worse, with lots of false alarms going off.
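For what it's worth, this is roughly the check I run against pg_stat_activity when a backlog hits (column names per the PostgreSQL 9.0 catalog, where idle backends show current_query = '<IDLE>'):

SELECT procpid, now() - query_start AS runtime, current_query
FROM pg_stat_activity
WHERE current_query <> '<IDLE>'
ORDER BY runtime DESC;

Nothing long-running ever turns up while the queue is backed up.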
On the server running zabbix-server I have these settings in zabbix_server.conf:
CacheSize=1024M
HistoryCacheSize=1024M
HistoryTextCacheSize=1024M
TrendCacheSize=1024M
StartPollers=100
StartTrappers=100
Right now all of our items are of the passive variety, and I have no proxies or zabbix_sender setups in place. Is this simply too many items for a single server to collect? The fact that the hardware itself isn't being taxed makes me hope this isn't the case. Is there some configuration I could look into adjusting that might help?
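The only other knobs I've been eyeing are the history syncers and the timeout; if I'm reading the 1.8 documentation right, something along these lines is what I'd try next (not applied yet, so treat this as a guess on my part rather than anything I know works):

StartDBSyncers=8
Timeout=10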
I've tried to keep our configs very clean, and the common templates shared by many hosts are fairly lean, with update intervals generally of 60s or more. The only messages I see in the Zabbix server logs look harmless - occasional curl timeouts from web scripts, unsupported items that I haven't yet cleaned up, etc.
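One sanity check on the numbers: if my math is right, ~10k items at mostly 60-second intervals works out to at most ~167 new values per second, which lines up with the ~130 figure on the dashboard, so the raw volume doesn't strike me as outrageous for a single server.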
Any help is greatly appreciated.