Ad Widget

**tchjts1** · 26-07-2014, 13:34

On Zabbix server, in zabbix_server.conf, try increasing your Timeout= value to 30 and restart your Zabbix server process. I would do the same for one of your hosts in zabbix_agentd.conf and restart the agent. See if you start getting solid data for that host.

Next, I would look at this post, at the last paragraph that describes how to check your Zabbix internal processes. Maybe something is overloaded. If you post your graphs similar as mine in this link, please post a 24 hour view.

We’ll be back soon!

https://www.zabbix.com/forum/showthread.php?t=41219

**myth0s** · 28-07-2014, 19:56

I deleted all my hosts so they would reconnect with only the "Linux active" template. At first, they had connected to Zabbix before I had created the "Linux active" template so I had to unlink them from the first template and relink them to the Active one.

The server graph had no data (my Zabbix server in the dropdown was red), so I rebooted the server.

Finally, even though the server CPU and memory was fine, Zabbix reported as much as 24% iowait. So I decided to switch to a machine with more IOPS.

All in all, the queue is still pretty high (some hosts in 1+minutes, but only one in 10+minutes), but it is better and data is coming in.

Also, since the hosts are all reporting within 5 minutes, the host unreachable alert is gone for all but one host.

Thank you for the assistance

**tchjts1** · 29-07-2014, 00:04

Originally posted by myth0s

Finally, even though the server CPU and memory was fine, Zabbix reported as much as 24% iowait. So I decided to switch to a machine with more IOPS.

Regarding your high iowait, may be worth seeing this post: https://www.zabbix.com/forum/showthread.php?t=38575

I know you switched machines, but that may still be worth noting.

**myth0s** · 06-08-2014, 20:08

Let me do a follow-up in case anyone ends up here with Google's help.

I think we managed to solve the issue, and to do so we optimized/change some things:

We changed from MySQL MyISAM to MySQL InnoDB
We changed the MySQL storage mount to have the noatime flag
We disabled Agent ping, Host name and Version of zabbix_agent in the Template App Zabbix Agent Active (hoping that it would solve the "host unreachable" issue
We augmented the number of Unreachable poller from 1 to 15 (the Unreachable poller showed up busy 100% on the graph)
We also reviewed the discovery rule of our "Active" template. Turns out our default template was cloned from Template OS Linux. The items' type was changed from "Zabbix agent" to "Zabbix agent (active)", but the Discovery Rules and Item prototypes haven't. (Probably the actual cause of all our Host unreachable)
Finally we deleted all the hosts from Zabbix to "refresh" everything. Upon startup, the "Queue of items to be update" changed from the previous 600+ items to a small 50-100ish. And they were processed quite promptly (we believe this queue to be erroneous and caused by ZBX-8488)

The Administration --> Queue - overview table now shows most of our items under the 5 minutes delay.

Ad Widget

Sparse data & big queue with active agents only

Sparse data & big queue with active agents only

Comment

Comment

Comment

Comment