Ad Widget

**rts** · 19-10-2007, 11:38

We've got similar problems. Certain servers are rotated out of production to do processing tasks, and we know they're going to be heavily loaded. However, Zabibx reports them as being offline, when actually when we know they're still online, just a bit busy. What's the solution here?

**oliverm** · 20-10-2007, 20:47

Same here. We have a client with a ropy old web server. A couple of times a day they have it do something (zip up logs before downloading, unzip some large client upload) and the alerts go off.

Problem is, we have learnt to ignore them, which is worse that acting on them. I'd love to find a solution.

Olly

**gryphius** · 22-10-2007, 17:49

we have learnt to ignore them

As I can't actually ignore my mobile ringing at 4 am I have increased the timeout until zabbix sends a sms. But this is just a workaround I'm not really happy with. Anyone got a solution that solves the real problem, eg. zabbix not responding?

**oliverm** · 22-10-2007, 18:09

Thinking wildly here. If the server is over loaded just before it stops responding, could you perhaps set a dependancy on the alert/trigger so that it checks to see the last value of the CPU usage ?

**nelsonab** · 22-10-2007, 21:10

Try changing the "nice" level for the agent process so it has higher priority than other other processes. This may improve the changes that the agent will receive a slice of the cpu when the Zabbix server tries to connect with the agent. I don't think the agent is able to set it's nice level as of yet so it will have to be done manually. I haven't played with nice much lately and my brain is a little fuzzy so a quick education about renice would be in order before you use it. :-) Top can also change the nice level, but don't set it too low or your computer may be unresponsive.

**rts** · 05-11-2007, 11:52

One solution

I've managed to stop receiving alerts by changing the UnreachablePeriod in zabbix_server.conf from 45 seconds to 180 seconds. My understanding of this parameter is that the host is demed unreachable if no items are returned within this time period.

If my understanding is wrong, then I'd love someone to correct me. However, it does seem to have the desired effect.

Ad Widget

Agent not responding when host busy?

Agent not responding when host busy?

Comment

Comment

Comment

Comment

Comment

Comment