**filipp.sudanov** · 17-03-2015, 17:19

Try something like

Code:

strace -s 256 -p <PID of "active checks" agent's process> -tdt

to understand what the agent is doing.

Also capture the agent's exchange with the server using tcpdump. It's possible that some TCP packets are getting lost at a firewall; in that case the agent has quite a long timeout before it tries to reconnect.
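For the tcpdump part, something along these lines might be a starting point. It is only a sketch: `zabbix_capture_filter` is a made-up helper name, 192.0.2.10 is a placeholder for the server's address, and 10051 is assumed to be the server port (the Zabbix default), so adjust all three to your setup.

```shell
# Build a capture filter for traffic between this host and the Zabbix server.
# $1 = server IP (placeholder in the example below), $2 = server port (assumed default 10051).
zabbix_capture_filter() {
    printf 'host %s and tcp port %s' "$1" "${2:-10051}"
}

# Example (needs root); writes full packets to a pcap file for later inspection:
#   sudo tcpdump -i any -nn -s 0 -w /tmp/zabbix-agent.pcap "$(zabbix_capture_filter 192.0.2.10)"
```

Writing to a file with -w keeps the capture cheap enough to leave running until the problem shows up again; the file can then be opened in Wireshark or read back with tcpdump -r.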

**gleepwurp** · 17-03-2015, 17:24

Hi Filipp,

Thanks for the suggestions... strace is something I've never used before, and I didn't even think of checking with tcpdump...

I'll try those the next time I get the problem and we'll see what's going on...

Thanks again!

Gleepwurp.

**gleepwurp** · 18-03-2015, 15:48

OK, it happened again this morning, and I ran strace on the zabbix_agentd active checks process.

Indeed, the active checks process seems to have hung:

Code:

sudo strace -s 256 -p 28327 -tdt
Password: 
Process 28327 attached - interrupt to quit
 [wait(0x137f) = 28327]
pid 28327 stopped, [SIGSTOP]
 [wait(0x57f) = 28327]
pid 28327 stopped, [SIGTRAP]
09:39:56.009157 read(5,

As soon as I increase the log level (-R log_level_increase), the whole thing "unlocks" and resumes normal processing.

Please note that this issue always seems to occur after a long period with a large queue of items waiting more than 10 minutes (around 80,000 items), during which the Zabbix server seems to stop accepting network connections... When the Zabbix server comes back online, some agents are locked in this "active checks hung" state.
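The trace above stops at an unfinished read(5, which means the process is blocked reading from file descriptor 5. A quick way to see what that descriptor actually is, assuming a Linux host with /proc mounted (`fd_target` is just a hypothetical helper name for illustration):

```shell
# fd_target PID FD: print what the given descriptor of the given process points to,
# e.g. a file path or something like "socket:[12345]".
fd_target() {
    readlink "/proc/$1/fd/$2"
}

# With the PID and descriptor from the trace (root is needed for another user's process):
#   sudo readlink /proc/28327/fd/5
```

If this prints a socket, the agent is most likely waiting on a network connection, which would fit the idea of a connection to the server that never got closed properly.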

G.

**filipp.sudanov** · 18-03-2015, 16:55

It would be good to log the agent's behaviour starting from the moment _before_ the problem happens, by setting the agent's log level to 4 from the beginning (plus strace and tcpdump).
But as I understand it, this happens randomly? Is there any way to replicate it?

**gleepwurp** · 18-03-2015, 17:12

Indeed!

No... this can work fine for a couple of days and then happen again...

I've left log level 4 on for the Zabbix active checks process only (-R log_level_increase=<active pid>), so maybe I'll see what happens in the logs next time...

Thanks for the feedback!

G.

Weird Zabbix Agent freezing...
