How can I diagnose this and see what the http poller processes are getting hung up on?
I've been seeing more of this warning lately. I only have 48 web checks, and the machine's load average doesn't rise when the trigger fires, nor am I seeing slow response times from the remote servers.
I ran a small bash script to monitor the same HTTP sites alongside Zabbix, and it didn't record any timeouts while the Zabbix http poller processes were hung up. (The script isn't causing the problem; the problem started before I tried it.)
Can I get more information about the http poller processes, such as how long each web check takes to complete or which ones are timing out?
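A side check like the one described above might look roughly like the following sketch. It times each URL with curl and logs the HTTP status and total request time, so the entries can be compared against the moments when the poller warning fires. The URL list file, the 15-second timeout, and the function name `check_url` are assumptions for illustration, not details of the actual setup.

```shell
#!/bin/sh
# Sketch: log HTTP status and total request time for each URL.
# TIMEOUT (seconds) is an assumed value; override it via the environment.
TIMEOUT="${TIMEOUT:-15}"

# Print "<timestamp> <url> <http_code> <seconds>" for one URL.
# curl reports http_code 000 when the request fails or times out.
check_url() {
    result=$(curl -s -o /dev/null --max-time "$TIMEOUT" \
                  -w '%{http_code} %{time_total}' "$1")
    printf '%s %s %s\n' "$(date '+%Y-%m-%d %H:%M:%S')" "$1" "$result"
}

# When a file of URLs (one per line) is given as the first argument,
# check each non-empty line once.
if [ "$#" -gt 0 ]; then
    while IFS= read -r url; do
        if [ -n "$url" ]; then
            check_url "$url"
        fi
    done < "$1"
fi
```

Run from cron or a loop with a hypothetical `urls.txt` as the argument, this gives a per-URL timing log outside Zabbix. It only shows what curl sees from the same host; it says nothing about what the http poller processes themselves are doing.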