I have a user parameter that triggers a shell script on the monitored machine. The script builds a file of multiple MySQL server data values and then sends it to the Zabbix server via zabbix_sender. The user parameter then returns the current result of SHOW PROCESSLIST to the item that called it.
So I have a Zabbix agent item, which is a single user parameter, and roughly a dozen Zabbix trapper items that receive the results from zabbix_sender. This has worked well for the first eight servers I set up. All of the monitored machines run OS X, and all agents and the server are version 1.6. Every monitored machine is set up identically. The Zabbix server runs Ubuntu.
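For illustration only, the setup described above can be sketched roughly as follows. This is not the actual script: the item keys, the host name `db-host-01`, the temp file path, and the `run` argument are hypothetical. The zabbix_sender input-file layout (host, key, value per line) is an assumption, as is reducing the SHOW PROCESSLIST result to a row count.

```shell
#!/bin/sh
# Illustrative sketch only; names, keys, and paths below are hypothetical.
# Assumed agent config line (hypothetical key and path):
#   UserParameter=mysql.collect,/usr/local/bin/mysql_collect.sh run

ZABBIX_SERVER="${ZABBIX_SERVER:-zabbix.xxx.lan}"  # server name as shown in the log
ZABBIX_PORT="${ZABBIX_PORT:-10051}"               # trapper port as shown in the log
MONITORED_HOST="${MONITORED_HOST:-db-host-01}"    # hypothetical host name in Zabbix

# Emit one input line for zabbix_sender: <hostname> <key> <value> (assumed layout).
sender_line() {
    printf '%s %s %s\n' "$MONITORED_HOST" "$1" "$2"
}

# Turn "name<TAB>value" rows on stdin into sender input lines.
build_sender_file() {
    while IFS="$(printf '\t')" read -r name value; do
        sender_line "mysql.status[$name]" "$value"
    done
}

collect_and_send() {
    datafile="$(mktemp /tmp/zbx_mysql.XXXXXX)" || return 1
    mysql -N -B -e 'SHOW GLOBAL STATUS' | build_sender_file > "$datafile"
    # Push all trapper values in one call; send its output to stderr so it
    # does not end up in the value returned to the agent item.
    zabbix_sender -vv -z "$ZABBIX_SERVER" -p "$ZABBIX_PORT" -i "$datafile" >&2
    rm -f "$datafile"
    # Value returned to the calling item (assumption: a row count).
    mysql -N -B -e 'SHOW PROCESSLIST' | wc -l
}

# Only contact MySQL and the Zabbix server when explicitly asked to.
if [ "${1:-}" = "run" ]; then
    collect_and_send
fi
```

Keeping the file-building step in its own function means it can be checked with sample input without touching MySQL or the Zabbix server.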
The two most recent monitored machines show an odd error. Initially they work fine and write this to the zabbix_agentd log:
Code:
zabbix_sender [61090]: DEBUG: Answer [{
"response":"success",
"info":"Processed 14 Failed 0 Total 14 Seconds spent 0.042351"}]
During this time the data is received by the server. The Zabbix agent item is checked every fifteen minutes (causing the script to run each time), and after an hour or so I begin to see this in the logs:
Code:
zabbix_sender [64637]: DEBUG: Send value error: Cannot connect to [zabbix.xxx.lan:10051] [Permission denied]
And none of the trappers receive data. Nothing has changed on the server or the monitored machine, and nothing has been restarted. At this point the script sees this output from the command it runs:
Code:
Sending failed. Use option -vv for more detailed output.
and that's the case even though zabbix_sender is called with the -vv flag. At this point I can manually run (as the zabbix user) on the monitored machine the exact zabbix_sender command that the script runs, and it works fine. If I use zabbix_get from the server to fetch the user parameter (and thus trigger the script), it fails.
Restarting the agent fixes this, but only for an hour. And again, it's only on two machines; the other eight have had no problems.
Any ideas? Thanks a lot.