Hello!
I'm chasing down a rather odd problem with a couple of agents in my environment. For context, we monitor 1000+ hosts without issue, but 2-3 nodes have hit this problem multiple times.
Essentially, the affected agent's DNS entry disappears, the server stops monitoring the agent, and our Slack alerts begin to fail (Error message: "Slack notification failed : Field "host_conn" cannot be empty"). When I view the host under Configuration > Hosts, the DNS name is blank and the IP address has reverted to 127.0.0.1.



The issue can be resolved by manually re-adding the DNS entry and clicking "Update". However, it recurs at random (sometimes weeks, sometimes months after the manual correction).
We do use autoregistration from the agent side; however, we use the same template across all our servers and have not seen this issue anywhere else. For context, here's a truncated version of our agent configuration file (commented lines removed):
Code:
PidFile=/run/zabbix/zabbix_agentd.pid
LogFile=/var/log/zabbix/zabbix_agentd.log
LogFileSize=0
Server=zbxprox1.ams1.ourdomain.com
ServerActive=zbxprox1.ams1.ourdomain.com
Hostname=mtlctrl1.ams1.ourdomain.com
HostnameItem=mtlctrl1.ams1.ourdomain.com
HostMetadata=AMS1-Linux-Server
Include=/etc/zabbix/zabbix_agentd.d/*.conf
Include=/etc/zabbix/zabbix_agentd.conf.d/*.conf
TLSConnect=psk
TLSAccept=psk
TLSPSKIdentity=mtlctrl1.ams1.ourdomain.com
TLSPSKFile=/etc/zabbix/zabbix_agentd.psk
Any help would be greatly appreciated to try and figure out what's causing the DNS entry of our hosts to disappear.
Thanks!