Hi,
I have a small zabbix installation monitoring some Linux computers (about 25). I have a variety of Linux distros and versions. I am experiencing a problem with ssh reporting. On 3 of my computers ssh is shown in zabbix as being down. The message is:
SSH server is down on [computer name]
This is a message from the service trigger expression:
{net.tcp.service[ssh].last(0)}=0
In each case the corresponding process trigger is not being activated:
{proc.num[sshd].last(0)}<1
and an investigation on each of the 3 computers shows sshd running and each computer will accept ssh connections correctly. I have no other false positives on these or any other of my Linux computers.
1. Has anyone seen similar behavior and found a solution?
2. Does anyone know exactly what the net.tcp.service check is looking for exactly? Does it look at the port and if so, what does it look for in response? Does it look for the service running and grep out ssh using the service --status-all command?
So far I've tried:
-- compiling the agent natively on the 3 malfunctioning computers
-- comparing OS versions and settings
-- turning off iptables and SELinux
Thanks in advance.
I have a small zabbix installation monitoring some Linux computers (about 25). I have a variety of Linux distros and versions. I am experiencing a problem with ssh reporting. On 3 of my computers ssh is shown in zabbix as being down. The message is:
SSH server is down on [computer name]
This is a message from the service trigger expression:
{net.tcp.service[ssh].last(0)}=0
In each case the corresponding process trigger is not being activated:
{proc.num[sshd].last(0)}<1
and an investigation on each of the 3 computers shows sshd running and each computer will accept ssh connections correctly. I have no other false positives on these or any other of my Linux computers.
1. Has anyone seen similar behavior and found a solution?
2. Does anyone know exactly what the net.tcp.service check is looking for exactly? Does it look at the port and if so, what does it look for in response? Does it look for the service running and grep out ssh using the service --status-all command?
So far I've tried:
-- compiling the agent natively on the 3 malfunctioning computers
-- comparing OS versions and settings
-- turning off iptables and SELinux
Thanks in advance.
Comment