Hi all,
I recently created/installed a Gentoo-based Zabbix server on AWS (v4.4); it is described as an "Appliance" (not sure how that differs from a regular Zabbix server install). I have added a couple of Hosts (Agents) to the machine we want to monitor, and ran across an issue with the most recently added one:
• The Host 'Availability' indicator (Configuration -> Hosts -> 'Availability' column), for a new Host I set up yesterday shows that it is red, with an error = "Get value from agent failed: cannot connect to [[support.adatasol.com]:10050]: [4] Interrupted system call". But I appear to be getting valid current data from this Host.
There are 2 passive checks, and 1 active check (Log file) configured for this Host. This Host SHOULD be configured the same as another host I recently set up, which is showing a green indicator. (see screenshot) But there is a fare bit of manual configuration going on - I could have missed something.
Configuration -> Hosts -> 'Availabilty' column:

From Monitoring -> Latest Data (showing both Hosts); this data does refresh every minute. One difference - the Green Host has information for the 'Disk space' check, whereas the Red Host does not. (I don't know why the Diskspace check might be failing - the other passive checks are working.) And the 'Latest Data' for the Green Host looks nearly identical to the data from the Red Host:

The active check is supposed to communicate over port 10051 to the Agent, right? It doesn't appear that the Agent is listening on that port. But when I do a port scan of the Host that is showing a green indicator, it shows the same results (10050 = Zabbix Agent, 10051 = (no result)). Doing a port scan (to ensure that firewall openings are working) does only show 1 of the two ports (this is the same result for both Hosts):

So what should I look into to figure out why this indicator is red? How do I determine if the monitors are really working correctly?
Thanks,
J
I recently created/installed a Gentoo-based Zabbix server on AWS (v4.4); it is described as an "Appliance" (not sure how that differs from a regular Zabbix server install). I have added a couple of Hosts (Agents) to the machine we want to monitor, and ran across an issue with the most recently added one:
• The Host 'Availability' indicator (Configuration -> Hosts -> 'Availability' column), for a new Host I set up yesterday shows that it is red, with an error = "Get value from agent failed: cannot connect to [[support.adatasol.com]:10050]: [4] Interrupted system call". But I appear to be getting valid current data from this Host.
There are 2 passive checks, and 1 active check (Log file) configured for this Host. This Host SHOULD be configured the same as another host I recently set up, which is showing a green indicator. (see screenshot) But there is a fare bit of manual configuration going on - I could have missed something.
Configuration -> Hosts -> 'Availabilty' column:
From Monitoring -> Latest Data (showing both Hosts); this data does refresh every minute. One difference - the Green Host has information for the 'Disk space' check, whereas the Red Host does not. (I don't know why the Diskspace check might be failing - the other passive checks are working.) And the 'Latest Data' for the Green Host looks nearly identical to the data from the Red Host:
The active check is supposed to communicate over port 10051 to the Agent, right? It doesn't appear that the Agent is listening on that port. But when I do a port scan of the Host that is showing a green indicator, it shows the same results (10050 = Zabbix Agent, 10051 = (no result)). Doing a port scan (to ensure that firewall openings are working) does only show 1 of the two ports (this is the same result for both Hosts):
So what should I look into to figure out why this indicator is red? How do I determine if the monitors are really working correctly?
Thanks,
J
(My server is installed on Gentoo, using a Zabbix 4.4 Appliance AMI from COG.)
Comment