Ad Widget

Collapse

Hosts stay unavailable after downtime since upgrade to Zabbix 5.4.0

Collapse
X
 
  • Time
  • Show
Clear All
new posts
  • bclaeys
    Junior Member
    • Aug 2021
    • 10

    #1

    Hosts stay unavailable after downtime since upgrade to Zabbix 5.4.0

    -- SOLVED IN ZABBIX 5.4.5 ---

    Hi,

    I recently upgraded a Zabbix environment running in docker containers (centos - mysql) from 5.2.2 to 5.4.0 (and 5.4.3).

    Since the upgrade, I have issues with hosts that are shutdown at night.

    When the hosts are started again, They are marked as unavailable in the gui, with errors like "interrupted system call zabbix/No route to host":

    Click image for larger version  Name:	2021-08-17_11-58-18.png Views:	30 Size:	15.8 KB ID:	429905

    the agents connect to the zabbix server via a zabbix proxy. From within the proxy container, I can ping and telnet the monitored host.

    When I perform a test execution of the zabbix agent ping check, I get result 1 (available), and from the moment I click "Execute now" on this check, the host becomes available again. This is also visible in the zabbix proxy logs.

    So even though the host is reachable, Zabbix somehow doesn't figure it out automatically since the upgrade.

    The issue happens randomly, as some of the hosts that go down at night don't have this issue.

    Anyone else with the same issue, or any ideas how to resolve this?

    Thanks in advance!

    EDIT: I performed some more tests and the problem seems to be the Zabbix proxy. When the issue happens, a zabbix proxy restart resolves it.
    All Unreachable/unavailable parameters are set to the default values, but as this happens randomly, it doesn't seem to be a configuration problem.
    # UnavailableDelay=60
    # UnreachablePeriod=45
    # UnreachableDelay=15

    EDIT 2 : I upgraded the containers from 5.4.0 to 5.4.3 (with ubuntu image), issue still persists. Zabbix proxy doesn't notice the servers, restart of the proxy resolves it.
    Last edited by bclaeys; 13-10-2021, 16:42.
Working...