I am slowly and carefully implementing Zabbix 5.0 LTS. Everything looked OK in the server log until I added our central DC switch (a pair of Cisco Catalyst 6800s in a VSS stack). Since then I keep getting the errors shown in the log below.
I am using bulk requests, and I increased the SNMP timeout to 10 seconds, with no change. Running snmpbulkwalk from the command line never hiccups on anything. My own tool, which fetches thousands of OIDs from the switch, runs fast and without any trouble whatsoever.
So this doesn't look like an issue with the host; it seems to be something on the Zabbix server side. Any ideas what that might be?
Code:
486979:20210922:084611.402 resuming SNMP agent checks on host "strS00": connection restored
486977:20210922:084656.930 SNMP agent item "net.if.status[ifOperStatus.345]" on host "strS00" failed: first network error, wait for 15 seconds
486979:20210922:084712.022 resuming SNMP agent checks on host "strS00": connection restored
486974:20210922:084756.171 SNMP agent item "net.if.status[ifOperStatus.269]" on host "strS00" failed: first network error, wait for 15 seconds
486979:20210922:084811.393 resuming SNMP agent checks on host "strS00": connection restored
486978:20210922:084856.319 SNMP agent item "net.if.in[ifHCInOctets.356]" on host "strS00" failed: first network error, wait for 15 seconds
486979:20210922:084911.838 resuming SNMP agent checks on host "strS00": connection restored
486975:20210922:084956.462 SNMP agent item "net.if.status[ifOperStatus.833]" on host "strS00" failed: first network error, wait for 15 seconds
486979:20210922:085011.220 resuming SNMP agent checks on host "strS00": connection restored
486976:20210922:085056.730 SNMP agent item "net.if.speed[ifHighSpeed.732]" on host "strS00" failed: first network error, wait for 15 seconds
486979:20210922:085111.592 resuming SNMP agent checks on host "strS00": connection restored
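To check whether the failures follow a fixed period, I put together a quick sketch that parses the entries above and prints the time between consecutive "first network error" lines. It assumes each entry has the form PID:YYYYMMDD:HHMMSS.mmm followed by the message, as in this excerpt; the names LOG, parse_line and failure_gaps are my own and not part of Zabbix.

```python
import re
from datetime import datetime

# Log excerpt copied from the server log above, one entry per line.
LOG = '''\
486979:20210922:084611.402 resuming SNMP agent checks on host "strS00": connection restored
486977:20210922:084656.930 SNMP agent item "net.if.status[ifOperStatus.345]" on host "strS00" failed: first network error, wait for 15 seconds
486979:20210922:084712.022 resuming SNMP agent checks on host "strS00": connection restored
486974:20210922:084756.171 SNMP agent item "net.if.status[ifOperStatus.269]" on host "strS00" failed: first network error, wait for 15 seconds
486979:20210922:084811.393 resuming SNMP agent checks on host "strS00": connection restored
486978:20210922:084856.319 SNMP agent item "net.if.in[ifHCInOctets.356]" on host "strS00" failed: first network error, wait for 15 seconds
486979:20210922:084911.838 resuming SNMP agent checks on host "strS00": connection restored
486975:20210922:084956.462 SNMP agent item "net.if.status[ifOperStatus.833]" on host "strS00" failed: first network error, wait for 15 seconds
486979:20210922:085011.220 resuming SNMP agent checks on host "strS00": connection restored
486976:20210922:085056.730 SNMP agent item "net.if.speed[ifHighSpeed.732]" on host "strS00" failed: first network error, wait for 15 seconds
486979:20210922:085111.592 resuming SNMP agent checks on host "strS00": connection restored
'''

# Assumed entry layout: PID:YYYYMMDD:HHMMSS.mmm <message>
LINE_RE = re.compile(r"^(\d+):(\d{8}):(\d{6}\.\d{3}) (.*)$")


def parse_line(line):
    """Return (pid, timestamp, message), or None if the line does not match."""
    match = LINE_RE.match(line.strip())
    if match is None:
        return None
    pid, date, time, message = match.groups()
    timestamp = datetime.strptime(f"{date} {time}", "%Y%m%d %H%M%S.%f")
    return int(pid), timestamp, message


def failure_gaps(log_text):
    """Seconds between consecutive 'first network error' entries."""
    failures = []
    for line in log_text.splitlines():
        parsed = parse_line(line)
        if parsed is not None and "first network error" in parsed[2]:
            failures.append(parsed[1])
    return [(b - a).total_seconds() for a, b in zip(failures, failures[1:])]


if __name__ == "__main__":
    for gap in failure_gaps(LOG):
        print(f"{gap:.3f} s between failures")
```

On this excerpt every gap comes out within about a second of 60 seconds, so the failures look periodic rather than random, even though a different item and a different poller process reports the error each time.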