We currently have a new setup with the following:
The problem that we have is the Zabbix Values Per Second seems to take a nose-dive periodically (we average around ~1000 or so and this drops <20 sometimes). I've also noticed the SNMP polling last checks don't seem to run as frequently (proxy02 appears to be the most impacted). When I go in to "Administration -> Queue -> Queue overview" I can see a large number of values (thousands) in the 1min, 5min and more than 10mins for SNMP agent. When I go in to "Administration -> Queue -> Queue overview by proxy" I can see proxy02 has over 2,700 in more than 10mins. Any ideas on what might be happening? So far, the only devices and SNMP templates we're using on proxy02 are:
- 2x Zabbix Proxy servers (in different regions polling their own devices). Lets call these proxy01 and proxy02.
- 1x Zabbix Server centralised to collect the values from both proxy servers.
Code:
StartPollersUnreachable=8 StartPingers=8 StartDiscoverers=8 CacheSize=1G Timeout=30
- Cisco Catalyst 9500
- Cisco BGP SNMP
- Cisco IOS SNMP
- Cisco Catalyst 9300
- Cisco IOS SNMP
- Cisco Catalyst 9200
- Cisco IOS SNMP
- Cisco Nexus 9K
- Cisco Nexus 9000 Series SNMP
- Cisco ASA
- Cisco ASAv SNMP (not actually working atm)
- Proxy01 and the Zabbix server are in the same data center.
- Proxy02 and the Zabbix server are ~200ms apart from each other.
Comment