Hello everyone.
Zabbix server 2.2.5 on debian 7.6 i386 from precompiled deb package.
I' m trying to configure snmp monitoring of my switches. Some switches that have snmp v3 and are configured to use it - work partially.
Here's my particular problem.
I have Cisco SG300-52 switch configured:
I took care of engineID so it's unique in my administrative domain + clock is synced with the same ntp server as zabbix server.
I made a host on zabbix server and attached the following template to it:
sg300-52_zabbix_template.xml
I'm able to ping, snmpget, snmpwalk, snmpbulkget, snmpbulkwalk from zabbix server to switch and everything works flawlessly. Btw
But when it comes to zabbix i get the following result:
and graph

This graph shows that sometimes zabbix gets snmp values, but most of the time it does not. there are even some periods of several hours when zabbix can actually draw continious graphs that last for 1 or 2 hours, but then everything stops. I consider this behaviour as not a configuration error but problem with zabbix itself. I even had a problem when zabbix didnt send community string on snmpv1 swithes and i had to remove and insert host back to resolve this problem. I have some other snmpv3 switches that work "normally" but in some rare cases they get same errors as well.
This particular switch that i show you as example of problem worked for a week or so and then suddenly i started getting those errors.
What i tryed to do:
I changed engineID and recreated users - this helped for 15 minutes.
I removed host and inserted it back again - this helped for 5-10 minutes
I powered off all other snmpv3 swithes - nothing
I played with pollers quantity - nothing, only encreased cpuload
I dont think its server performance issue
I dont know what to do.
Zabbix server 2.2.5 on debian 7.6 i386 from precompiled deb package.
I' m trying to configure snmp monitoring of my switches. Some switches that have snmp v3 and are configured to use it - work partially.
Here's my particular problem.
I have Cisco SG300-52 switch configured:
Code:
...... snmp-server server snmp-server engineID local 800000090368bc0c7a27dd snmp-server location mainOffice snmp-server view ZABBIX iso included snmp-server group ZABBIX v3 priv read ZABBIX encrypted snmp-server user zabbix ZABBIX v3 auth sha ******encpasshere*** priv ******encpasshere*** ........
I made a host on zabbix server and attached the following template to it:
sg300-52_zabbix_template.xml
I'm able to ping, snmpget, snmpwalk, snmpbulkget, snmpbulkwalk from zabbix server to switch and everything works flawlessly. Btw
Code:
root@zabbixserver01:~# dpkg -l snmp ............ ii snmp 5.4.3~dfsg-2.8 i386 SNMP (Simple Network Management Protocol) applications .......
Code:
6064:20140826:121638.108 SNMP agent item "ifOutDiscards.[86]" on host "sw-cisco-sg300-52-sob" failed: first network error, wait for 15 seconds 6069:20140826:121654.127 resuming SNMP agent checks on host "sw-cisco-sg300-52-sob": connection restored 6065:20140826:121707.804 SNMP agent item "ifOutDiscards.[80]" on host "sw-cisco-sg300-52-sob" failed: first network error, wait for 15 seconds 6069:20140826:121722.295 resuming SNMP agent checks on host "sw-cisco-sg300-52-sob": connection restored 6066:20140826:121737.216 SNMP agent item "ifOperStatus.[69]" on host "sw-cisco-sg300-52-sob" failed: first network error, wait for 15 seconds 6069:20140826:121752.460 resuming SNMP agent checks on host "sw-cisco-sg300-52-sob": connection restored 6065:20140826:121807.390 SNMP agent item "ifOperStatus.[64]" on host "sw-cisco-sg300-52-sob" failed: first network error, wait for 15 seconds 6069:20140826:121822.643 resuming SNMP agent checks on host "sw-cisco-sg300-52-sob": connection restored 6065:20140826:121837.632 SNMP agent item "ifOutDiscards.[93]" on host "sw-cisco-sg300-52-sob" failed: first network error, wait for 15 seconds 6069:20140826:121852.833 resuming SNMP agent checks on host "sw-cisco-sg300-52-sob": connection restored
This graph shows that sometimes zabbix gets snmp values, but most of the time it does not. there are even some periods of several hours when zabbix can actually draw continious graphs that last for 1 or 2 hours, but then everything stops. I consider this behaviour as not a configuration error but problem with zabbix itself. I even had a problem when zabbix didnt send community string on snmpv1 swithes and i had to remove and insert host back to resolve this problem. I have some other snmpv3 switches that work "normally" but in some rare cases they get same errors as well.
This particular switch that i show you as example of problem worked for a week or so and then suddenly i started getting those errors.
What i tryed to do:
I changed engineID and recreated users - this helped for 15 minutes.
I removed host and inserted it back again - this helped for 5-10 minutes
I powered off all other snmpv3 swithes - nothing
I played with pollers quantity - nothing, only encreased cpuload
I dont think its server performance issue
Code:
root@zabbixserver01:~# uptime 12:35:29 up 21:43, 1 user, load average: 0.46, 0.29, 0.26
Comment