Intel_Qlogic Infiniband SNMP
Overview
For Zabbix version: 6.2 and higher
Setup
Refer to the vendor documentation.
Zabbix configuration
No specific Zabbix configuration is required.
Macros used
Name | Description | Default |
---|---|---|
{$FAN_CRIT_STATUS} | - |
3 |
{$ICMP_LOSS_WARN} | - |
20 |
{$ICMP_RESPONSE_TIME_WARN} | - |
0.15 |
{$IF.ERRORS.WARN} | - |
2 |
{$IF.UTIL.MAX} | - |
90 |
{$IFCONTROL} | - |
1 |
{$NET.IF.IFADMINSTATUS.MATCHES} | Ignore notPresent(6) |
^.* |
{$NET.IF.IFADMINSTATUS.NOT_MATCHES} | Ignore down(2) administrative status |
^2$ |
{$NET.IF.IFALIAS.MATCHES} | - |
.* |
{$NET.IF.IFALIAS.NOT_MATCHES} | - |
CHANGE_IF_NEEDED |
{$NET.IF.IFDESCR.MATCHES} | - |
.* |
{$NET.IF.IFDESCR.NOT_MATCHES} | - |
CHANGE_IF_NEEDED |
{$NET.IF.IFNAME.MATCHES} | - |
^.*$ |
{$NET.IF.IFNAME.NOT_MATCHES} | Filter out loopbacks, nulls, docker veth links and docker0 bridge by default |
(^Software Loopback Interface|^NULL[0-9.]*$|^[Ll]o[0-9.]*$|^[Ss]ystem$|^Nu[0-9.]*$|^veth[0-9a-z]+$|docker[0-9]+|br-[a-z0-9]{12}) |
{$NET.IF.IFOPERSTATUS.MATCHES} | - |
^.*$ |
{$NET.IF.IFOPERSTATUS.NOT_MATCHES} | Ignore notPresent(6) |
^6$ |
{$NET.IF.IFTYPE.MATCHES} | - |
.* |
{$NET.IF.IFTYPE.NOT_MATCHES} | - |
CHANGE_IF_NEEDED |
{$PSU_CRIT_STATUS} | - |
3 |
{$PSU_WARN_STATUS} | - |
4 |
{$SNMP.TIMEOUT} | - |
5m |
{$TEMP_CRIT_LOW} | - |
5 |
{$TEMP_CRIT_STATUS} | - |
3 |
{$TEMP_CRIT} | - |
60 |
{$TEMP_WARN_STATUS} | - |
2 |
{$TEMP_WARN} | - |
50 |
Template links
There are no template links in this template.
Discovery rules
Name | Description | Type | Key and additional info |
---|---|---|---|
FAN Discovery | icsChassisFanDescription of icsChassisFanTable |
SNMP | fan.discovery |
Network interfaces discovery | Discovering interfaces from IF-MIB. |
SNMP | net.if.discovery Filter: AND- {#IFADMINSTATUS} MATCHES_REGEX - {#IFADMINSTATUS} NOT_MATCHES_REGEX - {#IFOPERSTATUS} MATCHES_REGEX - {#IFOPERSTATUS} NOT_MATCHES_REGEX - {#IFNAME} MATCHES_REGEX - {#IFNAME} NOT_MATCHES_REGEX - {#IFDESCR} MATCHES_REGEX - {#IFDESCR} NOT_MATCHES_REGEX - {#IFALIAS} MATCHES_REGEX - {#IFALIAS} NOT_MATCHES_REGEX - {#IFTYPE} MATCHES_REGEX - {#IFTYPE} NOT_MATCHES_REGEX |
PSU Discovery | A textual description of the power supply, that can be assigned by the administrator. |
SNMP | psu.discovery |
Temperature Discovery | Discovering sensor's table with temperature filter |
SNMP | temp.discovery Filter: AND- {#SENSOR_TYPE} MATCHES_REGEX |
Unit Discovery | - |
SNMP | unit.discovery Filter: AND_OR- {#ENT_CLASS} MATCHES_REGEX |
Items collected
Group | Name | Description | Type | Key and additional info |
---|---|---|---|---|
Fans | {#SNMPVALUE}: Fan status | MIB: ICS-CHASSIS-MIB The operational status of the fan unit. |
SNMP | sensor.fan.status[icsChassisFanOperStatus.{#SNMPINDEX}] |
General | SNMP traps (fallback) | The item is used to collect all SNMP traps unmatched by other snmptrap items |
SNMP_TRAP | snmptrap.fallback |
General | System location | MIB: SNMPv2-MIB The physical location of this node (e.g., |
||
General | System contact details | MIB: SNMPv2-MIB The textual identification of the contact person for this managed node, together with information on how to contact this person. If no contact information is known, the value is the zero-length string. |
SNMP | system.contact[sysContact.0] Preprocessing: - DISCARD_UNCHANGED_HEARTBEAT: |
General | System object ID | MIB: SNMPv2-MIB The vendor's authoritative identification of the network management subsystem contained in the entity. This value is allocated within the SMI enterprises subtree (1.3.6.1.4.1) and provides an easy and unambiguous means for determining |
||
General | System name | MIB: SNMPv2-MIB An administratively-assigned name for this managed node.By convention, this is the node's fully-qualified domain name. If the name is unknown, the value is the zero-length string. |
SNMP | system.name Preprocessing: - DISCARD_UNCHANGED_HEARTBEAT: |
General | System description | MIB: SNMPv2-MIB A textual description of the entity. This value should include the full name and version identification of the system's hardware type, software operating-system, and networking software. |
SNMP | system.descr[sysDescr.0] Preprocessing: - DISCARD_UNCHANGED_HEARTBEAT: |
Inventory | Hardware model name | MIB: ICS-CHASSIS-MIB |
SNMP | system.hw.model Preprocessing: - REGEX: - DISCARD_UNCHANGED_HEARTBEAT: |
Inventory | Firmware version | MIB: ICS-CHASSIS-MIB |
SNMP | system.hw.firmware Preprocessing: - REGEX: - DISCARD_UNCHANGED_HEARTBEAT: |
Inventory | {#ENT_NAME}: Hardware serial number | MIB: ICS-CHASSIS-MIB The serial number of the FRU. If not available, this value is a zero-length string. |
SNMP | system.hw.serialnumber[icsChassisSystemUnitFruSerialNumber.{#SNMPINDEX}] Preprocessing: - DISCARD_UNCHANGED_HEARTBEAT: |
Network interfaces | Interface {#IFNAME}({#IFALIAS}): Operational status | MIB: IF-MIB The current operational state of the interface. - The testing(3) state indicates that no operational packet scan be passed - If ifAdminStatus is down(2) then ifOperStatus should be down(2) - If ifAdminStatus is changed to up(1) then ifOperStatus should change to up(1) if the interface is ready to transmit and receive network traffic - It should change todormant(5) if the interface is waiting for external actions (such as a serial line waiting for an incoming connection) - It should remain in the down(2) state if and only if there is a fault that prevents it from going to the up(1) state - It should remain in the notPresent(6) state if the interface has missing(typically, hardware) components. |
SNMP | net.if.status[ifOperStatus.{#SNMPINDEX}] |
Network interfaces | Interface {#IFNAME}({#IFALIAS}): Bits received | MIB: IF-MIB The total number of octets received on the interface, including framing characters. This object is a 64-bit version of ifInOctets. Discontinuities in the value of this counter can occur at re-initialization of the management system, and at other times as indicated by the value of ifCounterDiscontinuityTime. |
SNMP | net.if.in[ifHCInOctets.{#SNMPINDEX}] Preprocessing: - CHANGE_PER_SECOND - MULTIPLIER: |
Network interfaces | Interface {#IFNAME}({#IFALIAS}): Bits sent | MIB: IF-MIB The total number of octets transmitted out of the interface, including framing characters. This object is a 64-bit version of ifOutOctets.Discontinuities in the value of this counter can occur at re-initialization of the management system, and at other times as indicated by the value of ifCounterDiscontinuityTime. |
SNMP | net.if.out[ifHCOutOctets.{#SNMPINDEX}] Preprocessing: - CHANGE_PER_SECOND - MULTIPLIER: |
Network interfaces | Interface {#IFNAME}({#IFALIAS}): Inbound packets with errors | MIB: IF-MIB For packet-oriented interfaces, the number of inbound packets that contained errors preventing them from being deliverable to a higher-layer protocol. For character-oriented or fixed-length interfaces, the number of inbound transmission units that contained errors preventing them from being deliverable to a higher-layer protocol. Discontinuities in the value of this counter can occur at re-initialization of the management system, and at other times as indicated by the value of ifCounterDiscontinuityTime. |
SNMP | net.if.in.errors[ifInErrors.{#SNMPINDEX}] Preprocessing: - CHANGE_PER_SECOND |
Network interfaces | Interface {#IFNAME}({#IFALIAS}): Outbound packets with errors | MIB: IF-MIB For packet-oriented interfaces, the number of outbound packets that contained errors preventing them from being deliverable to a higher-layer protocol. For character-oriented or fixed-length interfaces, the number of outbound transmission units that contained errors preventing them from being deliverable to a higher-layer protocol. Discontinuities in the value of this counter can occur at re-initialization of the management system, and at other times as indicated by the value of ifCounterDiscontinuityTime. |
SNMP | net.if.out.errors[ifOutErrors.{#SNMPINDEX}] Preprocessing: - CHANGE_PER_SECOND |
Network interfaces | Interface {#IFNAME}({#IFALIAS}): Outbound packets discarded | MIB: IF-MIB The number of outbound packets which were chosen to be discarded even though no errors had been detected to prevent their being deliverable to a higher-layer protocol. One possible reason for discarding such a packet could be to free up buffer space. Discontinuities in the value of this counter can occur at re-initialization of the management system, and at other times as indicated by the value of ifCounterDiscontinuityTime. |
SNMP | net.if.out.discards[ifOutDiscards.{#SNMPINDEX}] Preprocessing: - CHANGE_PER_SECOND |
Network interfaces | Interface {#IFNAME}({#IFALIAS}): Inbound packets discarded | MIB: IF-MIB The number of inbound packets which were chosen to be discarded even though no errors had been detected to prevent their being deliverable to a higher-layer protocol. One possible reason for discarding such a packet could be to free up buffer space. Discontinuities in the value of this counter can occur at re-initialization of the management system, and at other times as indicated by the value of ifCounterDiscontinuityTime. |
SNMP | net.if.in.discards[ifInDiscards.{#SNMPINDEX}] Preprocessing: - CHANGE_PER_SECOND |
Network interfaces | Interface {#IFNAME}({#IFALIAS}): Interface type | MIB: IF-MIB The type of interface. Additional values for ifType are assigned by the Internet Assigned Numbers Authority (IANA), through updating the syntax of the IANAifType textual convention. |
SNMP | net.if.type[ifType.{#SNMPINDEX}] Preprocessing: - DISCARD_UNCHANGED_HEARTBEAT: |
Network interfaces | Interface {#IFNAME}({#IFALIAS}): Speed | MIB: IF-MIB An estimate of the interface's current bandwidth in units of 1,000,000 bits per second. If this object reports a value of |
||
Power supply | {#SNMPVALUE}: Power supply status | MIB: ICS-CHASSIS-MIB Actual status of the power supply: (1) unknown: status not known. (2) disabled: power supply is disabled. (3) failed - power supply is unable to supply power due to failure. (4) warning - power supply is supplying power, but an output or sensor is bad or warning. (5) standby - power supply believed usable,but not supplying power. (6) engaged - power supply is supplying power. (7) redundant - power supply is supplying power, but not needed. (8) notPresent - power supply is supplying power is not present. |
SNMP | sensor.psu.status[icsChassisPowerSupplyEntry.{#SNMPINDEX}] |
Status | Uptime | MIB: SNMPv2-MIB The time (in hundredths of a second) since the network management portion of the system was last re-initialized. |
SNMP | system.uptime[sysUpTime.0] Preprocessing: - MULTIPLIER: |
Status | SNMP agent availability | Availability of SNMP checks on the host. The value of this item corresponds to availability icons in the host list. Possible value: 0 - not available 1 - available 2 - unknown |
INTERNAL | zabbix[host,snmp,available] |
Status | ICMP ping | - |
SIMPLE | icmpping |
Status | ICMP loss | - |
SIMPLE | icmppingloss |
Status | ICMP response time | - |
SIMPLE | icmppingsec |
Temperature | {#SENSOR_INFO}: Temperature | MIB: ICS-CHASSIS-MIB The current value read from the sensor. |
SNMP | sensor.temp.value[icsChassisSensorSlotValue.{#SNMPINDEX}] |
Temperature | {#SENSOR_INFO}: Temperature status | MIB: ICS-CHASSIS-MIB The operational status of the sensor. |
SNMP | sensor.temp.status[icsChassisSensorSlotOperStatus.{#SNMPINDEX}] |
Triggers
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
{#SNMPVALUE}: Fan is in critical state | Please check the fan unit |
count(/Intel_Qlogic Infiniband SNMP/sensor.fan.status[icsChassisFanOperStatus.{#SNMPINDEX}],#1,"eq","{$FAN_CRIT_STATUS}")=1 |
AVERAGE | |
System name has changed | System name has changed. Ack to close. |
last(/Intel_Qlogic Infiniband SNMP/system.name,#1)<>last(/Intel_Qlogic Infiniband SNMP/system.name,#2) and length(last(/Intel_Qlogic Infiniband SNMP/system.name))>0 |
INFO | Manual close: YES |
Firmware has changed | Firmware version has changed. Ack to close |
last(/Intel_Qlogic Infiniband SNMP/system.hw.firmware,#1)<>last(/Intel_Qlogic Infiniband SNMP/system.hw.firmware,#2) and length(last(/Intel_Qlogic Infiniband SNMP/system.hw.firmware))>0 |
INFO | Manual close: YES |
{#ENT_NAME}: Device has been replaced | Device serial number has changed. Ack to close |
last(/Intel_Qlogic Infiniband SNMP/system.hw.serialnumber[icsChassisSystemUnitFruSerialNumber.{#SNMPINDEX}],#1)<>last(/Intel_Qlogic Infiniband SNMP/system.hw.serialnumber[icsChassisSystemUnitFruSerialNumber.{#SNMPINDEX}],#2) and length(last(/Intel_Qlogic Infiniband SNMP/system.hw.serialnumber[icsChassisSystemUnitFruSerialNumber.{#SNMPINDEX}]))>0 |
INFO | Manual close: YES |
Interface {#IFNAME}({#IFALIAS}): Link down | This trigger expression works as follows: 1. Can be triggered if operations status is down. 2. {$IFCONTROL:"{#IFNAME}"}=1 - user can redefine Context macro to value - 0. That marks this interface as not important. No new trigger will be fired if this interface is down. 3. {TEMPLATE_NAME:METRIC.diff()}=1) - trigger fires only if operational status was up(1) sometime before. (So, do not fire 'ethernal off' interfaces.) WARNING: if closed manually - won't fire again on next poll, because of .diff. |
{$IFCONTROL:"{#IFNAME}"}=1 and last(/Intel_Qlogic Infiniband SNMP/net.if.status[ifOperStatus.{#SNMPINDEX}])=2 and (last(/Intel_Qlogic Infiniband SNMP/net.if.status[ifOperStatus.{#SNMPINDEX}],#1)<>last(/Intel_Qlogic Infiniband SNMP/net.if.status[ifOperStatus.{#SNMPINDEX}],#2)) Recovery expression: last(/Intel_Qlogic Infiniband SNMP/net.if.status[ifOperStatus.{#SNMPINDEX}])<>2 or {$IFCONTROL:"{#IFNAME}"}=0 |
AVERAGE | Manual close: YES |
Interface {#IFNAME}({#IFALIAS}): High bandwidth usage | The network interface utilization is close to its estimated maximum bandwidth. |
(avg(/Intel_Qlogic Infiniband SNMP/net.if.in[ifHCInOctets.{#SNMPINDEX}],15m)>({$IF.UTIL.MAX:"{#IFNAME}"}/100)*last(/Intel_Qlogic Infiniband SNMP/net.if.speed[ifHighSpeed.{#SNMPINDEX}]) or avg(/Intel_Qlogic Infiniband SNMP/net.if.out[ifHCOutOctets.{#SNMPINDEX}],15m)>({$IF.UTIL.MAX:"{#IFNAME}"}/100)*last(/Intel_Qlogic Infiniband SNMP/net.if.speed[ifHighSpeed.{#SNMPINDEX}])) and last(/Intel_Qlogic Infiniband SNMP/net.if.speed[ifHighSpeed.{#SNMPINDEX}])>0 Recovery expression: avg(/Intel_Qlogic Infiniband SNMP/net.if.in[ifHCInOctets.{#SNMPINDEX}],15m)<(({$IF.UTIL.MAX:"{#IFNAME}"}-3)/100)*last(/Intel_Qlogic Infiniband SNMP/net.if.speed[ifHighSpeed.{#SNMPINDEX}]) and avg(/Intel_Qlogic Infiniband SNMP/net.if.out[ifHCOutOctets.{#SNMPINDEX}],15m)<(({$IF.UTIL.MAX:"{#IFNAME}"}-3)/100)*last(/Intel_Qlogic Infiniband SNMP/net.if.speed[ifHighSpeed.{#SNMPINDEX}]) |
WARNING | Manual close: YES Depends on: - Interface {#IFNAME}({#IFALIAS}): Link down |
Interface {#IFNAME}({#IFALIAS}): High error rate | Recovers when below 80% of {$IF.ERRORS.WARN:"{#IFNAME}"} threshold |
min(/Intel_Qlogic Infiniband SNMP/net.if.in.errors[ifInErrors.{#SNMPINDEX}],5m)>{$IF.ERRORS.WARN:"{#IFNAME}"} or min(/Intel_Qlogic Infiniband SNMP/net.if.out.errors[ifOutErrors.{#SNMPINDEX}],5m)>{$IF.ERRORS.WARN:"{#IFNAME}"} Recovery expression: max(/Intel_Qlogic Infiniband SNMP/net.if.in.errors[ifInErrors.{#SNMPINDEX}],5m)<{$IF.ERRORS.WARN:"{#IFNAME}"}*0.8 and max(/Intel_Qlogic Infiniband SNMP/net.if.out.errors[ifOutErrors.{#SNMPINDEX}],5m)<{$IF.ERRORS.WARN:"{#IFNAME}"}*0.8 |
WARNING | Manual close: YES Depends on: - Interface {#IFNAME}({#IFALIAS}): Link down |
Interface {#IFNAME}({#IFALIAS}): Ethernet has changed to lower speed than it was before | This Ethernet connection has transitioned down from its known maximum speed. This might be a sign of autonegotiation issues. Ack to close. |
change(/Intel_Qlogic Infiniband SNMP/net.if.speed[ifHighSpeed.{#SNMPINDEX}])<0 and last(/Intel_Qlogic Infiniband SNMP/net.if.speed[ifHighSpeed.{#SNMPINDEX}])>0 and ( last(/Intel_Qlogic Infiniband SNMP/net.if.type[ifType.{#SNMPINDEX}])=6 or last(/Intel_Qlogic Infiniband SNMP/net.if.type[ifType.{#SNMPINDEX}])=7 or last(/Intel_Qlogic Infiniband SNMP/net.if.type[ifType.{#SNMPINDEX}])=11 or last(/Intel_Qlogic Infiniband SNMP/net.if.type[ifType.{#SNMPINDEX}])=62 or last(/Intel_Qlogic Infiniband SNMP/net.if.type[ifType.{#SNMPINDEX}])=69 or last(/Intel_Qlogic Infiniband SNMP/net.if.type[ifType.{#SNMPINDEX}])=117 ) and (last(/Intel_Qlogic Infiniband SNMP/net.if.status[ifOperStatus.{#SNMPINDEX}])<>2) Recovery expression: (change(/Intel_Qlogic Infiniband SNMP/net.if.speed[ifHighSpeed.{#SNMPINDEX}])>0 and last(/Intel_Qlogic Infiniband SNMP/net.if.speed[ifHighSpeed.{#SNMPINDEX}],#2)>0) or (last(/Intel_Qlogic Infiniband SNMP/net.if.status[ifOperStatus.{#SNMPINDEX}])=2) |
INFO | Manual close: YES Depends on: - Interface {#IFNAME}({#IFALIAS}): Link down |
{#SNMPVALUE}: Power supply is in critical state | Please check the power supply unit for errors |
count(/Intel_Qlogic Infiniband SNMP/sensor.psu.status[icsChassisPowerSupplyEntry.{#SNMPINDEX}],#1,"eq","{$PSU_CRIT_STATUS}")=1 |
AVERAGE | |
{#SNMPVALUE}: Power supply is in warning state | Please check the power supply unit for errors |
count(/Intel_Qlogic Infiniband SNMP/sensor.psu.status[icsChassisPowerSupplyEntry.{#SNMPINDEX}],#1,"eq","{$PSU_WARN_STATUS}")=1 |
WARNING | Depends on: - {#SNMPVALUE}: Power supply is in critical state |
has been restarted | Uptime is less than 10 minutes. |
last(/Intel_Qlogic Infiniband SNMP/system.uptime[sysUpTime.0])<10m |
WARNING | Manual close: YES Depends on: - No SNMP data collection |
No SNMP data collection | SNMP is not available for polling. Please check device connectivity and SNMP settings. |
max(/Intel_Qlogic Infiniband SNMP/zabbix[host,snmp,available],{$SNMP.TIMEOUT})=0 |
WARNING | Depends on: - Unavailable by ICMP ping |
Unavailable by ICMP ping | Last three attempts returned timeout. Please check device connectivity. |
max(/Intel_Qlogic Infiniband SNMP/icmpping,#3)=0 |
HIGH | |
High ICMP ping loss | - |
min(/Intel_Qlogic Infiniband SNMP/icmppingloss,5m)>{$ICMP_LOSS_WARN} and min(/Intel_Qlogic Infiniband SNMP/icmppingloss,5m)<100 |
WARNING | Depends on: - Unavailable by ICMP ping |
High ICMP ping response time | - |
avg(/Intel_Qlogic Infiniband SNMP/icmppingsec,5m)>{$ICMP_RESPONSE_TIME_WARN} |
WARNING | Depends on: - High ICMP ping loss - Unavailable by ICMP ping |
{#SENSOR_INFO}: Temperature is above warning threshold | This trigger uses temperature sensor values as well as temperature sensor status if available |
avg(/Intel_Qlogic Infiniband SNMP/sensor.temp.value[icsChassisSensorSlotValue.{#SNMPINDEX}],5m)>{$TEMP_WARN:"{#SENSOR_INFO}"} or last(/Intel_Qlogic Infiniband SNMP/sensor.temp.status[icsChassisSensorSlotOperStatus.{#SNMPINDEX}])={$TEMP_WARN_STATUS} Recovery expression: max(/Intel_Qlogic Infiniband SNMP/sensor.temp.value[icsChassisSensorSlotValue.{#SNMPINDEX}],5m)<{$TEMP_WARN:"{#SENSOR_INFO}"}-3 |
WARNING | Depends on: - {#SENSOR_INFO}: Temperature is above critical threshold |
{#SENSOR_INFO}: Temperature is above critical threshold | This trigger uses temperature sensor values as well as temperature sensor status if available |
avg(/Intel_Qlogic Infiniband SNMP/sensor.temp.value[icsChassisSensorSlotValue.{#SNMPINDEX}],5m)>{$TEMP_CRIT:"{#SENSOR_INFO}"} or last(/Intel_Qlogic Infiniband SNMP/sensor.temp.status[icsChassisSensorSlotOperStatus.{#SNMPINDEX}])={$TEMP_CRIT_STATUS} Recovery expression: max(/Intel_Qlogic Infiniband SNMP/sensor.temp.value[icsChassisSensorSlotValue.{#SNMPINDEX}],5m)<{$TEMP_CRIT:"{#SENSOR_INFO}"}-3 |
HIGH | |
{#SENSOR_INFO}: Temperature is too low | - |
avg(/Intel_Qlogic Infiniband SNMP/sensor.temp.value[icsChassisSensorSlotValue.{#SNMPINDEX}],5m)<{$TEMP_CRIT_LOW:"{#SENSOR_INFO}"} Recovery expression: min(/Intel_Qlogic Infiniband SNMP/sensor.temp.value[icsChassisSensorSlotValue.{#SNMPINDEX}],5m)>{$TEMP_CRIT_LOW:"{#SENSOR_INFO}"}+3 |
AVERAGE |
Feedback
Please report any issues with the template at https://support.zabbix.com