Mellanox

Mellanox Technologies Ltd. is an Israeli-American multinational supplier of computer networking products based on InfiniBand and Ethernet technology.

Available solutions

This template is for Zabbix version: 7.0

Source: https://git.zabbix.com/projects/ZBX/repos/zabbix/browse/templates/net/mellanox_snmp?at=release/7.0

Mellanox by SNMP

Overview

This template is designed for the effortless deployment of Mellanox monitoring by Zabbix via SNMP and doesn't require any external scripts.

Requirements

Zabbix version: 7.0 and higher.

Tested versions

This template has been tested on:

  • Mellanox

Configuration

The template uses context macros in the temperature trigger expressions. By default, a trigger uses the base macro value, such as {$TEMP.MAX.CRIT}. To adjust the threshold for a specific sensor, define a context macro at the host level with a value that matches your device specifications, for example: {$TEMP.MAX.CRIT:"MGMT/BOARD_MONITOR"}. See https://www.zabbix.com/documentation/7.0/manual/config/macros/user_macros_context for more details on user macros with context.
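
As a rough illustration of the fallback described above, the following Python sketch mimics how a context macro overrides the base macro. The `resolve_macro` helper and the override value of 75 are hypothetical and exist only for this example; Zabbix performs the resolution internally and also supports regular-expression contexts, which this sketch ignores.

```python
def resolve_macro(host_macros, name, context=None):
    """Return a user macro value, preferring a context-specific override."""
    if context is not None:
        key = '{$%s:"%s"}' % (name, context)
        if key in host_macros:
            return host_macros[key]
    # No matching context macro: fall back to the base macro.
    return host_macros['{$%s}' % name]


# Template default plus one hypothetical host-level override.
macros = {
    '{$TEMP.MAX.CRIT}': 60,
    '{$TEMP.MAX.CRIT:"MGMT/BOARD_MONITOR"}': 75,
}

board_limit = resolve_macro(macros, 'TEMP.MAX.CRIT', 'MGMT/BOARD_MONITOR')
other_limit = resolve_macro(macros, 'TEMP.MAX.CRIT', 'SOME/OTHER_SENSOR')
```

Here `board_limit` takes the host-level override, while `other_limit` falls back to the template default because no context macro exists for that sensor.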

Setup

Refer to the vendor documentation.

Macros used

Name Description Default
{$IFCONTROL} 1
{$PSU.STATUS.CRIT}

The critical value of the PSU sensor for trigger expression.

2
{$ICMP_LOSS_WARN} 20
{$ICMP_RESPONSE_TIME_WARN} 0.15
{$FAN_CRIT_STATUS}

The critical value of the FAN sensor for trigger expression.

3
{$TEMP.STATUS.WARN}

The warning status value of the TEMP sensor for trigger expression.

3
{$TEMP.MAX.CRIT}

The temperature maximum critical value for trigger expression.

60
{$TEMP.MAX.WARN}

The temperature maximum warning value for trigger expression.

50
{$TEMP.MIN.CRIT}

The temperature minimum critical value for trigger expression.

5
{$VFS.FS.FSNAME.NOT_MATCHES}

This macro is used in filesystems discovery. Can be overridden on the host level.

^(/dev|/sys|/$|/run|/proc|.+/shm$)
{$VFS.FS.FSNAME.MATCHES}

This macro is used in filesystems discovery. Can be overridden on the host level.

.+
{$VFS.FS.FSTYPE.NOT_MATCHES}

This macro is used in filesystems discovery. Can be overridden on the host level.

CHANGE_IF_NEEDED
{$VFS.FS.FSTYPE.MATCHES}

This macro is used in filesystems discovery. Can be overridden on the host level.

.*(\.4|\.9|hrStorageFixedDisk|hrStorageFlashMemory)$
{$MEMORY.UTIL.MAX}

The warning threshold of the "Physical memory: Memory utilization" item.

90
{$MEMORY.TYPE.NOT_MATCHES}

This macro is used in memory discovery. Can be overridden on the host level if you need to filter out results.

CHANGE_IF_NEEDED
{$MEMORY.TYPE.MATCHES}

This macro is used in memory discovery. Can be overridden on the host level.

.*(\.2|hrStorageRam)$
{$MEMORY.NAME.MATCHES}

This macro is used in memory discovery. Can be overridden on the host level.

.*
{$MEMORY.NAME.NOT_MATCHES}

This macro is used in memory discovery. Can be overridden on the host level if you need to filter out results.

CHANGE_IF_NEEDED
{$NET.IF.IFNAME.MATCHES} ^.*$
{$NET.IF.IFNAME.NOT_MATCHES}

Filter out loopbacks, nulls, docker veth links and docker0 bridge by default.

Macro too long. Please see the template.
{$NET.IF.IFOPERSTATUS.MATCHES} ^.*$
{$NET.IF.IFOPERSTATUS.NOT_MATCHES}

Ignore notPresent(6).

^6$
{$NET.IF.IFADMINSTATUS.MATCHES} ^.*
{$NET.IF.IFADMINSTATUS.NOT_MATCHES}

Ignore down(2) administrative status.

^2$
{$NET.IF.IFDESCR.MATCHES} .*
{$NET.IF.IFDESCR.NOT_MATCHES} CHANGE_IF_NEEDED
{$NET.IF.IFALIAS.MATCHES} .*
{$NET.IF.IFALIAS.NOT_MATCHES} CHANGE_IF_NEEDED
{$NET.IF.IFTYPE.MATCHES} .*
{$NET.IF.IFTYPE.NOT_MATCHES} CHANGE_IF_NEEDED
{$SNMP.TIMEOUT}

The time interval for SNMP agent availability trigger expression.

5m
{$ICMP.LOSS.WARN} 20
{$ICMP.RESPONSE_TIME.WARN} 0.15
{$VFS.FS.FREE.MIN.CRIT}

The critical threshold for free filesystem space.

5G
{$VFS.FS.FREE.MIN.WARN}

The warning threshold for free filesystem space.

10G
{$CPU.UTIL.CRIT} 90
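
The MATCHES and NOT_MATCHES macros above work as paired discovery filters: an entity is kept only if it matches the first pattern and does not match the second. The sketch below shows that logic in Python using the filesystem name defaults from the table. The `passes_filter` helper and the candidate list are hypothetical, and the sketch assumes that Python's `re` module treats these simple patterns the same way as the regular expression engine used by Zabbix.

```python
import re

# Default values taken from the macro table above.
FSNAME_MATCHES = r'.+'
FSNAME_NOT_MATCHES = r'^(/dev|/sys|/$|/run|/proc|.+/shm$)'


def passes_filter(value, matches, not_matches):
    """Keep a value only if it matches `matches` and not `not_matches`."""
    return bool(re.search(matches, value)) and not re.search(not_matches, value)


# Hypothetical filesystem names returned by discovery.
candidates = ['/', '/var', '/dev/shm', '/run/lock', '/opt/data']
kept = [name for name in candidates
        if passes_filter(name, FSNAME_MATCHES, FSNAME_NOT_MATCHES)]
```

With these defaults, only `/var` and `/opt/data` remain in `kept`. A NOT_MATCHES value of CHANGE_IF_NEEDED is a placeholder that is not expected to match any real value, so it filters nothing out until it is overridden.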

Items

Name Description Type Key and additional info
CPU utilization

MIB: HOST-RESOURCES-MIB

The average, over the last minute, of the percentage of time that the processors were not idle.

Implementations may approximate this one minute smoothing period if necessary.

SNMP agent system.cpu.util

Preprocessing

  • JSON Path: $..['{#CPU.UTIL}'].avg()

Uptime (network)

MIB: SNMPv2-MIB

The time (in hundredths of a second) since the network management portion of the system was last re-initialized.

SNMP agent system.net.uptime[sysUpTime.0]

Preprocessing

  • Custom multiplier: 0.01

Uptime (hardware)

MIB: HOST-RESOURCES-MIB

The amount of time since this host was last initialized. Note that this is different from sysUpTime in the SNMPv2-MIB [RFC1907] because sysUpTime is the uptime of the network management portion of the system.

SNMP agent system.hw.uptime[hrSystemUptime.0]

Preprocessing

  • Check for not supported value: any error

    ⛔️Custom on fail: Set value to: 0

  • Custom multiplier: 0.01

SNMP traps (fallback)

The item is used to collect all SNMP traps unmatched by other snmptrap items.

SNMP trap snmptrap.fallback
System location

MIB: SNMPv2-MIB

The physical location of this node (e.g., 'telephone closet, 3rd floor'). If the location is unknown, the value is the zero-length string.

SNMP agent system.location[sysLocation.0]

Preprocessing

  • Discard unchanged with heartbeat: 12h

System contact details

MIB: SNMPv2-MIB

The textual identification of the contact person for this managed node, together with information on how to contact this person. If no contact information is known, the value is the zero-length string.

SNMP agent system.contact[sysContact.0]

Preprocessing

  • Discard unchanged with heartbeat: 12h

System object ID

MIB: SNMPv2-MIB

The vendor's authoritative identification of the network management subsystem contained in the entity. This value is allocated within the SMI enterprises subtree (1.3.6.1.4.1) and provides an easy and unambiguous means for determining 'what kind of box' is being managed. For example, if vendor 'Flintstones, Inc.' was assigned the subtree 1.3.6.1.4.1.4242, it could assign the identifier 1.3.6.1.4.1.4242.1.1 to its 'Fred Router'.

SNMP agent system.objectid[sysObjectID.0]

Preprocessing

  • Discard unchanged with heartbeat: 12h

System name

MIB: SNMPv2-MIB

An administratively-assigned name for this managed node. By convention, this is the node's fully-qualified domain name. If the name is unknown, the value is the zero-length string.

SNMP agent system.name

Preprocessing

  • Discard unchanged with heartbeat: 12h

System description

MIB: SNMPv2-MIB

A textual description of the entity. This value should include the full name and version identification of the system's hardware type, software operating-system, and networking software.

SNMP agent system.descr[sysDescr.0]

Preprocessing

  • Discard unchanged with heartbeat: 12h

SNMP agent availability

Availability of SNMP checks on the host. The value of this item corresponds to availability icons in the host list.

Possible values:

0 - not available

1 - available

2 - unknown

Zabbix internal zabbix[host,snmp,available]
ICMP ping Simple check icmpping
ICMP loss Simple check icmppingloss
ICMP response time Simple check icmppingsec
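
The two uptime items above share the same preprocessing idea: the raw SNMP TimeTicks value is in hundredths of a second, so the custom multiplier of 0.01 turns it into seconds, and the hardware uptime item additionally replaces an unsupported value with 0. A minimal Python sketch of that pipeline follows. The function names are hypothetical, and `None` is used here as an assumed marker for a "not supported" value.

```python
def uptime_seconds(timeticks):
    """Convert SNMP TimeTicks (hundredths of a second) to seconds."""
    # Dividing by 100 is equivalent to the 0.01 custom multiplier.
    return timeticks / 100


def hw_uptime_seconds(raw_value):
    """Mimic the hardware uptime preprocessing: unsupported values become 0."""
    if raw_value is None:  # assumed marker for a "not supported" value
        return 0.0
    return uptime_seconds(raw_value)
```

For example, a raw value of 360000 ticks corresponds to 3600 seconds, that is, one hour of uptime.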

Triggers

Name Description Expression Severity Dependencies and additional info
Mellanox: High CPU utilization

The CPU utilization is too high. The system might be slow to respond.

min(/Mellanox by SNMP/system.cpu.util,5m)>{$CPU.UTIL.CRIT} Warning
Mellanox: Host has been restarted

Uptime is less than 10 minutes.

(last(/Mellanox by SNMP/system.hw.uptime[hrSystemUptime.0])>0 and last(/Mellanox by SNMP/system.hw.uptime[hrSystemUptime.0])<10m) or (last(/Mellanox by SNMP/system.hw.uptime[hrSystemUptime.0])=0 and last(/Mellanox by SNMP/system.net.uptime[sysUpTime.0])<10m) Warning Manual close: Yes
Depends on:
  • Mellanox: No SNMP data collection
Mellanox: System name has changed

The name of the system has changed. Acknowledge to close the problem manually.

last(/Mellanox by SNMP/system.name,#1)<>last(/Mellanox by SNMP/system.name,#2) and length(last(/Mellanox by SNMP/system.name))>0 Info Manual close: Yes
Mellanox: No SNMP data collection

SNMP is not available for polling. Please check device connectivity and SNMP settings.

max(/Mellanox by SNMP/zabbix[host,snmp,available],{$SNMP.TIMEOUT})=0 Warning Depends on:
  • Mellanox: Unavailable by ICMP ping
Mellanox: Unavailable by ICMP ping

Last three attempts returned timeout. Please check device connectivity.

max(/Mellanox by SNMP/icmpping,#3)=0 High
Mellanox: High ICMP ping loss min(/Mellanox by SNMP/icmppingloss,5m)>{$ICMP_LOSS_WARN} and min(/Mellanox by SNMP/icmppingloss,5m)<100 Warning Depends on:
  • Mellanox: Unavailable by ICMP ping
Mellanox: High ICMP ping response time avg(/Mellanox by SNMP/icmppingsec,5m)>{$ICMP_RESPONSE_TIME_WARN} Warning Depends on:
  • Mellanox: High ICMP ping loss
  • Mellanox: Unavailable by ICMP ping
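
The "Host has been restarted" expression above combines both uptime items: it prefers the hardware uptime and falls back to the network uptime only when the hardware value is 0 (which is what the preprocessing stores when hrSystemUptime is not supported). The following Python sketch restates that logic. The `host_restarted` function is a hypothetical helper, not part of the template.

```python
TEN_MINUTES = 600  # seconds, the "10m" in the trigger expression


def host_restarted(hw_uptime, net_uptime):
    """Return True when the 'Host has been restarted' condition holds.

    Both arguments are uptimes in seconds. hw_uptime is 0 when
    hrSystemUptime is unavailable, in which case sysUpTime decides.
    """
    if hw_uptime > 0:
        return hw_uptime < TEN_MINUTES
    return hw_uptime == 0 and net_uptime < TEN_MINUTES
```

Note that a recently restarted SNMP agent alone does not satisfy the condition when a valid hardware uptime is available and exceeds 10 minutes.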

LLD rule Temperature Discovery

Name Description Type Key and additional info
Temperature Discovery

ENTITY-SENSORS-MIB::EntitySensorDataType discovery with temperature filter

SNMP agent temp.discovery

Preprocessing

  • JavaScript: The text is too long. Please see the template.

  • Discard unchanged with heartbeat: 6h

Item prototypes for Temperature Discovery

Name Description Type Key and additional info
{#SENSOR_INFO}: Temperature

MIB: ENTITY-SENSORS-MIB

The most recent measurement obtained by the agent for this sensor.

To correctly interpret the value of this object, the associated entPhySensorType, entPhySensorScale, and entPhySensorPrecision objects must also be examined.

SNMP agent sensor.temp.value[entPhySensorValue.{#SNMPINDEX}]

Preprocessing

  • Custom multiplier: 0.1

{#SENSOR_INFO}: Temperature status

MIB: ENTITY-SENSORS-MIB

The operational status of the sensor {#SENSOR_INFO}. Possible values:

- ok(1) indicates that the agent can obtain the sensor value.

- unavailable(2) indicates that the agent presently cannot obtain the sensor value.

- nonoperational(3) indicates that the agent believes the sensor is broken. The sensor could have a hard failure (disconnected wire), or a soft failure such as out-of-range, jittery, or wildly fluctuating readings.

SNMP agent sensor.temp.status[entPhySensorOperStatus.{#SNMPINDEX}]

Trigger prototypes for Temperature Discovery

Name Description Expression Severity Dependencies and additional info
Mellanox: {#SENSOR_INFO}: Temperature is above warning threshold

This trigger uses temperature sensor values as well as the temperature sensor status, if available.

avg(/Mellanox by SNMP/sensor.temp.value[entPhySensorValue.{#SNMPINDEX}],5m)>{$TEMP.MAX.WARN:"{#SENSOR_INFO}"} or last(/Mellanox by SNMP/sensor.temp.status[entPhySensorOperStatus.{#SNMPINDEX}])={$TEMP.STATUS.WARN} Warning Depends on:
  • Mellanox: {#SENSOR_INFO}: Temperature is above critical threshold
Mellanox: {#SENSOR_INFO}: Temperature is above critical threshold

This trigger uses temperature sensor values as well as the temperature sensor status, if available.

avg(/Mellanox by SNMP/sensor.temp.value[entPhySensorValue.{#SNMPINDEX}],5m)>{$TEMP.MAX.CRIT:"{#SENSOR_INFO}"} High
Mellanox: {#SENSOR_INFO}: Temperature is too low avg(/Mellanox by SNMP/sensor.temp.value[entPhySensorValue.{#SNMPINDEX}],5m)<{$TEMP.MIN.CRIT:"{#SENSOR_INFO}"} Average
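
To show how the three temperature trigger prototypes interact, here is a small Python sketch that evaluates them against a 5-minute average and a sensor status. The `temperature_problems` function is hypothetical; its default thresholds are the template defaults from the macro table ({$TEMP.MAX.WARN}=50, {$TEMP.MAX.CRIT}=60, {$TEMP.MIN.CRIT}=5, {$TEMP.STATUS.WARN}=3). The warning trigger depends on the critical one, which the sketch approximates by reporting only the critical problem when both conditions hold.

```python
def temperature_problems(avg_temp_5m, status,
                         max_warn=50, max_crit=60, min_crit=5, status_warn=3):
    """Return the temperature problems that would be reported."""
    problems = []
    critical = avg_temp_5m > max_crit
    warning = avg_temp_5m > max_warn or status == status_warn
    if critical:
        problems.append('critical')
    elif warning:
        # Reported only when the critical trigger is not firing.
        problems.append('warning')
    if avg_temp_5m < min_crit:
        problems.append('too low')
    return problems
```

A sensor averaging 55 degrees with an ok(1) status yields a warning, and so does a sensor at 40 degrees whose status is nonoperational(3).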

LLD rule Fan Discovery

Name Description Type Key and additional info
Fan Discovery

ENTITY-SENSORS-MIB::EntitySensorDataType discovery with rpm filter

SNMP agent fan.discovery

Preprocessing

  • JavaScript: The text is too long. Please see the template.

  • Discard unchanged with heartbeat: 6h

Item prototypes for Fan Discovery

Name Description Type Key and additional info
{#SENSOR_INFO}: Fan speed

MIB: ENTITY-SENSORS-MIB

The most recent measurement obtained by the agent for this sensor.

To correctly interpret the value of this object, the associated entPhySensorType, entPhySensorScale, and entPhySensorPrecision objects must also be examined.

SNMP agent sensor.fan.speed[entPhySensorValue.{#SNMPINDEX}]
{#SENSOR_INFO}: Fan status

MIB: ENTITY-SENSORS-MIB

The operational status of the sensor {#SENSOR_INFO}.

SNMP agent sensor.fan.status[entPhySensorOperStatus.{#SNMPINDEX}]

Trigger prototypes for Fan Discovery

Name Description Expression Severity Dependencies and additional info
Mellanox: {#SENSOR_INFO}: Fan is in critical state

Please check the fan unit

count(/Mellanox by SNMP/sensor.fan.status[entPhySensorOperStatus.{#SNMPINDEX}],#1,"eq","{$FAN_CRIT_STATUS}")=1 Average

LLD rule Entity Discovery

Name Description Type Key and additional info
Entity Discovery SNMP agent entity.discovery

Preprocessing

  • Discard unchanged with heartbeat: 6h

Item prototypes for Entity Discovery

Name Description Type Key and additional info
{#ENT_NAME}: Hardware model name

MIB: ENTITY-MIB

SNMP agent system.hw.model[entPhysicalModelName.{#SNMPINDEX}]

Preprocessing

  • Discard unchanged with heartbeat: 1d

{#ENT_NAME}: Hardware serial number

MIB: ENTITY-MIB

SNMP agent system.hw.serialnumber[entPhysicalSerialNum.{#SNMPINDEX}]

Preprocessing

  • Discard unchanged with heartbeat: 1d

Trigger prototypes for Entity Discovery

Name Description Expression Severity Dependencies and additional info
Mellanox: {#ENT_NAME}: Device has been replaced

Device serial number has changed. Acknowledge to close the problem manually.

last(/Mellanox by SNMP/system.hw.serialnumber[entPhysicalSerialNum.{#SNMPINDEX}],#1)<>last(/Mellanox by SNMP/system.hw.serialnumber[entPhysicalSerialNum.{#SNMPINDEX}],#2) and length(last(/Mellanox by SNMP/system.hw.serialnumber[entPhysicalSerialNum.{#SNMPINDEX}]))>0 Info Manual close: Yes

LLD rule PSU Discovery

Name Description Type Key and additional info
PSU Discovery SNMP agent psu.discovery

Preprocessing

  • Discard unchanged with heartbeat: 6h

Item prototypes for PSU Discovery

Name Description Type Key and additional info
{#ENT_NAME}: Power supply status

MIB: ENTITY-STATE-MIB

SNMP agent sensor.psu.status[entStateOper.{#SNMPINDEX}]

Trigger prototypes for PSU Discovery

Name Description Expression Severity Dependencies and additional info
Mellanox: {#ENT_NAME}: Power supply is in critical state

Please check the power supply unit for errors

count(/Mellanox by SNMP/sensor.psu.status[entStateOper.{#SNMPINDEX}],#1,"eq","{$PSU.STATUS.CRIT}")=1 Average

LLD rule Storage discovery

Name Description Type Key and additional info
Storage discovery

HOST-RESOURCES-MIB::hrStorage discovery with storage filter.

SNMP agent vfs.fs.discovery[snmp]

Preprocessing

  • Discard unchanged with heartbeat: 6h

Item prototypes for Storage discovery

Name Description Type Key and additional info
{#FSNAME}: Used space

MIB: HOST-RESOURCES-MIB

The amount of the storage represented by this entry that is allocated, in units of hrStorageAllocationUnits.

SNMP agent vfs.fs.used[hrStorageUsed.{#SNMPINDEX}]

Preprocessing

  • Custom multiplier: {#ALLOC_UNITS}

{#FSNAME}: Total space

MIB: HOST-RESOURCES-MIB

The size of the storage represented by this entry, in units of hrStorageAllocationUnits.

This object is writable to allow remote configuration of the size of the storage area in those cases where such an operation makes sense and is possible on the underlying system.

For example, the amount of main storage allocated to a buffer pool might be modified or the amount of disk space allocated to virtual storage might be modified.

SNMP agent vfs.fs.total[hrStorageSize.{#SNMPINDEX}]

Preprocessing

  • Custom multiplier: {#ALLOC_UNITS}

{#FSNAME}: Space utilization

The space utilization expressed in % for {#FSNAME}.

Calculated vfs.fs.pused[storageUsedPercentage.{#SNMPINDEX}]

Trigger prototypes for Storage discovery

Name Description Expression Severity Dependencies and additional info
Mellanox: {#FSNAME}: Disk space is critically low

Two conditions should match: First, space utilization should be above {$VFS.FS.PUSED.MAX.CRIT:"{#FSNAME}"}.
Second condition should be one of the following:
- The disk free space is less than {$VFS.FS.FREE.MIN.CRIT:"{#FSNAME}"}.
- The disk will be full in less than 24 hours.

last(/Mellanox by SNMP/vfs.fs.pused[storageUsedPercentage.{#SNMPINDEX}])>{$VFS.FS.PUSED.MAX.CRIT:"{#FSNAME}"} and ((last(/Mellanox by SNMP/vfs.fs.total[hrStorageSize.{#SNMPINDEX}])-last(/Mellanox by SNMP/vfs.fs.used[hrStorageUsed.{#SNMPINDEX}]))<{$VFS.FS.FREE.MIN.CRIT:"{#FSNAME}"} or timeleft(/Mellanox by SNMP/vfs.fs.pused[storageUsedPercentage.{#SNMPINDEX}],1h,100)<1d) Average Manual close: Yes
Mellanox: {#FSNAME}: Disk space is low

Two conditions should match: First, space utilization should be above {$VFS.FS.PUSED.MAX.WARN:"{#FSNAME}"}.
Second condition should be one of the following:
- The disk free space is less than {$VFS.FS.FREE.MIN.WARN:"{#FSNAME}"}.
- The disk will be full in less than 24 hours.

last(/Mellanox by SNMP/vfs.fs.pused[storageUsedPercentage.{#SNMPINDEX}])>{$VFS.FS.PUSED.MAX.WARN:"{#FSNAME}"} and ((last(/Mellanox by SNMP/vfs.fs.total[hrStorageSize.{#SNMPINDEX}])-last(/Mellanox by SNMP/vfs.fs.used[hrStorageUsed.{#SNMPINDEX}]))<{$VFS.FS.FREE.MIN.WARN:"{#FSNAME}"} or timeleft(/Mellanox by SNMP/vfs.fs.pused[storageUsedPercentage.{#SNMPINDEX}],1h,100)<1d) Warning Manual close: Yes
Depends on:
  • Mellanox: {#FSNAME}: Disk space is critically low
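
Both disk space triggers share one shape: utilization above a percentage threshold, and either too little free space or a projected fill-up within 24 hours. The Python sketch below restates that shape. It is an illustration under several assumptions: the function names are hypothetical, the utilization formula (used divided by total) is assumed because the calculated item's formula is not shown here, the `hours_to_full` argument stands in for the result of timeleft(), and the `pused_max` value must be supplied by the caller because the default of {$VFS.FS.PUSED.MAX.CRIT} is not listed in the macro table.

```python
def fs_metrics(used_units, size_units, alloc_unit_bytes):
    """Derive used bytes, total bytes and utilization (%) from hrStorage values."""
    used = used_units * alloc_unit_bytes    # custom multiplier {#ALLOC_UNITS}
    total = size_units * alloc_unit_bytes
    pused = 100.0 * used / total if total else 0.0  # assumed formula
    return used, total, pused


def disk_space_problem(used, total, pused, hours_to_full,
                       pused_max, free_min_bytes):
    """Evaluate the condition shared by both disk space triggers."""
    free = total - used
    return pused > pused_max and (free < free_min_bytes or hours_to_full < 24)


GIB = 1024 ** 3  # Zabbix treats the G suffix as 1024^3 bytes
used, total, pused = fs_metrics(950, 1000, 4096)
is_critical = disk_space_problem(used, total, pused, hours_to_full=1000,
                                 pused_max=90, free_min_bytes=5 * GIB)
```

In this example the filesystem is 95% full with far less than 5G free, so the condition holds for a hypothetical 90% threshold even though the disk is not projected to fill within a day.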

LLD rule Memory discovery

Name Description Type Key and additional info
Memory discovery

HOST-RESOURCES-MIB::hrStorage discovery with memory filter

SNMP agent vm.memory.discovery

Preprocessing

  • Discard unchanged with heartbeat: 6h

Item prototypes for Memory discovery

Name Description Type Key and additional info
{#MEMNAME}: Used memory

MIB: HOST-RESOURCES-MIB

The amount of the storage represented by this entry that is allocated, in units of hrStorageAllocationUnits.

SNMP agent vm.memory.used[hrStorageUsed.{#SNMPINDEX}]

Preprocessing

  • Custom multiplier: {#ALLOC_UNITS}

{#MEMNAME}: Total memory

MIB: HOST-RESOURCES-MIB

The size of the storage represented by this entry, in units of hrStorageAllocationUnits.

This object is writable to allow remote configuration of the size of the storage area in those cases where such an operation makes sense and is possible on the underlying system.

For example, the amount of main memory allocated to a buffer pool might be modified or the amount of disk space allocated to virtual memory might be modified.

SNMP agent vm.memory.total[hrStorageSize.{#SNMPINDEX}]

Preprocessing

  • Custom multiplier: {#ALLOC_UNITS}

{#MEMNAME}: Memory utilization

Memory utilization in %.

Calculated vm.memory.util[memoryUsedPercentage.{#SNMPINDEX}]

Trigger prototypes for Memory discovery

Name Description Expression Severity Dependencies and additional info
Mellanox: {#MEMNAME}: High memory utilization

The system is running out of free memory.

min(/Mellanox by SNMP/vm.memory.util[memoryUsedPercentage.{#SNMPINDEX}],5m)>{$MEMORY.UTIL.MAX} Average

LLD rule Network interfaces discovery

Name Description Type Key and additional info
Network interfaces discovery

Discovering interfaces from IF-MIB.

SNMP agent net.if.discovery

Preprocessing

  • Discard unchanged with heartbeat: 6h

Item prototypes for Network interfaces discovery

Name Description Type Key and additional info
Interface {#IFNAME}({#IFALIAS}): Operational status

MIB: IF-MIB

The current operational state of the interface.

- The testing(3) state indicates that no operational packets can be passed

- If ifAdminStatus is down(2) then ifOperStatus should be down(2)

- If ifAdminStatus is changed to up(1) then ifOperStatus should change to up(1) if the interface is ready to transmit and receive network traffic

- It should change to dormant(5) if the interface is waiting for external actions (such as a serial line waiting for an incoming connection)

- It should remain in the down(2) state if and only if there is a fault that prevents it from going to the up(1) state

- It should remain in the notPresent(6) state if the interface has missing (typically, hardware) components.

SNMP agent net.if.status[ifOperStatus.{#SNMPINDEX}]
Interface {#IFNAME}({#IFALIAS}): Bits received

MIB: IF-MIB

The total number of octets received on the interface, including framing characters. This object is a 64-bit version of ifInOctets. Discontinuities in the value of this counter can occur at re-initialization of the management system, and at other times as indicated by the value of ifCounterDiscontinuityTime.

SNMP agent net.if.in[ifHCInOctets.{#SNMPINDEX}]

Preprocessing

  • Change per second
  • Custom multiplier: 8

Interface {#IFNAME}({#IFALIAS}): Bits sent

MIB: IF-MIB

The total number of octets transmitted out of the interface, including framing characters. This object is a 64-bit version of ifOutOctets. Discontinuities in the value of this counter can occur at re-initialization of the management system, and at other times as indicated by the value of ifCounterDiscontinuityTime.

SNMP agent net.if.out[ifHCOutOctets.{#SNMPINDEX}]

Preprocessing

  • Change per second
  • Custom multiplier: 8

Interface {#IFNAME}({#IFALIAS}): Inbound packets with errors

MIB: IF-MIB

For packet-oriented interfaces, the number of inbound packets that contained errors preventing them from being deliverable to a higher-layer protocol. For character-oriented or fixed-length interfaces, the number of inbound transmission units that contained errors preventing them from being deliverable to a higher-layer protocol. Discontinuities in the value of this counter can occur at re-initialization of the management system, and at other times as indicated by the value of ifCounterDiscontinuityTime.

SNMP agent net.if.in.errors[ifInErrors.{#SNMPINDEX}]

Preprocessing

  • Change per second
Interface {#IFNAME}({#IFALIAS}): Outbound packets with errors

MIB: IF-MIB

For packet-oriented interfaces, the number of outbound packets that could not be transmitted because of errors. For character-oriented or fixed-length interfaces, the number of outbound transmission units that could not be transmitted because of errors. Discontinuities in the value of this counter can occur at re-initialization of the management system, and at other times as indicated by the value of ifCounterDiscontinuityTime.

SNMP agent net.if.out.errors[ifOutErrors.{#SNMPINDEX}]

Preprocessing

  • Change per second
Interface {#IFNAME}({#IFALIAS}): Outbound packets discarded

MIB: IF-MIB

The number of outbound packets which were chosen to be discarded even though no errors had been detected to prevent their being transmitted. One possible reason for discarding such a packet could be to free up buffer space. Discontinuities in the value of this counter can occur at re-initialization of the management system, and at other times as indicated by the value of ifCounterDiscontinuityTime.

SNMP agent net.if.out.discards[ifOutDiscards.{#SNMPINDEX}]

Preprocessing

  • Change per second
Interface {#IFNAME}({#IFALIAS}): Inbound packets discarded

MIB: IF-MIB

The number of inbound packets which were chosen to be discarded even though no errors had been detected to prevent their being deliverable to a higher-layer protocol. One possible reason for discarding such a packet could be to free up buffer space. Discontinuities in the value of this counter can occur at re-initialization of the management system, and at other times as indicated by the value of ifCounterDiscontinuityTime.

SNMP agent net.if.in.discards[ifInDiscards.{#SNMPINDEX}]

Preprocessing

  • Change per second
Interface {#IFNAME}({#IFALIAS}): Interface type

MIB: IF-MIB

The type of interface. Additional values for ifType are assigned by the Internet Assigned Numbers Authority (IANA), through updating the syntax of the IANAifType textual convention.

SNMP agent net.if.type[ifType.{#SNMPINDEX}]

Preprocessing

  • Discard unchanged with heartbeat: 1d

Interface {#IFNAME}({#IFALIAS}): Speed

MIB: IF-MIB

An estimate of the interface's current bandwidth in units of 1,000,000 bits per second. If this object reports a value of 'n' then the speed of the interface is somewhere in the range of 'n-500,000' to 'n+499,999'. For interfaces which do not vary in bandwidth or for those where no accurate estimation can be made, this object should contain the nominal bandwidth. For a sub-layer which has no concept of bandwidth, this object should be zero.

SNMP agent net.if.speed[ifHighSpeed.{#SNMPINDEX}]

Preprocessing

  • Custom multiplier: 1000000

  • Discard unchanged with heartbeat: 1h
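
The traffic and speed items above rely on two simple conversions: octet counters become bits per second through "Change per second" followed by a multiplier of 8, and ifHighSpeed becomes bits per second through a multiplier of 1000000. The Python sketch below illustrates both. The function names are hypothetical, and returning `None` for a decreasing counter is an approximation of Zabbix discarding such a value.

```python
def bits_per_second(prev_octets, prev_time, curr_octets, curr_time):
    """Apply 'Change per second', then a multiplier of 8, to an octet counter."""
    elapsed = curr_time - prev_time
    if elapsed <= 0 or curr_octets < prev_octets:
        return None  # no rate for a non-advancing clock or a counter reset
    return (curr_octets - prev_octets) / elapsed * 8


def speed_bps(if_high_speed):
    """Convert ifHighSpeed (units of 1,000,000 bit/s) to bit/s."""
    return if_high_speed * 1000000
```

For instance, a counter that grows by 75000 octets over 60 seconds corresponds to 10000 bits per second, and an ifHighSpeed value of 10000 corresponds to a 10 Gbit/s interface.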

Trigger prototypes for Network interfaces discovery

Name Description Expression Severity Dependencies and additional info
Mellanox: Interface {#IFNAME}({#IFALIAS}): Link down

This trigger expression works as follows:
1. It can fire only if the operational status is down(2).
2. {$IFCONTROL:"{#IFNAME}"}=1 - a user can redefine the context macro to 0, which marks the interface as not important. No new trigger will fire if that interface goes down.
3. The last two values of the operational status item (#1 and #2) must differ - the trigger fires only if the status was something other than down just before, for example up(1) (so it does not fire for 'eternal off' interfaces).

WARNING: if closed manually, the trigger will not fire again on the next poll, because the status has not changed since the previous value.

{$IFCONTROL:"{#IFNAME}"}=1 and last(/Mellanox by SNMP/net.if.status[ifOperStatus.{#SNMPINDEX}])=2 and (last(/Mellanox by SNMP/net.if.status[ifOperStatus.{#SNMPINDEX}],#1)<>last(/Mellanox by SNMP/net.if.status[ifOperStatus.{#SNMPINDEX}],#2)) Average Manual close: Yes
Mellanox: Interface {#IFNAME}({#IFALIAS}): High bandwidth usage

The utilization of the network interface is close to its estimated maximum bandwidth.

(avg(/Mellanox by SNMP/net.if.in[ifHCInOctets.{#SNMPINDEX}],15m)>({$IF.UTIL.MAX:"{#IFNAME}"}/100)*last(/Mellanox by SNMP/net.if.speed[ifHighSpeed.{#SNMPINDEX}]) or avg(/Mellanox by SNMP/net.if.out[ifHCOutOctets.{#SNMPINDEX}],15m)>({$IF.UTIL.MAX:"{#IFNAME}"}/100)*last(/Mellanox by SNMP/net.if.speed[ifHighSpeed.{#SNMPINDEX}])) and last(/Mellanox by SNMP/net.if.speed[ifHighSpeed.{#SNMPINDEX}])>0 Warning Manual close: Yes
Depends on:
  • Mellanox: Interface {#IFNAME}({#IFALIAS}): Link down
Mellanox: Interface {#IFNAME}({#IFALIAS}): High error rate

The trigger recovers when the error rate falls below 80% of the {$IF.ERRORS.WARN:"{#IFNAME}"} threshold.

min(/Mellanox by SNMP/net.if.in.errors[ifInErrors.{#SNMPINDEX}],5m)>{$IF.ERRORS.WARN:"{#IFNAME}"} or min(/Mellanox by SNMP/net.if.out.errors[ifOutErrors.{#SNMPINDEX}],5m)>{$IF.ERRORS.WARN:"{#IFNAME}"} Warning Manual close: Yes
Depends on:
  • Mellanox: Interface {#IFNAME}({#IFALIAS}): Link down
Mellanox: Interface {#IFNAME}({#IFALIAS}): Ethernet has changed to lower speed than it was before

This Ethernet connection has transitioned down from its known maximum speed. This might be a sign of autonegotiation issues. Acknowledge to close the problem manually.

change(/Mellanox by SNMP/net.if.speed[ifHighSpeed.{#SNMPINDEX}])<0 and last(/Mellanox by SNMP/net.if.speed[ifHighSpeed.{#SNMPINDEX}])>0 and ( last(/Mellanox by SNMP/net.if.type[ifType.{#SNMPINDEX}])=6 or last(/Mellanox by SNMP/net.if.type[ifType.{#SNMPINDEX}])=7 or last(/Mellanox by SNMP/net.if.type[ifType.{#SNMPINDEX}])=11 or last(/Mellanox by SNMP/net.if.type[ifType.{#SNMPINDEX}])=62 or last(/Mellanox by SNMP/net.if.type[ifType.{#SNMPINDEX}])=69 or last(/Mellanox by SNMP/net.if.type[ifType.{#SNMPINDEX}])=117 ) and (last(/Mellanox by SNMP/net.if.status[ifOperStatus.{#SNMPINDEX}])<>2) Info Manual close: Yes
Depends on:
  • Mellanox: Interface {#IFNAME}({#IFALIAS}): Link down
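
The "Link down" and "High bandwidth usage" expressions above can be restated compactly in Python. This is a sketch rather than the template's implementation: the function names are hypothetical, and the utilization threshold is passed in by the caller because the default of {$IF.UTIL.MAX} is not listed in the macro table.

```python
def link_down(if_control, last_status, prev_status):
    """Mirror 'Link down': down(2) now, and changed since the previous value."""
    return if_control == 1 and last_status == 2 and last_status != prev_status


def high_bandwidth(avg_in_bps, avg_out_bps, speed_bps, util_max_percent):
    """Mirror 'High bandwidth usage' using 15-minute averages in bit/s."""
    limit = (util_max_percent / 100) * speed_bps
    return (avg_in_bps > limit or avg_out_bps > limit) and speed_bps > 0
```

An interface that goes from up(1) to down(2) satisfies `link_down`, while one that was already down on the previous poll does not. The final `speed_bps > 0` check keeps interfaces that report no speed from raising a bandwidth problem.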

Feedback

Please report any issues with the template at https://support.zabbix.com

You can also provide feedback, discuss the template, or ask for help at the Zabbix forums.
