Mellanox

Mellanox Technologies Ltd. is an Israeli-American multinational supplier of computer networking products based on InfiniBand and Ethernet technology.

Available solutions




This template is for Zabbix version: 6.2
Also available for: 6.0 5.4 5.0

Source: https://git.zabbix.com/projects/ZBX/repos/zabbix/browse/templates/net/mellanox_snmp?at=release/6.2

Mellanox SNMP

Overview

For Zabbix version: 6.2 and higher
The updated template for monitoring the Mellanox network switches over SNMP agent. All items collected in one template without any linked templates.

Setup

See Zabbix template operation for basic instructions.

Refer to the vendor documentation.

Zabbix configuration

The template uses context macros for the temperature trigger expression. By default, it uses a macro value like {$TEMP.MAX.CRIT}. To adjust the threshold for a certain sensor you can define context macros on the host level, with a value corresponding to your device specifications, for example: {$TEMP.MAX.CRIT:"MGMT/BOARD_MONITOR"}. Please, read https://www.zabbix.com/documentation/6.2/manual/config/macros/user_macros_context for more detailed info on user context macros.

Macros used

Name Description Default
{$CPU.UTIL.CRIT}

-

90
{$FAN_CRIT_STATUS}

The critical value of the FAN sensor for trigger expression.

3
{$ICMP.LOSS.WARN}

-

20
{$ICMP.RESPONSE_TIME.WARN}

-

0.15
{$ICMP_LOSS_WARN}

-

20
{$ICMP_RESPONSE_TIME_WARN}

-

0.15
{$IF.ERRORS.WARN}

-

2
{$IF.UTIL.MAX}

-

90
{$IFCONTROL}

-

1
{$MEMORY.NAME.MATCHES}

This macro is used in memory discovery. Can be overridden on the host level.

.*
{$MEMORY.NAME.NOT_MATCHES}

This macro is used in memory discovery. Can be overridden on the host level if you need to filter out results.

CHANGE_IF_NEEDED
{$MEMORY.TYPE.MATCHES}

This macro is used in memory discovery. Can be overridden on the host level.

.*(\.2|hrStorageRam)$
{$MEMORY.TYPE.NOT_MATCHES}

This macro is used in memory discovery. Can be overridden on the host level if you need to filter out results.

CHANGE_IF_NEEDED
{$MEMORY.UTIL.MAX}

The warning threshold of the "Physical memory: Memory utilization" item.

90
{$NET.IF.IFADMINSTATUS.MATCHES}

Ignore notPresent(6)

^.*
{$NET.IF.IFADMINSTATUS.NOT_MATCHES}

Ignore down(2) administrative status.

^2$
{$NET.IF.IFALIAS.MATCHES}

-

.*
{$NET.IF.IFALIAS.NOT_MATCHES}

-

CHANGE_IF_NEEDED
{$NET.IF.IFDESCR.MATCHES}

-

.*
{$NET.IF.IFDESCR.NOT_MATCHES}

-

CHANGE_IF_NEEDED
{$NET.IF.IFNAME.MATCHES}

-

^.*$
{$NET.IF.IFNAME.NOT_MATCHES}

Filter out loopbacks, nulls, docker veth links and docker0 bridge by default.

(^Software Loopback Interface|^NULL[0-9.]*$|^[Ll]o[0-9.]*$|^[Ss]ystem$|^Nu[0-9.]*$|^veth[0-9a-z]+$|docker[0-9]+|br-[a-z0-9]{12})
{$NET.IF.IFOPERSTATUS.MATCHES}

-

^.*$
{$NET.IF.IFOPERSTATUS.NOT_MATCHES}

Ignore notPresent(6).

^6$
{$NET.IF.IFTYPE.MATCHES}

-

.*
{$NET.IF.IFTYPE.NOT_MATCHES}

-

CHANGE_IF_NEEDED
{$PSU.STATUS.CRIT}

The critical value of the PSU sensor for trigger expression.

2
{$SNMP.TIMEOUT}

The time interval for SNMP agent availability trigger expression.

5m
{$TEMP.MAX.CRIT}

The temperature maximum critical value for trigger expression.

60
{$TEMP.MAX.WARN}

The temperature maximum warning value for trigger expression.

50
{$TEMP.MIN.CRIT}

The temperature minimum critical value for trigger expression.

5
{$TEMP.STATUS.WARN}

The critical value of the TEMP sensor for trigger expression.

3
{$VFS.FS.FREE.MIN.CRIT}

The critical threshold of the filesystem utilization.

5G
{$VFS.FS.FREE.MIN.WARN}

The warning threshold of the filesystem utilization.

10G
{$VFS.FS.FSNAME.MATCHES}

This macro is used in filesystems discovery. Can be overridden on the host level.

.+
{$VFS.FS.FSNAME.NOT_MATCHES}

This macro is used in filesystems discovery. Can be overridden on the host level.

^(/dev|/sys|/$|/run|/proc|.+/shm$)
{$VFS.FS.FSTYPE.MATCHES}

This macro is used in filesystems discovery. Can be overridden on the host level.

.*(\.4|\.9|hrStorageFixedDisk|hrStorageFlashMemory)$
{$VFS.FS.FSTYPE.NOT_MATCHES}

This macro is used in filesystems discovery. Can be overridden on the host level.

CHANGE_IF_NEEDED
{$VFS.FS.PUSED.MAX.CRIT}

-

90
{$VFS.FS.PUSED.MAX.WARN}

-

80

Template links

There are no template links in this template.

Discovery rules

Name Description Type Key and additional info
Entity Discovery

-

SNMP entity.discovery

Preprocessing:

- DISCARD_UNCHANGED_HEARTBEAT: 6h

Filter:

AND_OR

- {#ENT_CLASS} MATCHES_REGEX 3

Fan Discovery

ENTITY-SENSORS-MIB::EntitySensorDataType discovery with rpm filter

SNMP fan.discovery

Preprocessing:

- JAVASCRIPT: The text is too long. Please see the template.

- DISCARD_UNCHANGED_HEARTBEAT: 6h

Filter:

OR

- {#SNMPVALUE} MATCHES_REGEX 10

Memory discovery

HOST-RESOURCES-MIB::hrStorage discovery with memory filter

SNMP vm.memory.discovery

Preprocessing:

- DISCARD_UNCHANGED_HEARTBEAT: 6h

Filter:

AND

- {#MEMTYPE} MATCHES_REGEX {$MEMORY.TYPE.MATCHES}

- {#MEMTYPE} NOT_MATCHES_REGEX {$MEMORY.TYPE.NOT_MATCHES}

- {#MEMNAME} MATCHES_REGEX {$MEMORY.NAME.MATCHES}

- {#MEMNAME} NOT_MATCHES_REGEX {$MEMORY.NAME.NOT_MATCHES}

Network interfaces discovery

Discovering interfaces from IF-MIB.

SNMP net.if.discovery

Preprocessing:

- DISCARD_UNCHANGED_HEARTBEAT: 6h

Filter:

AND

- {#IFADMINSTATUS} MATCHES_REGEX {$NET.IF.IFADMINSTATUS.MATCHES}

- {#IFADMINSTATUS} NOT_MATCHES_REGEX {$NET.IF.IFADMINSTATUS.NOT_MATCHES}

- {#IFOPERSTATUS} MATCHES_REGEX {$NET.IF.IFOPERSTATUS.MATCHES}

- {#IFOPERSTATUS} NOT_MATCHES_REGEX {$NET.IF.IFOPERSTATUS.NOT_MATCHES}

- {#IFNAME} MATCHES_REGEX {$NET.IF.IFNAME.MATCHES}

- {#IFNAME} NOT_MATCHES_REGEX {$NET.IF.IFNAME.NOT_MATCHES}

- {#IFDESCR} MATCHES_REGEX {$NET.IF.IFDESCR.MATCHES}

- {#IFDESCR} NOT_MATCHES_REGEX {$NET.IF.IFDESCR.NOT_MATCHES}

- {#IFALIAS} MATCHES_REGEX {$NET.IF.IFALIAS.MATCHES}

- {#IFALIAS} NOT_MATCHES_REGEX {$NET.IF.IFALIAS.NOT_MATCHES}

- {#IFTYPE} MATCHES_REGEX {$NET.IF.IFTYPE.MATCHES}

- {#IFTYPE} NOT_MATCHES_REGEX {$NET.IF.IFTYPE.NOT_MATCHES}

PSU Discovery

-

SNMP psu.discovery

Preprocessing:

- DISCARD_UNCHANGED_HEARTBEAT: 6h

Filter:

AND_OR

- {#ENT_CLASS} MATCHES_REGEX 6

Storage discovery

HOST-RESOURCES-MIB::hrStorage discovery with storage filter.

SNMP vfs.fs.discovery[snmp]

Preprocessing:

- DISCARD_UNCHANGED_HEARTBEAT: 6h

Filter:

AND

- {#FSTYPE} MATCHES_REGEX {$VFS.FS.FSTYPE.MATCHES}

- {#FSTYPE} NOT_MATCHES_REGEX {$VFS.FS.FSTYPE.NOT_MATCHES}

- {#FSNAME} MATCHES_REGEX {$VFS.FS.FSNAME.MATCHES}

- {#FSNAME} NOT_MATCHES_REGEX {$VFS.FS.FSNAME.NOT_MATCHES}

Temperature Discovery

ENTITY-SENSORS-MIB::EntitySensorDataType discovery with temperature filter

SNMP temp.discovery

Preprocessing:

- JAVASCRIPT: The text is too long. Please see the template.

- DISCARD_UNCHANGED_HEARTBEAT: 6h

Filter:

AND

- {#SENSOR_TYPE} MATCHES_REGEX 8

- {#SENSOR_PRECISION} MATCHES_REGEX 1

Items collected

Group Name Description Type Key and additional info
CPU CPU utilization

MIB: HOST-RESOURCES-MIB

The average, over the last minute, of the percentage of time that processors was not idle.

Implementations may approximate this one minute smoothing period if necessary.

SNMP system.cpu.util

Preprocessing:

- JSONPATH: $..['{#CPU.UTIL}'].avg()

Fans {#SENSOR_INFO}: Fan speed

MIB: ENTITY-SENSORS-MIB

The most recent measurement obtained by the agent for this sensor.

To correctly interpret the value of this object, the associated entPhySensorType,

entPhySensorScale, and entPhySensorPrecision objects must also be examined.

SNMP sensor.fan.speed[entPhySensorValue.{#SNMPINDEX}]
Fans {#SENSOR_INFO}: Fan status

MIB: ENTITY-SENSORS-MIB

The operational status of the sensor {#SENSOR_INFO}

SNMP sensor.fan.status[entPhySensorOperStatus.{#SNMPINDEX}]
General SNMP traps (fallback)

The item is used to collect all SNMP traps unmatched by other snmptrap items

SNMP_TRAP snmptrap.fallback
General System location

MIB: SNMPv2-MIB

The physical location of this node (e.g., telephone closet, 3rd floor'). If the location is unknown, the value is the zero-length string.</p> |SNMP |system.location[sysLocation.0]<p>**Preprocessing**:</p><p>- DISCARD_UNCHANGED_HEARTBEAT:12h`

General System contact details

MIB: SNMPv2-MIB

The textual identification of the contact person for this managed node, together with information on how to contact this person. If no contact information is known, the value is the zero-length string.

SNMP system.contact[sysContact.0]

Preprocessing:

- DISCARD_UNCHANGED_HEARTBEAT: 12h

General System object ID

MIB: SNMPv2-MIB

The vendor's authoritative identification of the network management subsystem contained in the entity. This value is allocated within the SMI enterprises subtree (1.3.6.1.4.1) and provides an easy and unambiguous means for determiningwhat kind of box' is being managed. For example, if vendorFlintstones, Inc.' was assigned the subtree1.3.6.1.4.1.4242, it could assign the identifier 1.3.6.1.4.1.4242.1.1 to its Fred Router'.</p> |SNMP |system.objectid[sysObjectID.0]<p>**Preprocessing**:</p><p>- DISCARD_UNCHANGED_HEARTBEAT:12h`

General System name

MIB: SNMPv2-MIB

An administratively-assigned name for this managed node.By convention, this is the node's fully-qualified domain name. If the name is unknown, the value is the zero-length string.

SNMP system.name

Preprocessing:

- DISCARD_UNCHANGED_HEARTBEAT: 12h

General System description

MIB: SNMPv2-MIB

A textual description of the entity. This value should

include the full name and version identification of the system's hardware type, software operating-system, and

networking software.

SNMP system.descr[sysDescr.0]

Preprocessing:

- DISCARD_UNCHANGED_HEARTBEAT: 12h

Inventory {#ENT_NAME}: Hardware model name

MIB: ENTITY-MIB

SNMP system.hw.model[entPhysicalModelName.{#SNMPINDEX}]

Preprocessing:

- DISCARD_UNCHANGED_HEARTBEAT: 1d

Inventory {#ENT_NAME}: Hardware serial number

MIB: ENTITY-MIB

SNMP system.hw.serialnumber[entPhysicalSerialNum.{#SNMPINDEX}]

Preprocessing:

- DISCARD_UNCHANGED_HEARTBEAT: 1d

Memory {#MEMNAME}: Used memory

MIB: HOST-RESOURCES-MIB

The amount of the storage represented by this entry that is allocated, in units of hrStorageAllocationUnits.

SNMP vm.memory.used[hrStorageUsed.{#SNMPINDEX}]

Preprocessing:

- MULTIPLIER: {#ALLOC_UNITS}

Memory {#MEMNAME}: Total memory

MIB: HOST-RESOURCES-MIB

The size of the storage represented by this entry, in units of hrStorageAllocationUnits.

This object is writable to allow remote configuration of the size of the storage area in those cases where such an operation makes sense and is possible on the underlying system.

For example, the amount of main memory allocated to a buffer pool might be modified or the amount of disk space allocated to virtual memory might be modified.

SNMP vm.memory.total[hrStorageSize.{#SNMPINDEX}]

Preprocessing:

- MULTIPLIER: {#ALLOC_UNITS}

Memory {#MEMNAME}: Memory utilization

Memory utilization in %.

CALCULATED vm.memory.util[memoryUsedPercentage.{#SNMPINDEX}]

Expression:

last(//vm.memory.used[hrStorageUsed.{#SNMPINDEX}])/last(//vm.memory.total[hrStorageSize.{#SNMPINDEX}])*100
Network interfaces Interface {#IFNAME}({#IFALIAS}): Operational status

MIB: IF-MIB

The current operational state of the interface.

- The testing(3) state indicates that no operational packet scan be passed

- If ifAdminStatus is down(2) then ifOperStatus should be down(2)

- If ifAdminStatus is changed to up(1) then ifOperStatus should change to up(1) if the interface is ready to transmit and receive network traffic

- It should change todormant(5) if the interface is waiting for external actions (such as a serial line waiting for an incoming connection)

- It should remain in the down(2) state if and only if there is a fault that prevents it from going to the up(1) state

- It should remain in the notPresent(6) state if the interface has missing(typically, hardware) components.

SNMP net.if.status[ifOperStatus.{#SNMPINDEX}]
Network interfaces Interface {#IFNAME}({#IFALIAS}): Bits received

MIB: IF-MIB

The total number of octets received on the interface, including framing characters. This object is a 64-bit version of ifInOctets. Discontinuities in the value of this counter can occur at re-initialization of the management system, and at other times as indicated by the value of ifCounterDiscontinuityTime.

SNMP net.if.in[ifHCInOctets.{#SNMPINDEX}]

Preprocessing:

- CHANGE_PER_SECOND

- MULTIPLIER: 8

Network interfaces Interface {#IFNAME}({#IFALIAS}): Bits sent

MIB: IF-MIB

The total number of octets transmitted out of the interface, including framing characters. This object is a 64-bit version of ifOutOctets.Discontinuities in the value of this counter can occur at re-initialization of the management system, and at other times as indicated by the value of ifCounterDiscontinuityTime.

SNMP net.if.out[ifHCOutOctets.{#SNMPINDEX}]

Preprocessing:

- CHANGE_PER_SECOND

- MULTIPLIER: 8

Network interfaces Interface {#IFNAME}({#IFALIAS}): Inbound packets with errors

MIB: IF-MIB

For packet-oriented interfaces, the number of inbound packets that contained errors preventing them from being deliverable to a higher-layer protocol. For character-oriented or fixed-length interfaces, the number of inbound transmission units that contained errors preventing them from being deliverable to a higher-layer protocol. Discontinuities in the value of this counter can occur at re-initialization of the management system, and at other times as indicated by the value of ifCounterDiscontinuityTime.

SNMP net.if.in.errors[ifInErrors.{#SNMPINDEX}]

Preprocessing:

- CHANGE_PER_SECOND

Network interfaces Interface {#IFNAME}({#IFALIAS}): Outbound packets with errors

MIB: IF-MIB

For packet-oriented interfaces, the number of outbound packets that contained errors preventing them from being deliverable to a higher-layer protocol. For character-oriented or fixed-length interfaces, the number of outbound transmission units that contained errors preventing them from being deliverable to a higher-layer protocol. Discontinuities in the value of this counter can occur at re-initialization of the management system, and at other times as indicated by the value of ifCounterDiscontinuityTime.

SNMP net.if.out.errors[ifOutErrors.{#SNMPINDEX}]

Preprocessing:

- CHANGE_PER_SECOND

Network interfaces Interface {#IFNAME}({#IFALIAS}): Outbound packets discarded

MIB: IF-MIB

The number of outbound packets which were chosen to be discarded

even though no errors had been detected to prevent their being deliverable to a higher-layer protocol.

One possible reason for discarding such a packet could be to free up buffer space.

Discontinuities in the value of this counter can occur at re-initialization of the management system,

and at other times as indicated by the value of ifCounterDiscontinuityTime.

SNMP net.if.out.discards[ifOutDiscards.{#SNMPINDEX}]

Preprocessing:

- CHANGE_PER_SECOND

Network interfaces Interface {#IFNAME}({#IFALIAS}): Inbound packets discarded

MIB: IF-MIB

The number of inbound packets which were chosen to be discarded

even though no errors had been detected to prevent their being deliverable to a higher-layer protocol.

One possible reason for discarding such a packet could be to free up buffer space.

Discontinuities in the value of this counter can occur at re-initialization of the management system,

and at other times as indicated by the value of ifCounterDiscontinuityTime.

SNMP net.if.in.discards[ifInDiscards.{#SNMPINDEX}]

Preprocessing:

- CHANGE_PER_SECOND

Network interfaces Interface {#IFNAME}({#IFALIAS}): Interface type

MIB: IF-MIB

The type of interface.

Additional values for ifType are assigned by the Internet Assigned Numbers Authority (IANA),

through updating the syntax of the IANAifType textual convention.

SNMP net.if.type[ifType.{#SNMPINDEX}]

Preprocessing:

- DISCARD_UNCHANGED_HEARTBEAT: 1d

Network interfaces Interface {#IFNAME}({#IFALIAS}): Speed

MIB: IF-MIB

An estimate of the interface's current bandwidth in units of 1,000,000 bits per second. If this object reports a value of n' then the speed of the interface is somewhere in the range ofn-500,000' ton+499,999'. For interfaces which do not vary in bandwidth or for those where no accurate estimation can be made, this object should contain the nominal bandwidth. For a sub-layer which has no concept of bandwidth, this object should be zero.</p> |SNMP |net.if.speed[ifHighSpeed.{#SNMPINDEX}]<p>**Preprocessing**:</p><p>- MULTIPLIER:1000000</p><p>- DISCARD_UNCHANGED_HEARTBEAT:1h`

Power supply {#ENT_NAME}: Power supply status

MIB: ENTITY-STATE-MIB

SNMP sensor.psu.status[entStateOper.{#SNMPINDEX}]
Status Uptime (network)

MIB: SNMPv2-MIB

The time (in hundredths of a second) since the network management portion of the system was last re-initialized.

SNMP system.net.uptime[sysUpTime.0]

Preprocessing:

- MULTIPLIER: 0.01

Status Uptime (hardware)

MIB: HOST-RESOURCES-MIB

The amount of time since this host was last initialized. Note that this is different from sysUpTime in the SNMPv2-MIB [RFC1907] because sysUpTime is the uptime of the network management portion of the system.

SNMP system.hw.uptime[hrSystemUptime.0]

Preprocessing:

- CHECK_NOT_SUPPORTED

⛔️ON_FAIL: CUSTOM_VALUE -> 0

- MULTIPLIER: 0.01

Status SNMP agent availability

Availability of SNMP checks on the host. The value of this item corresponds to availability icons in the host list.

Possible value:

0 - not available

1 - available

2 - unknown

INTERNAL zabbix[host,snmp,available]
Status ICMP ping

-

SIMPLE icmpping
Status ICMP loss

-

SIMPLE icmppingloss
Status ICMP response time

-

SIMPLE icmppingsec
Storage {#FSNAME}: Used space

MIB: HOST-RESOURCES-MIB

The amount of the storage represented by this entry that is allocated, in units of hrStorageAllocationUnits.

SNMP vfs.fs.used[hrStorageUsed.{#SNMPINDEX}]

Preprocessing:

- MULTIPLIER: {#ALLOC_UNITS}

Storage {#FSNAME}: Total space

MIB: HOST-RESOURCES-MIB

The size of the storage represented by this entry, in units of hrStorageAllocationUnits.

This object is writable to allow remote configuration of the size of the storage area in those cases where such an operation makes sense and is possible on the underlying system.

For example, the amount of main storage allocated to a buffer pool might be modified or the amount of disk space allocated to virtual storage might be modified.

SNMP vfs.fs.total[hrStorageSize.{#SNMPINDEX}]

Preprocessing:

- MULTIPLIER: {#ALLOC_UNITS}

Storage {#FSNAME}: Space utilization

Space utilization in % for {#FSNAME}

CALCULATED vfs.fs.pused[storageUsedPercentage.{#SNMPINDEX}]

Expression:

(last(//vfs.fs.used[hrStorageUsed.{#SNMPINDEX}])/last(//vfs.fs.total[hrStorageSize.{#SNMPINDEX}]))*100
Temperature {#SENSOR_INFO}: Temperature

MIB: ENTITY-SENSORS-MIB

The most recent measurement obtained by the agent for this sensor.

To correctly interpret the value of this object, the associated entPhySensorType,

entPhySensorScale, and entPhySensorPrecision objects must also be examined.

SNMP sensor.temp.value[entPhySensorValue.{#SNMPINDEX}]

Preprocessing:

- MULTIPLIER: 0.1

Temperature {#SENSOR_INFO}: Temperature status

MIB: ENTITY-SENSORS-MIB

The operational status of the sensor {#SENSOR_INFO}. Possible values:

- ok(1) indicates that the agent can obtain the sensor value.

- unavailable(2) indicates that the agent presently cannot obtain the sensor value.

- nonoperational(3) indicates that the agent believes the sensor is broken. The sensor could have a hard failure (disconnected wire), or a soft failure such as out-of-range, jittery, or wildly fluctuating readings.

SNMP sensor.temp.status[entPhySensorOperStatus.{#SNMPINDEX}]

Triggers

Name Description Expression Severity Dependencies and additional info
High CPU utilization

CPU utilization is too high. The system might be slow to respond.

min(/Mellanox SNMP/system.cpu.util,5m)>{$CPU.UTIL.CRIT} WARNING
{#SENSOR_INFO}: Fan is in critical state

Please check the fan unit

count(/Mellanox SNMP/sensor.fan.status[entPhySensorOperStatus.{#SNMPINDEX}],#1,"eq","{$FAN_CRIT_STATUS}")=1 AVERAGE
System name has changed

System name has changed. Ack to close.

last(/Mellanox SNMP/system.name,#1)<>last(/Mellanox SNMP/system.name,#2) and length(last(/Mellanox SNMP/system.name))>0 INFO

Manual close: YES

{#ENT_NAME}: Device has been replaced

Device serial number has changed. Ack to close

last(/Mellanox SNMP/system.hw.serialnumber[entPhysicalSerialNum.{#SNMPINDEX}],#1)<>last(/Mellanox SNMP/system.hw.serialnumber[entPhysicalSerialNum.{#SNMPINDEX}],#2) and length(last(/Mellanox SNMP/system.hw.serialnumber[entPhysicalSerialNum.{#SNMPINDEX}]))>0 INFO

Manual close: YES

{#MEMNAME}: High memory utilization

The system is running out of free memory.

min(/Mellanox SNMP/vm.memory.util[memoryUsedPercentage.{#SNMPINDEX}],5m)>{$MEMORY.UTIL.MAX} AVERAGE
Interface {#IFNAME}({#IFALIAS}): Link down

This trigger expression works as follows:

1. Can be triggered if operations status is down.

2. {$IFCONTROL:"{#IFNAME}"}=1 - user can redefine Context macro to value - 0. That marks this interface as not important. No new trigger will be fired if this interface is down.

3. {TEMPLATE_NAME:METRIC.diff()}=1) - trigger fires only if operational status was up(1) sometime before. (So, do not fire 'ethernal off' interfaces.)

WARNING: if closed manually - won't fire again on next poll, because of .diff.

{$IFCONTROL:"{#IFNAME}"}=1 and last(/Mellanox SNMP/net.if.status[ifOperStatus.{#SNMPINDEX}])=2 and (last(/Mellanox SNMP/net.if.status[ifOperStatus.{#SNMPINDEX}],#1)<>last(/Mellanox SNMP/net.if.status[ifOperStatus.{#SNMPINDEX}],#2))

Recovery expression:

last(/Mellanox SNMP/net.if.status[ifOperStatus.{#SNMPINDEX}])<>2 or {$IFCONTROL:"{#IFNAME}"}=0
AVERAGE

Manual close: YES

Interface {#IFNAME}({#IFALIAS}): High bandwidth usage

The network interface utilization is close to its estimated maximum bandwidth.

(avg(/Mellanox SNMP/net.if.in[ifHCInOctets.{#SNMPINDEX}],15m)>({$IF.UTIL.MAX:"{#IFNAME}"}/100)*last(/Mellanox SNMP/net.if.speed[ifHighSpeed.{#SNMPINDEX}]) or avg(/Mellanox SNMP/net.if.out[ifHCOutOctets.{#SNMPINDEX}],15m)>({$IF.UTIL.MAX:"{#IFNAME}"}/100)*last(/Mellanox SNMP/net.if.speed[ifHighSpeed.{#SNMPINDEX}])) and last(/Mellanox SNMP/net.if.speed[ifHighSpeed.{#SNMPINDEX}])>0

Recovery expression:

avg(/Mellanox SNMP/net.if.in[ifHCInOctets.{#SNMPINDEX}],15m)<(({$IF.UTIL.MAX:"{#IFNAME}"}-3)/100)*last(/Mellanox SNMP/net.if.speed[ifHighSpeed.{#SNMPINDEX}]) and avg(/Mellanox SNMP/net.if.out[ifHCOutOctets.{#SNMPINDEX}],15m)<(({$IF.UTIL.MAX:"{#IFNAME}"}-3)/100)*last(/Mellanox SNMP/net.if.speed[ifHighSpeed.{#SNMPINDEX}])
WARNING

Manual close: YES

Depends on:

- Interface {#IFNAME}({#IFALIAS}): Link down

Interface {#IFNAME}({#IFALIAS}): High error rate

Recovers when below 80% of {$IF.ERRORS.WARN:"{#IFNAME}"} threshold

min(/Mellanox SNMP/net.if.in.errors[ifInErrors.{#SNMPINDEX}],5m)>{$IF.ERRORS.WARN:"{#IFNAME}"} or min(/Mellanox SNMP/net.if.out.errors[ifOutErrors.{#SNMPINDEX}],5m)>{$IF.ERRORS.WARN:"{#IFNAME}"}

Recovery expression:

max(/Mellanox SNMP/net.if.in.errors[ifInErrors.{#SNMPINDEX}],5m)<{$IF.ERRORS.WARN:"{#IFNAME}"}*0.8 and max(/Mellanox SNMP/net.if.out.errors[ifOutErrors.{#SNMPINDEX}],5m)<{$IF.ERRORS.WARN:"{#IFNAME}"}*0.8
WARNING

Manual close: YES

Depends on:

- Interface {#IFNAME}({#IFALIAS}): Link down

Interface {#IFNAME}({#IFALIAS}): Ethernet has changed to lower speed than it was before

This Ethernet connection has transitioned down from its known maximum speed. This might be a sign of autonegotiation issues. Ack to close.

change(/Mellanox SNMP/net.if.speed[ifHighSpeed.{#SNMPINDEX}])<0 and last(/Mellanox SNMP/net.if.speed[ifHighSpeed.{#SNMPINDEX}])>0 and ( last(/Mellanox SNMP/net.if.type[ifType.{#SNMPINDEX}])=6 or last(/Mellanox SNMP/net.if.type[ifType.{#SNMPINDEX}])=7 or last(/Mellanox SNMP/net.if.type[ifType.{#SNMPINDEX}])=11 or last(/Mellanox SNMP/net.if.type[ifType.{#SNMPINDEX}])=62 or last(/Mellanox SNMP/net.if.type[ifType.{#SNMPINDEX}])=69 or last(/Mellanox SNMP/net.if.type[ifType.{#SNMPINDEX}])=117 ) and (last(/Mellanox SNMP/net.if.status[ifOperStatus.{#SNMPINDEX}])<>2)

Recovery expression:

(change(/Mellanox SNMP/net.if.speed[ifHighSpeed.{#SNMPINDEX}])>0 and last(/Mellanox SNMP/net.if.speed[ifHighSpeed.{#SNMPINDEX}],#2)>0) or (last(/Mellanox SNMP/net.if.status[ifOperStatus.{#SNMPINDEX}])=2)
INFO

Manual close: YES

Depends on:

- Interface {#IFNAME}({#IFALIAS}): Link down

{#ENT_NAME}: Power supply is in critical state

Please check the power supply unit for errors

count(/Mellanox SNMP/sensor.psu.status[entStateOper.{#SNMPINDEX}],#1,"eq","{$PSU.STATUS.CRIT}")=1 AVERAGE
Host has been restarted

Uptime is less than 10 minutes.

(last(/Mellanox SNMP/system.hw.uptime[hrSystemUptime.0])>0 and last(/Mellanox SNMP/system.hw.uptime[hrSystemUptime.0])<10m) or (last(/Mellanox SNMP/system.hw.uptime[hrSystemUptime.0])=0 and last(/Mellanox SNMP/system.net.uptime[sysUpTime.0])<10m) WARNING

Manual close: YES

Depends on:

- No SNMP data collection

No SNMP data collection

SNMP is not available for polling. Please check device connectivity and SNMP settings.

max(/Mellanox SNMP/zabbix[host,snmp,available],{$SNMP.TIMEOUT})=0 WARNING

Depends on:

- Unavailable by ICMP ping

Unavailable by ICMP ping

Last three attempts returned timeout. Please check device connectivity.

max(/Mellanox SNMP/icmpping,#3)=0 HIGH
High ICMP ping loss

-

min(/Mellanox SNMP/icmppingloss,5m)>{$ICMP_LOSS_WARN} and min(/Mellanox SNMP/icmppingloss,5m)<100 WARNING

Depends on:

- Unavailable by ICMP ping

High ICMP ping response time

-

avg(/Mellanox SNMP/icmppingsec,5m)>{$ICMP_RESPONSE_TIME_WARN} WARNING

Depends on:

- High ICMP ping loss

- Unavailable by ICMP ping

{#FSNAME}: Disk space is critically low

Two conditions should match: First, space utilization should be above {$VFS.FS.PUSED.MAX.CRIT:"{#FSNAME}"}.

Second condition should be one of the following:

- The disk free space is less than {$VFS.FS.FREE.MIN.CRIT:"{#FSNAME}"}.

- The disk will be full in less than 24 hours.

last(/Mellanox SNMP/vfs.fs.pused[storageUsedPercentage.{#SNMPINDEX}])>{$VFS.FS.PUSED.MAX.CRIT:"{#FSNAME}"} and ((last(/Mellanox SNMP/vfs.fs.total[hrStorageSize.{#SNMPINDEX}])-last(/Mellanox SNMP/vfs.fs.used[hrStorageUsed.{#SNMPINDEX}]))<{$VFS.FS.FREE.MIN.CRIT:"{#FSNAME}"} or timeleft(/Mellanox SNMP/vfs.fs.pused[storageUsedPercentage.{#SNMPINDEX}],1h,100)<1d) AVERAGE

Manual close: YES

{#FSNAME}: Disk space is low

Two conditions should match: First, space utilization should be above {$VFS.FS.PUSED.MAX.WARN:"{#FSNAME}"}.

Second condition should be one of the following:

- The disk free space is less than {$VFS.FS.FREE.MIN.WARN:"{#FSNAME}"}.

- The disk will be full in less than 24 hours.

last(/Mellanox SNMP/vfs.fs.pused[storageUsedPercentage.{#SNMPINDEX}])>{$VFS.FS.PUSED.MAX.WARN:"{#FSNAME}"} and ((last(/Mellanox SNMP/vfs.fs.total[hrStorageSize.{#SNMPINDEX}])-last(/Mellanox SNMP/vfs.fs.used[hrStorageUsed.{#SNMPINDEX}]))<{$VFS.FS.FREE.MIN.WARN:"{#FSNAME}"} or timeleft(/Mellanox SNMP/vfs.fs.pused[storageUsedPercentage.{#SNMPINDEX}],1h,100)<1d) WARNING

Manual close: YES

Depends on:

- {#FSNAME}: Disk space is critically low

{#SENSOR_INFO}: Temperature is above warning threshold

This trigger uses temperature sensor values as well as temperature sensor status if available

avg(/Mellanox SNMP/sensor.temp.value[entPhySensorValue.{#SNMPINDEX}],5m)>{$TEMP.MAX.WARN:"{#SENSOR_INFO}"} or last(/Mellanox SNMP/sensor.temp.status[entPhySensorOperStatus.{#SNMPINDEX}])={$TEMP.STATUS.WARN}

Recovery expression:

max(/Mellanox SNMP/sensor.temp.value[entPhySensorValue.{#SNMPINDEX}],5m)<{$TEMP.MAX.WARN:"{#SENSOR_INFO}"}-3
WARNING

Depends on:

- {#SENSOR_INFO}: Temperature is above critical threshold

{#SENSOR_INFO}: Temperature is above critical threshold

This trigger uses temperature sensor values as well as temperature sensor status if available

avg(/Mellanox SNMP/sensor.temp.value[entPhySensorValue.{#SNMPINDEX}],5m)>{$TEMP.MAX.CRIT:"{#SENSOR_INFO}"}

Recovery expression:

max(/Mellanox SNMP/sensor.temp.value[entPhySensorValue.{#SNMPINDEX}],5m)<{$TEMP.MAX.CRIT:"{#SENSOR_INFO}"}-3
HIGH
{#SENSOR_INFO}: Temperature is too low

-

avg(/Mellanox SNMP/sensor.temp.value[entPhySensorValue.{#SNMPINDEX}],5m)<{$TEMP.MIN.CRIT:"{#SENSOR_INFO}"}

Recovery expression:

min(/Mellanox SNMP/sensor.temp.value[entPhySensorValue.{#SNMPINDEX}],5m)>{$TEMP.MIN.CRIT:"{#SENSOR_INFO}"}+3
AVERAGE

Feedback

Please report any issues with the template at https://support.zabbix.com

Articles and documentation

+ Propose new article

Sie können die Integration nicht finden, die Sie benötigen?