Hi,
Background: Our setup has 2 Zabbix servers (in HA cluster), and all collection is made by Docker-deployed Zabbix proxies in different network zones.
We have around 2500 hosts - mostly VMs monitored by agents. We will be adding lots of different network devices, most of which have templates already available, but some platforms will need custom developed templates.
With our old monitoring system, we would be able to map interesting fields in the traps to a decent event model and simply display those.
In Zabbix, it seems we need to create an item for every single event type, meaning we have to preemptively know exactly what devices are able to send.
The concrete case is a trap that has 5 fields to determine its uniqueness:
- NodeId
- AlarmName
- ApplicationName
- InstanceId
- TableKey
Then of course they have severity, description and a bunch of other fields. So far I've seen >100 different combinations of these fields.
I've seen you can use tags and recovery expressions to solve this partly.
Is this really the best way?
- Do I need to use tons of regex expressions to extract the tags for every single trigger?
- What if an alarm can have multiple different severities (not just up/down)? Do I need a separate trigger for each of those?
- Does it make sense to use a trap parser like SNMPTT, and will this break existing templates using OIDs/SNMP names?
Comment