Hi,
for some time now I am running this project that uses GPU resources as a main basis.
I have several docker containers running, and each use different amount of GPU and VRAM.
Is there a way to monitor GPU usage of those containers each with zabbix?
f.e. container1 uses 18%...
Search Result
Collapse
5 results in 0.0018 seconds.
Keywords
Members
Tags
-
GPU usage per docker container with zabbix
-
Help with Nvidia Sensors monitoring
I find this official template for Nvidia GPU https://github.com/zabbix/community-.../5.0/README.md
But when I try to add the template to my host I got this error:
There's no tutorial on how to use this template,... -
How to detect when a GPU becomes unresponsive?
My company has a server with 8 Nvidia RTX 8000 GPUs in it (for AI training). One of the GPUs has been being problematic and will randomly crash and not show up anymore. I want to create a trigger in Zabbix for when that happens so I can get alerted. Right now we are using the Nvidia GPU template that... -
Zabbix-GPU data Aggregation
Heyo,
I'm using this zabbix template (https://share.zabbix.com/cat-server-...-multiple-gpus) to monitor GPU Servers.
Some servers have 1 GPU and some have 2 GPUs.
I'm looking for a way to aggregate data (GPU Usage) from all GPUs into... -
Monitoring NVIDIA device
I have windows servers that uses NVIDIA GPU for various tasks. I would like to monitor its usage using zabbix server. How can it be possible to show the NVIDIA usage of different server computers using Zabbix graph.
I tried using SNMP technique but I cannot get the usage value of NVIDIA devi...