Hi Everyone,
This is my first post; hence apologies in advance if i have missed some of the forum rules.
Brief of the infrastructure
We have s couple of 10's of servers which are used for Compute jobs for customers.
Our customers can access our portal (web application), submit the compute files and jobs ad fire the computational job.
there is a backend automated system which does a power on (via ipmi) to the baremetal servers. These servers boot to a proxmox platform and in turn powers on the required VMs. The VM complete the job and systems are powered down in (first VM and then baremetals). All this is managed by our in house application team.
I am trying to get zabbix to monitor these servers and give us a periodic report showing which nodes have been powered on for how much duration in a given time window (day / week / month etc)
For example node 5
day 1 power on time; power off time; power on hours
Day1 power on time; power off time; power on hours (might be a second power on during the same day)
Day2 power on time; power off time; power on hours
and so on for all the nodes
i would prefer to get this via ipmi power status (+ ping preferably)
I have tried to search on google and forums but unable to find any thing, hence the topic here.
I can be considered pretty average user on linux, networking, snmp. however pretty new to zabbix.
Any advice / suggestions will really help.
If i have missed any information above, will be happy to address the same.
Many thanks in advance
This is my first post; hence apologies in advance if i have missed some of the forum rules.
Brief of the infrastructure
We have s couple of 10's of servers which are used for Compute jobs for customers.
Our customers can access our portal (web application), submit the compute files and jobs ad fire the computational job.
there is a backend automated system which does a power on (via ipmi) to the baremetal servers. These servers boot to a proxmox platform and in turn powers on the required VMs. The VM complete the job and systems are powered down in (first VM and then baremetals). All this is managed by our in house application team.
I am trying to get zabbix to monitor these servers and give us a periodic report showing which nodes have been powered on for how much duration in a given time window (day / week / month etc)
For example node 5
day 1 power on time; power off time; power on hours
Day1 power on time; power off time; power on hours (might be a second power on during the same day)
Day2 power on time; power off time; power on hours
and so on for all the nodes
i would prefer to get this via ipmi power status (+ ping preferably)
I have tried to search on google and forums but unable to find any thing, hence the topic here.
I can be considered pretty average user on linux, networking, snmp. however pretty new to zabbix.
Any advice / suggestions will really help.
If i have missed any information above, will be happy to address the same.
Many thanks in advance
Comment