Ad Widget

Collapse

IPMI Template For IBM Integrated Management Module

Collapse
X
 
  • Time
  • Show
Clear All
new posts
  • Andre Torres
    Junior Member
    • Jun 2012
    • 2

    #16
    Where did you find them?

    Sire,

    First of all, I'd like to thank you for the templates. I'm testing them just now.
    I was wondering where you got those item keys and sensors. Is there any documentation for this?

    I'd like to also monitor disk's health, so guess I need to know their keys, but I don't know how.

    Thank you in advance.

    Andre Torres

    Comment

    • sire
      Senior Member
      • Jul 2010
      • 210

      #17
      Hi Andre,

      Originally posted by Andre Torres
      I was wondering where you got those item keys and sensors. Is there any documentation for this?
      The template was generated with the help of the scripts published here.

      Originally posted by Andre Torres
      I'd like to also monitor disk's health, so guess I need to know their keys, but I don't know how.
      IBM IMM IPMI board doesn't know anything about disk health, as far as I know. You need to install Zabbix agent inside the host OS and use smartctl or something else to get these data. See Zabbix docs on UserParameter.
      Regards,
      Sergey Syreskin

      Monitored hosts: 2646 / Active items: 23604 / Server performance: 765.74

      Temporary out of Zabbix business

      Comment

      • Andre Torres
        Junior Member
        • Jun 2012
        • 2

        #18
        Originally posted by sire
        Hi Andre,


        The template was generated with the help of the scripts published here.


        IBM IMM IPMI board doesn't know anything about disk health, as far as I know. You need to install Zabbix agent inside the host OS and use smartctl or something else to get these data. See Zabbix docs on UserParameter.

        Hello again Sergey.
        I'm running VMware ESXi 5.0 on my server, that's why I can't use the agent to get info about disk health.

        I runned this command on my Zabbix Server:

        Code:
        ipmitool -H SERVER_IMM_IP -U USER -P PASSWORD sdr
        If I understood it right, this command gives me the sensors I need to monitor. Here is the result:

        Code:
        Planar 3.3V      | 3.32 Volts        | ok
        Planar 5V        | 5.04 Volts        | ok
        Planar 12V       | 11.99 Volts       | ok
        Planar 5V SB     | 5.02 Volts        | ok
        CPU 1 VCore      | 1.07 Volts        | ok
        CPU 2 VCore      | disabled          | ns
        Planar VBAT      | 3.09 Volts        | ok
        CPU 1 VDIMM      | 1.50 Volts        | ok
        CPU 2 VDIMM      | disabled          | ns
        IOH 1.1V         | 1.10 Volts        | ok
        Avg Power        | 200 Watts         | ok
        Ambient Temp     | 24 degrees C      | ok
        IMM Temp         | 51.20 degrees C   | ok
        Fan 1 Tach       | 1750 RPM          | ok
        Fan 2 Tach       | 1750 RPM          | ok
        Fan 3 Tach       | 1750 RPM          | ok
        Host Power       | 0x00              | ok
        IPMI Watchdog    | 0x00              | ok
        Power 12V Fault  | 0x01              | ok
        VRD              | 0x00              | ok
        VRM              | Not Readable      | ns
        Pwr Rail A Fault | 0x01              | ok
        Pwr Rail B Fault | 0x01              | ok
        Pwr Rail C Fault | 0x01              | ok
        Pwr Rail D Fault | 0x01              | ok
        Pwr Rail E Fault | 0x01              | ok
        Pwr Rail F Fault | 0x01              | ok
        IOH Temp Status  | 0x01              | ok
        All DIMMS        | 0x00              | ok
        One of the DIMMs | 0x00              | ok
        Backup Memory    | 0x00              | ok
        OS RealTime Mod  | 0x00              | ok
        Fan 1            | 0x01              | ok
        Fan 2            | 0x01              | ok
        Fan 3            | 0x01              | ok
        Cooling Zone 1   | 0x01              | ok
        Cooling Zone 2   | 0x01              | ok
        Cooling Zone 3   | 0x01              | ok
        Drive 0          | 0x01              | ok
        Drive 1          | 0x01              | ok
        Drive 2          | 0x01              | ok
        Drive 3          | 0x01              | ok
        Drive 4          | 0x01              | ok
        Drive 5          | 0x01              | ok
        Drive 6          | 0x00              | ok
        Drive 7          | 0x00              | ok
        Drive 8          | 0x00              | ok
        Drive 9          | 0x00              | ok
        Drive 10         | 0x00              | ok
        Drive 11         | 0x00              | ok
        Drive 12         | 0x00              | ok
        Drive 13         | 0x00              | ok
        Drive 14         | 0x00              | ok
        Drive 15         | 0x00              | ok
        Power Supply 1   | 0x01              | ok
        Power Supply 2   | 0x01              | ok
        Power Backplane  | 0x01              | ok
        PS 1 Fan Fault   | 0x01              | ok
        PS 2 Fan Fault   | 0x01              | ok
        PS 1 Therm Fault | 0x01              | ok
        PS 2 Therm Fault | 0x01              | ok
        PS1 12V OV Fault | 0x01              | ok
        PS2 12V OV Fault | 0x01              | ok
        PS1 12V UV Fault | 0x01              | ok
        PS2 12V UV Fault | 0x01              | ok
        PS1 12V OC Fault | 0x01              | ok
        PS2 12V OC Fault | 0x01              | ok
        PS 1 VCO Fault   | 0x01              | ok
        PS 2 VCO Fault   | 0x01              | ok
        Power Unit       | 0x01              | ok
        NMI State        | 0x00              | ok
        RAID Error       | 0x00              | ok
        PCI Riser 1      | 0x02              | ok
        CPU 1            | 0x80              | ok
        CPU 2            | 0x00              | ok
        ABR Status       | 0x00              | ok
        All CPUs         | 0x00              | ok
        One of The CPUs  | 0x00              | ok
        DASD Backplane 1 | 0x01              | ok
        DASD Backplane 2 | 0x02              | ok
        DASD Backplane 3 | 0x02              | ok
        DASD Backplane 4 | 0x02              | ok
        PCIs             | 0x00              | ok
        CPUs             | 0x00              | ok
        DIMMs            | 0x00              | ok
        Sys Board Fault  | 0x00              | ok
        Firmware Error   | 0x00              | ok
        Progress         | 0x00              | ok
        SEL Fullness     | 0x00              | ok
        CPU 1 OverTemp   | 0x01              | ok
        CPU 2 OverTemp   | 0x01              | ok
        All PCI Error    | 0x00              | ok
        PCI 1            | 0x00              | ok
        PCI 2            | 0x00              | ok
        PCI 3            | 0x00              | ok
        PCI 4            | 0x00              | ok
        PCI 5            | 0x00              | ok
        PCI 6            | 0x00              | ok
        PCI 7            | 0x00              | ok
        PCI 8            | 0x00              | ok
        CPU Fault Reboot | 0x00              | ok
        Aux Log          | 0x00              | ok
        One of PCI Error | 0x00              | ok
        DIMM 1           | 0x00              | ok
        DIMM 2           | 0x40              | ok
        DIMM 3           | 0x40              | ok
        DIMM 4           | 0x00              | ok
        DIMM 5           | 0x00              | ok
        DIMM 6           | 0x40              | ok
        DIMM 7           | 0x00              | ok
        DIMM 8           | 0x40              | ok
        DIMM 9           | 0x00              | ok
        DIMM 10          | 0x00              | ok
        DIMM 11          | 0x00              | ok
        DIMM 12          | 0x00              | ok
        DIMM 13          | 0x00              | ok
        DIMM 14          | 0x00              | ok
        DIMM 15          | 0x00              | ok
        DIMM 16          | 0x00              | ok
        DIMM 1 Temp      | 0x00              | ok
        DIMM 2 Temp      | 0x01              | ok
        DIMM 3 Temp      | 0x01              | ok
        DIMM 4 Temp      | 0x00              | ok
        DIMM 5 Temp      | 0x00              | ok
        DIMM 6 Temp      | 0x01              | ok
        DIMM 7 Temp      | 0x00              | ok
        DIMM 8 Temp      | 0x01              | ok
        DIMM 9 Temp      | 0x00              | ok
        DIMM 10 Temp     | 0x00              | ok
        DIMM 11 Temp     | 0x00              | ok
        DIMM 12 Temp     | 0x00              | ok
        DIMM 13 Temp     | 0x00              | ok
        DIMM 14 Temp     | 0x00              | ok
        DIMM 15 Temp     | 0x00              | ok
        DIMM 16 Temp     | 0x00              | ok
        Now I can monitor, for example, the "Fan 1 Tach" information (I get 1750 RPM as the result). However, when I try to monitor "Drive 0" (the result should be 0x01), I got a not supported error in Zabbix. I mean, I can only monitor things that result in Text or Number.

        Is it the "Type of information" that I'm missing here? I tried setting it to Numeric (unsigned) + Hexadecimal but it didn't work.

        Thanks a lot!

        Comment

        • sire
          Senior Member
          • Jul 2010
          • 210

          #19
          Hello, Andre!

          Register and vote for this feature here.

          Originally posted by Andre Torres
          Hello again Sergey.
          I'm running VMware ESXi 5.0 on my server, that's why I can't use the agent to get info about disk health.

          I runned this command on my Zabbix Server:

          Code:
          ipmitool -H SERVER_IMM_IP -U USER -P PASSWORD sdr
          If I understood it right, this command gives me the sensors I need to monitor. Here is the result:

          Code:
          Planar 3.3V      | 3.32 Volts        | ok
          Planar 5V        | 5.04 Volts        | ok
          Planar 12V       | 11.99 Volts       | ok
          Planar 5V SB     | 5.02 Volts        | ok
          CPU 1 VCore      | 1.07 Volts        | ok
          CPU 2 VCore      | disabled          | ns
          Planar VBAT      | 3.09 Volts        | ok
          CPU 1 VDIMM      | 1.50 Volts        | ok
          CPU 2 VDIMM      | disabled          | ns
          IOH 1.1V         | 1.10 Volts        | ok
          Avg Power        | 200 Watts         | ok
          Ambient Temp     | 24 degrees C      | ok
          IMM Temp         | 51.20 degrees C   | ok
          Fan 1 Tach       | 1750 RPM          | ok
          Fan 2 Tach       | 1750 RPM          | ok
          Fan 3 Tach       | 1750 RPM          | ok
          Host Power       | 0x00              | ok
          IPMI Watchdog    | 0x00              | ok
          Power 12V Fault  | 0x01              | ok
          VRD              | 0x00              | ok
          VRM              | Not Readable      | ns
          Pwr Rail A Fault | 0x01              | ok
          Pwr Rail B Fault | 0x01              | ok
          Pwr Rail C Fault | 0x01              | ok
          Pwr Rail D Fault | 0x01              | ok
          Pwr Rail E Fault | 0x01              | ok
          Pwr Rail F Fault | 0x01              | ok
          IOH Temp Status  | 0x01              | ok
          All DIMMS        | 0x00              | ok
          One of the DIMMs | 0x00              | ok
          Backup Memory    | 0x00              | ok
          OS RealTime Mod  | 0x00              | ok
          Fan 1            | 0x01              | ok
          Fan 2            | 0x01              | ok
          Fan 3            | 0x01              | ok
          Cooling Zone 1   | 0x01              | ok
          Cooling Zone 2   | 0x01              | ok
          Cooling Zone 3   | 0x01              | ok
          Drive 0          | 0x01              | ok
          Drive 1          | 0x01              | ok
          Drive 2          | 0x01              | ok
          Drive 3          | 0x01              | ok
          Drive 4          | 0x01              | ok
          Drive 5          | 0x01              | ok
          Drive 6          | 0x00              | ok
          Drive 7          | 0x00              | ok
          Drive 8          | 0x00              | ok
          Drive 9          | 0x00              | ok
          Drive 10         | 0x00              | ok
          Drive 11         | 0x00              | ok
          Drive 12         | 0x00              | ok
          Drive 13         | 0x00              | ok
          Drive 14         | 0x00              | ok
          Drive 15         | 0x00              | ok
          Power Supply 1   | 0x01              | ok
          Power Supply 2   | 0x01              | ok
          Power Backplane  | 0x01              | ok
          PS 1 Fan Fault   | 0x01              | ok
          PS 2 Fan Fault   | 0x01              | ok
          PS 1 Therm Fault | 0x01              | ok
          PS 2 Therm Fault | 0x01              | ok
          PS1 12V OV Fault | 0x01              | ok
          PS2 12V OV Fault | 0x01              | ok
          PS1 12V UV Fault | 0x01              | ok
          PS2 12V UV Fault | 0x01              | ok
          PS1 12V OC Fault | 0x01              | ok
          PS2 12V OC Fault | 0x01              | ok
          PS 1 VCO Fault   | 0x01              | ok
          PS 2 VCO Fault   | 0x01              | ok
          Power Unit       | 0x01              | ok
          NMI State        | 0x00              | ok
          RAID Error       | 0x00              | ok
          PCI Riser 1      | 0x02              | ok
          CPU 1            | 0x80              | ok
          CPU 2            | 0x00              | ok
          ABR Status       | 0x00              | ok
          All CPUs         | 0x00              | ok
          One of The CPUs  | 0x00              | ok
          DASD Backplane 1 | 0x01              | ok
          DASD Backplane 2 | 0x02              | ok
          DASD Backplane 3 | 0x02              | ok
          DASD Backplane 4 | 0x02              | ok
          PCIs             | 0x00              | ok
          CPUs             | 0x00              | ok
          DIMMs            | 0x00              | ok
          Sys Board Fault  | 0x00              | ok
          Firmware Error   | 0x00              | ok
          Progress         | 0x00              | ok
          SEL Fullness     | 0x00              | ok
          CPU 1 OverTemp   | 0x01              | ok
          CPU 2 OverTemp   | 0x01              | ok
          All PCI Error    | 0x00              | ok
          PCI 1            | 0x00              | ok
          PCI 2            | 0x00              | ok
          PCI 3            | 0x00              | ok
          PCI 4            | 0x00              | ok
          PCI 5            | 0x00              | ok
          PCI 6            | 0x00              | ok
          PCI 7            | 0x00              | ok
          PCI 8            | 0x00              | ok
          CPU Fault Reboot | 0x00              | ok
          Aux Log          | 0x00              | ok
          One of PCI Error | 0x00              | ok
          DIMM 1           | 0x00              | ok
          DIMM 2           | 0x40              | ok
          DIMM 3           | 0x40              | ok
          DIMM 4           | 0x00              | ok
          DIMM 5           | 0x00              | ok
          DIMM 6           | 0x40              | ok
          DIMM 7           | 0x00              | ok
          DIMM 8           | 0x40              | ok
          DIMM 9           | 0x00              | ok
          DIMM 10          | 0x00              | ok
          DIMM 11          | 0x00              | ok
          DIMM 12          | 0x00              | ok
          DIMM 13          | 0x00              | ok
          DIMM 14          | 0x00              | ok
          DIMM 15          | 0x00              | ok
          DIMM 16          | 0x00              | ok
          DIMM 1 Temp      | 0x00              | ok
          DIMM 2 Temp      | 0x01              | ok
          DIMM 3 Temp      | 0x01              | ok
          DIMM 4 Temp      | 0x00              | ok
          DIMM 5 Temp      | 0x00              | ok
          DIMM 6 Temp      | 0x01              | ok
          DIMM 7 Temp      | 0x00              | ok
          DIMM 8 Temp      | 0x01              | ok
          DIMM 9 Temp      | 0x00              | ok
          DIMM 10 Temp     | 0x00              | ok
          DIMM 11 Temp     | 0x00              | ok
          DIMM 12 Temp     | 0x00              | ok
          DIMM 13 Temp     | 0x00              | ok
          DIMM 14 Temp     | 0x00              | ok
          DIMM 15 Temp     | 0x00              | ok
          DIMM 16 Temp     | 0x00              | ok
          Now I can monitor, for example, the "Fan 1 Tach" information (I get 1750 RPM as the result). However, when I try to monitor "Drive 0" (the result should be 0x01), I got a not supported error in Zabbix. I mean, I can only monitor things that result in Text or Number.

          Is it the "Type of information" that I'm missing here? I tried setting it to Numeric (unsigned) + Hexadecimal but it didn't work.

          Thanks a lot!
          Regards,
          Sergey Syreskin

          Monitored hosts: 2646 / Active items: 23604 / Server performance: 765.74

          Temporary out of Zabbix business

          Comment

          • ZabbixSon
            Junior Member
            • Nov 2014
            • 17

            #20
            What should i put?

            Hello Sire,

            i have some question about the template, what should i put here?

            <useip>*****</useip>
            <dns>*****</dns>
            <ip>*****</ip>
            <port>*****</port>
            <status>*****</status>
            <useipmi>*****</useipmi>
            <ipmi_ip>*****</ipmi_ip>
            <ipmi_port>*****</ipmi_port>
            <ipmi_authtype>*****</ipmi_authtype>
            <ipmi_privilege>*****</ipmi_privilege>
            <ipmi_username>*****</ipmi_username>
            <ipmi_password>*****</ipmi_password>

            Regards,

            ZabbixSon

            Comment

            • sire
              Senior Member
              • Jul 2010
              • 210

              #21
              Hi ZabbixSon,

              These vallues are set in GUI, not in the template file. Please see Zabbix manual on IPMI.
              Regards,
              Sergey Syreskin

              Monitored hosts: 2646 / Active items: 23604 / Server performance: 765.74

              Temporary out of Zabbix business

              Comment

              Working...