Hi all,
I have a Zabbix server which contains some hosts (8 Switch devices and 40 Linux servers). Monitoring process is going to be good except in some cases I get the wrong values in Latest data.
I have created a script to get RAID status of my Linux servers and put it in zabbix_agentd.conf with the proper syntax. Until now everything is OK and when I check the status of a hard disk in RAID array with the use of zabbix_get command I can see the value "Online". The problem is that when I check the status of that Hard disk in the latest data, I see "Failed" value for it !!
The problem is that it is fluctuating, I mean when a check is occurred in Zabbix (1 per 5 minutes) it shows the value "Online" and after 2 or 3 next checks, it gets "Failed" and after some checks it shows "Online" again!!
I am using LLD for discovering Raid type and hard disks inside the arrays. This problem does not happen for all hosts only a few hosts are facing this issue.
Any help is appreciated.
I have a Zabbix server which contains some hosts (8 Switch devices and 40 Linux servers). Monitoring process is going to be good except in some cases I get the wrong values in Latest data.
I have created a script to get RAID status of my Linux servers and put it in zabbix_agentd.conf with the proper syntax. Until now everything is OK and when I check the status of a hard disk in RAID array with the use of zabbix_get command I can see the value "Online". The problem is that when I check the status of that Hard disk in the latest data, I see "Failed" value for it !!
The problem is that it is fluctuating, I mean when a check is occurred in Zabbix (1 per 5 minutes) it shows the value "Online" and after 2 or 3 next checks, it gets "Failed" and after some checks it shows "Online" again!!
I am using LLD for discovering Raid type and hard disks inside the arrays. This problem does not happen for all hosts only a few hosts are facing this issue.
Any help is appreciated.
Comment