Hi, we have a very critical system that I am monitoring with Zabbix.
Lately I have a problem: zabbix_server keeps failing.
As a workaround, I wrote a simple crontab script that tries to start the server every 10 minutes.
If the server starts, everything is fine.
If it does not, I have a problem (the same happens with the agents):
the PID file is never deleted, and I have to delete it manually.
I know I could write a script that checks the server/agent and, if it is not running, deletes the PID file and starts it again.
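To make it concrete, the kind of script I have in mind would look roughly like this. It is only a sketch: the function names are my own, and the PID file path and start command in the usage comment are assumptions that depend on the installation. It uses `kill -0` to test whether the PID is alive, which only works when the script runs as root or as the same user as the daemon.

```shell
#!/bin/sh
# Watchdog sketch: remove a stale PID file and start the daemon again.
# All function names here are hypothetical; paths below are assumptions.

# Return 0 if the PID file names a live process, 1 otherwise.
pid_is_alive() {
    pidfile=$1
    [ -s "$pidfile" ] || return 1          # missing or empty file
    pid=$(cat "$pidfile" 2>/dev/null)
    case $pid in
        ''|*[!0-9]*) return 1 ;;           # empty or not a plain number
    esac
    # Signal 0 sends nothing; it only checks that the process exists.
    kill -0 "$pid" 2>/dev/null
}

# Delete the PID file only when no live process owns it.
# Return 0 if the daemon needs a start, 1 if it is already running.
clean_stale_pid() {
    pidfile=$1
    if pid_is_alive "$pidfile"; then
        return 1
    fi
    rm -f "$pidfile"
    return 0
}

# Usage: ensure_running PIDFILE COMMAND [ARGS...]
ensure_running() {
    pidfile=$1
    shift
    if clean_stale_pid "$pidfile"; then
        "$@"
    fi
}

# Only act when called with arguments, for example from cron
# (both paths are assumed, not taken from my real setup):
#   watchdog.sh /var/run/zabbix/zabbix_server.pid /usr/sbin/zabbix_server
if [ "$#" -ge 2 ]; then
    ensure_running "$@"
fi
```

The PID file is removed only after the check fails, so a running server or agent is never touched.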
But maybe there is another solution.
Also, with the agents I noticed that they usually take some time to start.
I delete the PID file, start the agent, and check the process, and it is not running.
Then I delete the PID file again and repeat the same loop until I see it running.
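Possibly I am just checking too early. Instead of deleting the PID file again right away, a script could wait a while for the process to appear, roughly like the sketch below. The function name and the 30-second default are my own assumptions, not anything from Zabbix.

```shell
#!/bin/sh
# Poll until the PID file names a live process or the timeout expires.
# Usage: wait_for_start PIDFILE [TIMEOUT_SECONDS]
# Returns 0 once the process is seen, 1 if it never shows up in time.
wait_for_start() {
    pidfile=$1
    timeout=${2:-30}                       # assumed default, in seconds
    waited=0
    while [ "$waited" -lt "$timeout" ]; do
        if [ -s "$pidfile" ]; then
            pid=$(cat "$pidfile" 2>/dev/null)
            # Signal 0 only tests that the process exists.
            if kill -0 "$pid" 2>/dev/null; then
                return 0
            fi
        fi
        sleep 1
        waited=$((waited + 1))
    done
    return 1
}
```

Only if this returns 1 would the script go on to delete the PID file and try another start.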
This is driving me crazy.
Is there any solution for it?
Has anyone experienced a similar problem and written a custom script?
Thanks