Ad Widget

**Linwood** · 30-06-2016, 17:00

Just one general comment. If you have a high-cost (in terms of run time or resources) check you are doing, try to make it gather a lot of information and upload via zabbix_sender.

So if you have to ssh into a device to get items, and there are 10 things you want to monitor, do one external check routine for one item, and inside of that check have it return the other 9 items via zabbix_sender, so you only need to log in once.

One issue you might hit is that there's a hard coded timeout limit of 30 seconds for checks, so you need to be able to get in and out in less than that (there is a configuration file setting for timeout, it just can't be over 30 without patching the system).

I am guessing that you cannot install software on the EMS stations, but if you could you might put a zabbix proxy there, so it could do the polling locally and consolidate and feed back to a central zabbix server to provide some efficiency.

I guess for people to be much more help, it might be worthwhile to indicate where you expect the issue -- is it bandwidth from a central site to the EMS locations, to the actual devices, speed of response from the devices (independent of bandwidth), load on the zabbix server, interference with the existing NMS, or with the EMS stations, or...

**iyossi** · 03-07-2016, 07:35

Originally posted by Linwood

Just one general comment. If you have a high-cost (in terms of run time or resources) check you are doing, try to make it gather a lot of information and upload via zabbix_sender.

So if you have to ssh into a device to get items, and there are 10 things you want to monitor, do one external check routine for one item, and inside of that check have it return the other 9 items via zabbix_sender, so you only need to log in once.

This sounds like a good idea, I'll try it

One issue you might hit is that there's a hard coded timeout limit of 30 seconds for checks, so you need to be able to get in and out in less than that (there is a configuration file setting for timeout, it just can't be over 30 without patching the system).

I have no problem patching it, but I guess there is a good reason for such a timeout.
But, if I continue with your first idea, can't I fork from the first/main external check item, exit the check in few seconds, and send the many results in the next few minutes ?

Regards,
Yossi

**Linwood** · 03-07-2016, 17:21

Originally posted by iyossi

I have no problem patching it, but I guess there is a good reason for such a timeout.

My GUESS is that the limited number of polling processes do not get filled with ones in a wait state, but I really am not sure. I felt the same way and didn't, though I still might, I find some powershell scripts tend to just barely compelte in time, and sometimes not quite in time.

Originally posted by iyossi

But, if I continue with your first idea, can't I fork from the first/main external check item, exit the check in few seconds, and send the many results in the next few minutes ?

Sure. I had one like that which I forked (you need to fork twice of course to keep the parent's demise from taking out the child), and then had it loop to send values without polling. I ended up not using it as I found a better way; mine was a pain as it was going to run for hours (looping, not each test), and in particular while debugging I hated having to track down all those processes and kill them. But there's no reason in principle you can't let them run as long as you want and send with zabbix_sender. Of course, by being disconnected you can't control them from zabbix (enable/disable/restarts/config changes, etc.).

Ad Widget

An advanced configuration question

An advanced configuration question

Comment

Comment

Comment