I have been using Zabbix successfully for a couple of years to monitor devices on remote networks. It is a simple installation that is currently monitoring 123 devices distributed over 18 different remote networks. The server connects to each of these networks via IPSEC links. The network where the server is located uses a slow 1mbs dedicated link for IPSEC traffic. The only traffic that passes over this link to each of the 18 networks is either Zabbix traffic or administrator remote access maintenance traffic. (i.e. on demand ssh traffic or rdp traffic for troubleshooting etc.) Each device is being monitored by the unmodified OS template (i.e. Linux/Windows/BSD) for the device.
Recently, I have been experiencing significant delays when working across this connection performing maintenance tasks. I have determined that the data between the Zabbix server and the devices is consuming all of the bandwidth for the connection. With all other traffic removed from the connections, I am seeing that the Zabbix server is sending and receiving over 2.5 gigabytes of data in each direction daily.
I have searched through the forums, but I can't seem to find anything that:
1 - identifies bandwidth requirements between server and client. (i.e. how much data is transferred between the server and each client when performing standard checks? Is my reported bandwidth to be expected for the number of devices etc.?)
2 - I can increase or decrease the number of pollers, trappers and timers but I can't find any parameter in the zabbix_server.conf file to adjust the timing of the pollers. In my usage case I am really just checking that each device is alive and well. While real-time notifications are desirable, I'd be happy to trade delayed checks with a reduction in bandwidth usage.
Any suggestions? (I realize that the easy answer is just add more bandwidth).
Recently, I have been experiencing significant delays when working across this connection performing maintenance tasks. I have determined that the data between the Zabbix server and the devices is consuming all of the bandwidth for the connection. With all other traffic removed from the connections, I am seeing that the Zabbix server is sending and receiving over 2.5 gigabytes of data in each direction daily.
I have searched through the forums, but I can't seem to find anything that:
1 - identifies bandwidth requirements between server and client. (i.e. how much data is transferred between the server and each client when performing standard checks? Is my reported bandwidth to be expected for the number of devices etc.?)
2 - I can increase or decrease the number of pollers, trappers and timers but I can't find any parameter in the zabbix_server.conf file to adjust the timing of the pollers. In my usage case I am really just checking that each device is alive and well. While real-time notifications are desirable, I'd be happy to trade delayed checks with a reduction in bandwidth usage.
Any suggestions? (I realize that the easy answer is just add more bandwidth).
Comment