Ad Widget

**Pada** · 21-09-2013, 22:45

We typically don't use ICMP monitoring inside our network, because we're monitoring the errors/loss reported by the routers.

We do however use ICMP to monitor hosts outside our control to check our Internet connectivity, in which case its typically 2-3 hosts on the Internet per data center.

Our typical setting for monitoring latency is every 5 seconds, with the key: "icmppingsec[,3,100,,1500]"
I did increase our Zabbix server's StartPingers to 5, because as soon as hosts go offline, it cannot accurately keep track of the online hosts. We have about 60 hosts that we monitor in total, across 3 data centers and the icmp pinger buzy value sits at 35%.

My friends who are running a Wireless ISP is using Smokeping to graph/monitor their network connectivity, which seems to work quite well.

If you don't want to go that route and want to reduce the amount of idle DB connections, you can always create extra Zabbix proxies with local DB's, because they'll then do the ICMP stuff for you.

**ILIV** · 21-09-2013, 22:48

Thanks for the input Pada. It was quite interesting to read about your setup. I encourage more people to share this kind of information that we all can see what kind of load to expect in different types of configurations.

**timbo** · 23-09-2013, 03:35

Hi ILIV,

Firstly I'd like to state that I am entirely unqualified to answer this, but I do have a couple of queries.

Would it possibly be better to monitor the host NIC throughput via (net.if.in, net.if.out & net.if.total / perf_counter in Windows)? Thus avoiding flooding the network with (potentially) unnecessary ICMP traffic.

That is of course if you're looking to monitor. Though it sounds as if you're interested in load testing, of which Ping may not be the best tool. Obviously ping is designed to test connectivity and round trip time, but don't forget that hosts/routers will drop ICMP packets if they're experiencing a high load, and the hosts are required to send back exactly the same data that was sent to them.

Perhaps this is your problem - you have one server punching out 100 x icmppingloss, 100 x icmppingsec to 31 hosts (potentially 6200 packets x 1024bytes per cycle), then the server needs to receive (and process) all these request back from the hosts (potentially DDOSing itself). As they all come back in the server needs to read 6200 packets x 1024 byte to ensure that they have arrived unaltered/uncorrupted. I have no idea if the load created from this would significant or not, it may not be, just a thought.

I can't suggest the best (or even a better) tool/method to load test your hosts, but from experience I'd normally gather a baseline of acceptable (or current) throughput (using a monitoring system such as Zabbix). Then perform a one off (weekly/monthly/yearly) load test (using something appropriate to your situation/application). Using the results of the load test, you can then identify the loads/levels at which you would like to be notified, and then perhaps you can create a trigger/action in Zabbix accordingly.

Sorry if I missed the point entirely and ICMP is required in this situation, but personally I limit its use to testing for connectivity, packet loss and latency, not testing network load.

-Timbo

**ILIV** · 23-09-2013, 09:14

Simply put, I've seen time and again that a ping with default settings (1 packet of 56 bytes per 1 second) may fail to reliably demonstrate issues on a congested link or an unreliable connection.

Running ping with a larger packet size is going to give one a better idea of how data intensive applications are going to do when their doing their typical business. In most cases, we're all interested in how the actual applications, increasingly data intensive ones, are going to perform, not just the networking stack.

In my experience, 100 packets sent out rapidly at speed of anywhere from 10-100ms in adaptive mode with preferably larger enough packet size gives me at least approximately good idea about the real state of a link. Anything less may result in an incomplete picture.

I've just learned from experience that this set of options for ping can make all the difference. Depending on what medium is used: ethernet, ffx, dsl, wireless etc. -- especially wireless -- it can be very important. And thus I use this appoach intentionally to test link stability on the LAN.

It is not about emulating an application workload.

Ad Widget

Reducing icmp pinger processes load

Reducing icmp pinger processes load

Comment

Comment

Comment

Comment