I have been setting up a monitoring, where an agent fpings different host, filters the reply time and presenting in a graph like this:
The agent is also pinging itself as a control mechanism
This is the config:
UserParameter=net.ping_cloudflare,c:\drift\fping 1.1.1.1 -n 1 -i -l | "C:\Program Files (x86)\GnuWin32\bin\gawk.exe" "{print substr($5,6); }"
UserParameter=net.ping_localhost,c:\drift\fping 127.0.0.1 -n 1 -i -l | "C:\Program Files (x86)\GnuWin32\bin\gawk.exe" "{print substr($5,6); }"
So in this case, it seems that cloudflare times out around 08:58:43
But I do not that cloudflare is down around that time... So to prove this, I have created an extra control at the localhost:
c:\drift\fping 1.1.1.1 -i -l -D -T -c >> c:\drift\cloudflare.txt
and the cloudflare.txt shows this the the time period:
2023/01/05 08:58:37.110 : Reply[74096] from 1.1.1.1: bytes=32 time=1.4 ms TTL=56
2023/01/05 08:58:38.114 : Reply[74097] from 1.1.1.1: bytes=32 time=2.5 ms TTL=56
2023/01/05 08:58:39.115 : Reply[74098] from 1.1.1.1: bytes=32 time=1.3 ms TTL=56
2023/01/05 08:58:40.117 : Reply[74099] from 1.1.1.1: bytes=32 time=1.3 ms TTL=56
2023/01/05 08:58:41.119 : Reply[74100] from 1.1.1.1: bytes=32 time=2.1 ms TTL=56
2023/01/05 08:58:42.121 : Reply[74101] from 1.1.1.1: bytes=32 time=1.3 ms TTL=56
2023/01/05 08:58:43.123 : Reply[74102] from 1.1.1.1: bytes=32 time=1.7 ms TTL=56
2023/01/05 08:58:44.124 : Reply[74103] from 1.1.1.1: bytes=32 time=1.4 ms TTL=56
2023/01/05 08:58:45.127 : Reply[74104] from 1.1.1.1: bytes=32 time=1.5 ms TTL=56
2023/01/05 08:58:46.128 : Reply[74105] from 1.1.1.1: bytes=32 time=1.2 ms TTL=56
2023/01/05 08:58:47.130 : Reply[74106] from 1.1.1.1: bytes=32 time=1.5 ms TTL=56
2023/01/05 08:58:48.133 : Reply[74107] from 1.1.1.1: bytes=32 time=2.0 ms TTL=56
2023/01/05 08:58:49.141 : Reply[74108] from 1.1.1.1: bytes=32 time=8.0 ms TTL=56
2023/01/05 08:58:50.143 : Reply[74109] from 1.1.1.1: bytes=32 time=1.4 ms TTL=56
2023/01/05 08:58:51.144 : Reply[74110] from 1.1.1.1: bytes=32 time=1.3 ms TTL=56
2023/01/05 08:58:52.146 : Reply[74111] from 1.1.1.1: bytes=32 time=1.2 ms TTL=56
2023/01/05 08:58:53.149 : Reply[74112] from 1.1.1.1: bytes=32 time=2.1 ms TTL=56
indicating no problems at all.
I would like to trust the zabbix monitoring - but I cannot figure out, where the issue is. I was thinking that it might be because the agent is not able to deliver its collection in certain time periods, but I have change the buffersend and buffersize as in my prior post. Any other clues?
Regards, Lars.
The agent is also pinging itself as a control mechanism
This is the config:
UserParameter=net.ping_cloudflare,c:\drift\fping 1.1.1.1 -n 1 -i -l | "C:\Program Files (x86)\GnuWin32\bin\gawk.exe" "{print substr($5,6); }"
UserParameter=net.ping_localhost,c:\drift\fping 127.0.0.1 -n 1 -i -l | "C:\Program Files (x86)\GnuWin32\bin\gawk.exe" "{print substr($5,6); }"
So in this case, it seems that cloudflare times out around 08:58:43
But I do not that cloudflare is down around that time... So to prove this, I have created an extra control at the localhost:
c:\drift\fping 1.1.1.1 -i -l -D -T -c >> c:\drift\cloudflare.txt
and the cloudflare.txt shows this the the time period:
2023/01/05 08:58:37.110 : Reply[74096] from 1.1.1.1: bytes=32 time=1.4 ms TTL=56
2023/01/05 08:58:38.114 : Reply[74097] from 1.1.1.1: bytes=32 time=2.5 ms TTL=56
2023/01/05 08:58:39.115 : Reply[74098] from 1.1.1.1: bytes=32 time=1.3 ms TTL=56
2023/01/05 08:58:40.117 : Reply[74099] from 1.1.1.1: bytes=32 time=1.3 ms TTL=56
2023/01/05 08:58:41.119 : Reply[74100] from 1.1.1.1: bytes=32 time=2.1 ms TTL=56
2023/01/05 08:58:42.121 : Reply[74101] from 1.1.1.1: bytes=32 time=1.3 ms TTL=56
2023/01/05 08:58:43.123 : Reply[74102] from 1.1.1.1: bytes=32 time=1.7 ms TTL=56
2023/01/05 08:58:44.124 : Reply[74103] from 1.1.1.1: bytes=32 time=1.4 ms TTL=56
2023/01/05 08:58:45.127 : Reply[74104] from 1.1.1.1: bytes=32 time=1.5 ms TTL=56
2023/01/05 08:58:46.128 : Reply[74105] from 1.1.1.1: bytes=32 time=1.2 ms TTL=56
2023/01/05 08:58:47.130 : Reply[74106] from 1.1.1.1: bytes=32 time=1.5 ms TTL=56
2023/01/05 08:58:48.133 : Reply[74107] from 1.1.1.1: bytes=32 time=2.0 ms TTL=56
2023/01/05 08:58:49.141 : Reply[74108] from 1.1.1.1: bytes=32 time=8.0 ms TTL=56
2023/01/05 08:58:50.143 : Reply[74109] from 1.1.1.1: bytes=32 time=1.4 ms TTL=56
2023/01/05 08:58:51.144 : Reply[74110] from 1.1.1.1: bytes=32 time=1.3 ms TTL=56
2023/01/05 08:58:52.146 : Reply[74111] from 1.1.1.1: bytes=32 time=1.2 ms TTL=56
2023/01/05 08:58:53.149 : Reply[74112] from 1.1.1.1: bytes=32 time=2.1 ms TTL=56
indicating no problems at all.
I would like to trust the zabbix monitoring - but I cannot figure out, where the issue is. I was thinking that it might be because the agent is not able to deliver its collection in certain time periods, but I have change the buffersend and buffersize as in my prior post. Any other clues?
Regards, Lars.
ort (or hostname
Comment