Hello,
I am having tough time in getting this icmppingsec/icmpping work for the added hosts. There's this weird issue where for hosts zabbix reports its not reachable - via alert - where, when i right click and select ping from the dashboard host seems to be reachable.
Problem statement : Checking device reachability on zabbix via icmppingsec or icmpping. The reason for icmppingsec is also return response time in seconds.
utility installed : fping - all permissions proper.
Item:
This item config seems to be working where , i am able to get the ping response time in case of icmppingsec and value 1/0 in case of icmpping.
Trigger:
Problem is : For hosts, all of sudden, zabbix says hosts are down - i.e either icmppingsec or icmpping value is zero. But on next iteration of icmpingsec, its proper i.e not zero and trigger becomes OK from PROBLEM.
During the alert, if i check the reachablity, device seems reachable , and on the device, BGP seems fine so no flaps.
How do i debug these kind of issues? All in zabbix logs i see this,
How do i check these kind of issue? From the zabbix perspective, zabbix could not reach host as per logs but from WAN reachablity perspective, device wasn't gone down (no bgp flaps) or no problem in reaching via ping 
PS:
1).Would increasing packet count from 5 to 10 help?
2).Would decreaing interval from 3mintue to 1 minute help? - i know this would choke network and need tuning on zabbix end. I have close to 3.5K devices to poll and about 2 proxies plus one server doing that.
Server Conf:
Proxy1
Proxy2
Any pointers on how do i bring down these kind of alerts are greatly helpful.
Thanks
I am having tough time in getting this icmppingsec/icmpping work for the added hosts. There's this weird issue where for hosts zabbix reports its not reachable - via alert - where, when i right click and select ping from the dashboard host seems to be reachable.
Problem statement : Checking device reachability on zabbix via icmppingsec or icmpping. The reason for icmppingsec is also return response time in seconds.
utility installed : fping - all permissions proper.
Item:
Code:
Key : icmppingsec[,5,,,300,] Type of information: foat Units: s Update Interval: 180 History : 7 Trends: 180
Trigger:
Code:
{WAN-IP-Ping:icmppingsec[,5,,,300,].last(0)}=0
During the alert, if i check the reachablity, device seems reachable , and on the device, BGP seems fine so no flaps.
How do i debug these kind of issues? All in zabbix logs i see this,
Code:
88010:20180116:162256.645 In add_icmpping_item() addr:'172.X.X.X' count:5 interval:0 size:0 timeout:300 88010:20180116:162256.646 In add_pinger_host() addr:'172.X.X.X' 88010:20180116:162256.646 172.X.X.X 88010:20180116:162301.049 read line [172.X.X.X : - - - - -] 88010:20180116:162301.050 host [172.X.X.X] cnt=5 rcv=0 min=0.000000 max=0.000000 sum=0.000000

PS:
1).Would increasing packet count from 5 to 10 help?
2).Would decreaing interval from 3mintue to 1 minute help? - i know this would choke network and need tuning on zabbix end. I have close to 3.5K devices to poll and about 2 proxies plus one server doing that.
Server Conf:
Code:
LogFile=/var/log/zabbix/zabbix_server.log LogFileSize=300 DebugLevel=4 PidFile=/var/run/zabbix/zabbix_server.pid DBName=zabbix DBUser=XXXXX DBPassword=XXXXX StartPollers=185 StartIPMIPollers=1 StartPollersUnreachable=75 StartTrappers=65 StartPingers=95 StartDiscoverers=1 StartSNMPTrapper=1 ListenIP=0.0.0.0 HousekeepingFrequency=2 MaxHousekeeperDelete=300 SenderFrequency=360 CacheSize=1G CacheUpdateFrequency=300 StartDBSyncers=15 HistoryCacheSize=256M HistoryIndexCacheSize=256M TrendCacheSize=1G ValueCacheSize=128M Timeout=30 TrapperTimeout=180 UnreachablePeriod=600 UnavailableDelay=180 AlertScriptsPath=/etc/zabbix/alert.d/ FpingLocation=/usr/local/sbin/fping LogSlowQueries=300 StartProxyPollers=2 ProxyDataFrequency=180
Code:
Server=172.X.X.X Hostname=zbx-proxy1 LogFile=/var/log/zabbix/zabbix_proxy.log LogFileSize=325 DebugLevel=4 PidFile=/var/run/zabbix/zabbix_proxy.pid DBName=zabbix DBUser=XXXXXX DBPassword=XXXXXXX ProxyLocalBuffer=4 ProxyOfflineBuffer=4 ConfigFrequency=120 DataSenderFrequency=10 StartPollers=200 StartPollersUnreachable=100 StartTrappers=45 StartPingers=85 StartSNMPTrapper=1 CacheSize=1G StartDBSyncers=40 HistoryCacheSize=1G HistoryIndexCacheSize=128M Timeout=30 UnreachablePeriod=80 FpingLocation=/usr/local/sbin/fping LogSlowQueries=3000
Code:
Server=172.X.X.X Hostname=zbx-proxy2 LogFile=/var/log/zabbix/zabbix_proxy.log LogFileSize=300 DebugLevel=4 PidFile=/var/run/zabbix/zabbix_proxy.pid DBName=zabbix DBUser=XXXXX DBPassword=XXXXX ProxyLocalBuffer=3 ProxyOfflineBuffer=4 ConfigFrequency=120 DataSenderFrequency=30 StartPollers=215 StartPollersUnreachable=85 StartTrappers=40 StartPingers=80 StartSNMPTrapper=1 HousekeepingFrequency=3 CacheSize=1G StartDBSyncers=50 HistoryCacheSize=1G HistoryIndexCacheSize=1G Timeout=30 UnreachablePeriod=90 FpingLocation=/usr/local/sbin/fping LogSlowQueries=300
Thanks

Comment