first,thanks to zabbix developer team for this great tool.
here is my problem. i monitored about 50 hosts with zabbix and everything going fine. and today i tried to add another two hosts. these servers are two nodes of an oracle rac,so they have the same hardwares and same OS(rhel5.5).
both server has the green "Z" in zabbix frontend. but one of them has gap in graphs. and some items show as "no data". the other one has nothing like this.
i look into zabbix server log, found something like:
this is the agent config:
i got nothing unusual information in agent log.
if try to pull data with zabbix_get from zabbix server. 8 of 10 can get data back, and 2 will end in time out error. but with ping ,there is no packet lost and time less than 0.5ms.
i'm sure there are no iptable rules in both zabbix server and this error host.
and i can see zabbix_agentd listen on 10050.
before post this thread,i did some google. had tried something like increase "startpollers","Timeout". but no lucky.
any suggestion is appreciated
my zabbix server version is 2.2.3 and agent version is 2.2.1
here is my problem. i monitored about 50 hosts with zabbix and everything going fine. and today i tried to add another two hosts. these servers are two nodes of an oracle rac,so they have the same hardwares and same OS(rhel5.5).
both server has the green "Z" in zabbix frontend. but one of them has gap in graphs. and some items show as "no data". the other one has nothing like this.
i look into zabbix server log, found something like:
10942:20140527:210156.778 Zabbix agent item "vfs.fs.size[/stage,free]" on host "my server name" failed: first network error, wait for 15 seconds.
10958:20140527:210211.901 resuming Zabbix agent checks on host "my host name": connection restored
10958:20140527:210211.901 resuming Zabbix agent checks on host "my host name": connection restored
LogFile=/var/log/zabbix/zabbix_agentd.log
Server=192.168.xx.xx
Hostname=my host name
Server=192.168.xx.xx
Hostname=my host name
if try to pull data with zabbix_get from zabbix server. 8 of 10 can get data back, and 2 will end in time out error. but with ping ,there is no packet lost and time less than 0.5ms.
i'm sure there are no iptable rules in both zabbix server and this error host.
and i can see zabbix_agentd listen on 10050.
before post this thread,i did some google. had tried something like increase "startpollers","Timeout". but no lucky.
any suggestion is appreciated
my zabbix server version is 2.2.3 and agent version is 2.2.1
Comment