Good morning everybody, i have a problem with a few servers im trying to monitor and its driving me crazy. I added 5 new servers today, all within a private network (10.100.100.6 ... 10.100.100.11), and only one's working. All those servers have the same config, all active checks, and still, 5 of them give me the error "no active checks on server: host [] not found". I've checked 100 times that the hostname in the zabbix frontend matches the hostname in the agentd.conf, and tried several hostnames, all of different lengths, with/without numbers, etc..., and neither worked. When i turn up the debuglevel up to 4, all i see on the server's log is:
Same message in the zabbix server's side
I've checked that the zabbix server port 10051 is accesible from the servers with telnet, and that they both can connect succesfully. I've even tried to add the two of them in a specific vpn, and that didin't help, so connectivity issues are ruled out.
Does anybody have any idea on how to proceed? Any help would be deeply appreciated.
Thanks!!
Code:
22001:20150107:162637.478 Starting Zabbix Agent [lebfra]. Zabbix 2.0.12 (revision 45390).
22001:20150107:162637.479 In init_collector_data()
22001:20150107:162637.480 End of init_collector_data()
22002:20150107:162637.482 agent #0 started [collector]
22002:20150107:162637.483 In init_cpu_collector()
22003:20150107:162637.483 agent #1 started [active checks]
22003:20150107:162637.484 In init_active_metrics()
22003:20150107:162637.484 Buffer: first allocation for 100 elements
22002:20150107:162637.485 End of init_cpu_collector():SUCCEED
22002:20150107:162637.486 In update_cpustats()
22003:20150107:162637.484 In send_buffer() host:'200.45.235.164' port:10051 values:0/100
22003:20150107:162637.488 End of send_buffer():SUCCEED
22003:20150107:162637.488 refresh_active_checks() host:'200.45.235.164' port:10051
22002:20150107:162637.489 End of update_cpustats()
22003:20150107:162637.797 sending [{
"request":"active checks",
"host":"lebfra"}]
22003:20150107:162637.798 before read
22003:20150107:162638.428 got [{
"response":"failed",
"info":"host [lebfra] not found"}]
22003:20150107:162638.428 In parse_list_of_checks()
22003:20150107:162638.428 In disable_all_metrics()
22003:20150107:162638.429 no active checks on server: host [lebfra] not found
22003:20150107:162638.429 In process_active_checks('200.45.235.164',10051)
22003:20150107:162638.429 End of process_active_checks()
22003:20150107:162638.430 In get_min_nextcheck()
22003:20150107:162638.430 In send_buffer() host:'200.45.235.164' port:10051 values:0/100
22003:20150107:162638.430 End of send_buffer():SUCCEED
22003:20150107:162638.431 Sleeping for 1 second(s)
Same message in the zabbix server's side
Code:
28985:20150107:164446.988 cannot send list of active checks to [10.16.3.195]: host [lebfra] not found
Does anybody have any idea on how to proceed? Any help would be deeply appreciated.
Thanks!!
Comment