I'm having problems with my windows agents, active agent items are being collected perfectly, but when I change it to a passive item.. then it stops collecting data. These had been working but it seemed like they quit working over a few week period. I don't think it coincided with an upgrade to teh agent, but it might have. I can't think of anything else that changed around that time that might cause this.
The agent is up and running on the client end.. from the server I can telnet to port 10050 but here is what happens..
And it just sits there, I do the same thing for a machine that is functioning and it returns the agent version and dies. I am seeing this with the 1.4.4 AND the 1.4.6 agent.
The servers with problems, 13 - windows 2003, 1 - windows 2000 server, 1 windows XP......
I have one 2000 server that is working with version 1.4.4 and one 2003 server with 1.4.6 that is working... I can't see a difference in config files, I look through the log files (with debug set to 4 and tried with 5) and only see information about the active checks that are running perfectly fine.
example config file
log file
The agent is up and running on the client end.. from the server I can telnet to port 10050 but here is what happens..
Code:
root@zabbix:~# telnet 10.2.1.94 10050 Trying 10.2.1.94... Connected to 10.2.1.94. Escape character is '^]'. agent.version
The servers with problems, 13 - windows 2003, 1 - windows 2000 server, 1 windows XP......
I have one 2000 server that is working with version 1.4.4 and one 2003 server with 1.4.6 that is working... I can't see a difference in config files, I look through the log files (with debug set to 4 and tried with 5) and only see information about the active checks that are running perfectly fine.
example config file
Code:
# This is config file for zabbix_agentd # To get more information about ZABBIX, go [URL]http://www.zabbix.com[/URL] ############ GENERAL PARAMETERS ################# # List of comma delimited IP addresses (or hostnames) of ZABBIX servers. # No spaces allowed. First entry is used for sending active checks. # Note that hostnames must resolve hostname->IP address and # IP address->hostname. Server=10.2.1.123 # Server port for sending active checks ServerPort=10051 # Unique hostname. Required for active checks. Hostname=aubreweigh # Listen port. Default is 10050 ListenPort=10050 # IP address to bind agent # If missing, bind to all available IPs ListenIP=10.2.1.94 # Number of pre-forked instances of zabbix_agentd. # Default value is 5 # This parameter must be between 1 and 16 StartAgents=2 # How often refresh list of active checks. 2 minutes by default. #RefreshActiveChecks=20 # Disable active checks. The agent will work in passive mode listening server. # DisableActive=1 #Enable remote commands for ZABBIX agent. By default remote commands disabled. EnableRemoteCommands=1 # Specifies debug level # 0 - debug is not created # 1 - critical information # 2 - error information # 3 - warnings (default) # 4 - for debugging (produces lots of information) DebugLevel=5 # Name of log file. # If not set, syslog will be used LogFile=c:\zabbix_agentd.log # Spend no more than Timeout seconds on processing # Must be between 1 and 30 Timeout=5 ####### USER-DEFINED MONITORED PARAMETERS ####### # Format: UserParameter=<key>,<shell command> # Note that shell command must not return empty string or EOL only #UserParameter=system.test,who|wc -l ### Set of parameter for monitoring MySQL server (v3.23.42 and later) ### Change -u<username> and add -p<password> if required #UserParameter=mysql.ping,mysqladmin -uroot ping|grep alive|wc -l #UserParameter=mysql.uptime,mysqladmin -uroot status|cut -f2 -d":"|cut -f1 -d"T" #UserParameter=mysql.threads,mysqladmin -uroot status|cut -f3 -d":"|cut -f1 -d"Q" #UserParameter=mysql.questions,mysqladmin -uroot status|cut -f4 -d":"|cut -f1 -d"S" #UserParameter=mysql.slowqueries,mysqladmin -uroot status|cut -f5 -d":"|cut -f1 -d"O" #UserParameter=mysql.qps,mysqladmin -uroot status|cut -f9 -d":" #UserParameter=mysql.version,mysql -V PerfCounter = interrupts,"\Processor(0)\Interrupts/sec",60
Code:
1480:20080730:175643 zabbix_agentd started. ZABBIX 1.4.6.
3900:20080730:175643 zabbix_agentd collector started
2340:20080730:175643 zabbix_agentd active check started [10.2.1.123:10051]
3900:20080731:110233 zabbix_agentd collector stopped
3996:20080731:110237 zabbix_agentd started. ZABBIX 1.4.6.
2348:20080731:110237 zabbix_agentd collector started
1080:20080731:110237 zabbix_agentd active check started [10.2.1.123:10051]
2348:20080731:110328 zabbix_agentd collector stopped
3552:20080731:110334 zabbix_agentd started. ZABBIX 1.4.6.
948:20080731:110334 zabbix_agentd collector started
2616:20080731:110334 zabbix_agentd active check started [10.2.1.123:10051]
2616:20080731:110334 In init_active_metrics()
2616:20080731:110334 In refresh_metrics('10.2.1.123',10051)
2616:20080731:110334 get_active_checks('10.2.1.123',10051)
948:20080731:110334 In GetCounterName() [index:6]
2616:20080731:110334 Sending [ZBX_GET_ACTIVE_CHECKS
aubreweigh
]
2616:20080731:110334 Before read
2616:20080731:110334 In parse_list_of_checks() [agent.version:86400:0
proc.num[]:240:0
system.cpu.load[,avg1]:120:0
system.cpu.util[,system,avg1]:60:0
system.swap.size[,free]:300:0
system.uname:3600:0
vfs.fs.size[c:,free]:1200:0
vfs.fs.size[c:,total]:86400:0
vm.memory.size[free]:150:0
vm.memory.size[total]:86400:0
ZBX_EOF
]
2616:20080731:110334 In disable_all_metrics()
2616:20080731:110334 Parsed [agent.version:86400:0]
2616:20080731:110334 In add_check('agent.version', 86400, 0)
2616:20080731:110334 Parsed [proc.num[]:240:0]
2616:20080731:110334 In add_check('proc.num[]', 240, 0)
2616:20080731:110334 Parsed [system.cpu.load[,avg1]:120:0]
2616:20080731:110334 In add_check('system.cpu.load[,avg1]', 120, 0)
2616:20080731:110334 Parsed [system.cpu.util[,system,avg1]:60:0]
2616:20080731:110334 In add_check('system.cpu.util[,system,avg1]', 60, 0)
2616:20080731:110334 Parsed [system.swap.size[,free]:300:0]
2616:20080731:110334 In add_check('system.swap.size[,free]', 300, 0)
2616:20080731:110334 Parsed [system.uname:3600:0]
2616:20080731:110334 In add_check('system.uname', 3600, 0)
2616:20080731:110334 Parsed [vfs.fs.size[c:,free]:1200:0]
948:20080731:110334 In GetCounterName() [index:238]
2616:20080731:110334 In add_check('vfs.fs.size[c:,free]', 1200, 0)
2616:20080731:110334 Parsed [vfs.fs.size[c:,total]:86400:0]
2616:20080731:110334 In add_check('vfs.fs.size[c:,total]', 86400, 0)
2616:20080731:110334 Parsed [vm.memory.size[free]:150:0]
2616:20080731:110334 In add_check('vm.memory.size[free]', 150, 0)
2616:20080731:110334 Parsed [vm.memory.size[total]:86400:0]
2616:20080731:110334 In add_check('vm.memory.size[total]', 86400, 0)
2616:20080731:110334 Parsed [ZBX_EOF]
2616:20080731:110334 In process_active_checks('10.2.1.123',10051)
2616:20080731:110334 For key [agent.version] received value [1.4.6]
948:20080731:110334 In GetCounterName() [index:6]
948:20080731:110334 In GetCounterName() [index:238]
2616:20080731:110334 XML before sending [<req><host>YXVicmV3ZWlnaA==</host><key>YWdlbnQudmVyc2lvbg==</key><data>MS40LjY=</data></req>]
948:20080731:110334 In GetCounterName() [index:44]
948:20080731:110334 In GetCounterName() [index:2]
2616:20080731:110334 OK
2616:20080731:110334 For key [proc.num[]] received value [29]
2616:20080731:110334 XML before sending [<req><host>YXVicmV3ZWlnaA==</host><key>cHJvYy5udW1bXQ==</key><data>Mjk=</data></req>]
2616:20080731:110334 OK
2616:20080731:110334 For key [system.cpu.load[,avg1]] received value [0.016667]
2616:20080731:110334 XML before sending [<req><host>YXVicmV3ZWlnaA==</host><key>c3lzdGVtLmNwdS5sb2FkWyxhdmcxXQ==</key><data>MC4wMTY2Njc=</data></req>]
2616:20080731:110334 OK
2616:20080731:110334 For key [system.cpu.util[,system,avg1]] received value [1.650000]
2616:20080731:110334 XML before sending [<req><host>YXVicmV3ZWlnaA==</host><key>c3lzdGVtLmNwdS51dGlsWyxzeXN0ZW0sYXZnMV0=</key><data>MS42NTAwMDA=</data></req>]
2616:20080731:110334 OK
2616:20080731:110334 For key [system.swap.size[,free]] received value [2315456512]
2616:20080731:110334 XML before sending [<req><host>YXVicmV3ZWlnaA==</host><key>c3lzdGVtLnN3YXAuc2l6ZVssZnJlZV0=</key><data>MjMxNTQ1NjUxMg==</data></req>]
2616:20080731:110334 OK
2616:20080731:110334 For key [system.uname] received value [Windows AUBREWEIGH 5.1.2600 Windows XP Service Pack 2 Intel IA-32]
2616:20080731:110334 XML before sending [<req><host>YXVicmV3ZWlnaA==</host><key>c3lzdGVtLnVuYW1l</key><data>V2luZG93cyBBVUJSRVdFSUdIIDUuMS4yNjAwIFdpbmRvd3MgWFAgU2VydmljZSBQYWNrIDIgSW50ZWwgSUEtMzI=</data></req>]
2616:20080731:110334 OK
2616:20080731:110334 For key [vfs.fs.size[c:,free]] received value [30691590144]
2616:20080731:110334 XML before sending [<req><host>YXVicmV3ZWlnaA==</host><key>dmZzLmZzLnNpemVbYzosZnJlZV0=</key><data>MzA2OTE1OTAxNDQ=</data></req>]
2616:20080731:110334 OK
2616:20080731:110334 For key [vfs.fs.size[c:,total]] received value [39950184448]
2616:20080731:110334 XML before sending [<req><host>YXVicmV3ZWlnaA==</host><key>dmZzLmZzLnNpemVbYzosdG90YWxd</key><data>Mzk5NTAxODQ0NDg=</data></req>]
2616:20080731:110334 OK
2616:20080731:110334 For key [vm.memory.size[free]] received value [595849216]
2616:20080731:110334 XML before sending [<req><host>YXVicmV3ZWlnaA==</host><key>dm0ubWVtb3J5LnNpemVbZnJlZV0=</key><data>NTk1ODQ5MjE2</data></req>]
2616:20080731:110334 OK
2616:20080731:110334 For key [vm.memory.size[total]] received value [1063337984]
2616:20080731:110334 XML before sending [<req><host>YXVicmV3ZWlnaA==</host><key>dm0ubWVtb3J5LnNpemVbdG90YWxd</key><data>MTA2MzMzNzk4NA==</data></req>]
2616:20080731:110334 OK
2616:20080731:110334 In get_min_nextcheck()
2616:20080731:110334 Sleeping for 60 seconds
2616:20080731:110434 In process_active_checks('10.2.1.123',10051)
2616:20080731:110434 For key [system.cpu.util[,system,avg1]] received value [2.233333]
2616:20080731:110434 XML before sending [<req><host>YXVicmV3ZWlnaA==</host><key>c3lzdGVtLmNwdS51dGlsWyxzeXN0ZW0sYXZnMV0=</key><data>Mi4yMzMzMzM=</data></req>]
2616:20080731:110434 OK
2616:20080731:110434 In get_min_nextcheck()
2616:20080731:110434 Sleeping for 60 seconds
Comment