Ad Widget

Collapse

windows agent problem

Collapse
X
 
  • Time
  • Show
Clear All
new posts
  • dantheman
    Senior Member
    • May 2006
    • 209

    #1

    windows agent problem

    I'm having problems with my windows agents, active agent items are being collected perfectly, but when I change it to a passive item.. then it stops collecting data. These had been working but it seemed like they quit working over a few week period. I don't think it coincided with an upgrade to teh agent, but it might have. I can't think of anything else that changed around that time that might cause this.

    The agent is up and running on the client end.. from the server I can telnet to port 10050 but here is what happens..
    Code:
    root@zabbix:~# telnet 10.2.1.94 10050
    Trying 10.2.1.94...
    Connected to 10.2.1.94.
    Escape character is '^]'.
    agent.version
    And it just sits there, I do the same thing for a machine that is functioning and it returns the agent version and dies. I am seeing this with the 1.4.4 AND the 1.4.6 agent.

    The servers with problems, 13 - windows 2003, 1 - windows 2000 server, 1 windows XP......

    I have one 2000 server that is working with version 1.4.4 and one 2003 server with 1.4.6 that is working... I can't see a difference in config files, I look through the log files (with debug set to 4 and tried with 5) and only see information about the active checks that are running perfectly fine.

    example config file
    Code:
    # This is config file for zabbix_agentd
    # To get more information about ZABBIX, go [URL]http://www.zabbix.com[/URL]
    ############ GENERAL PARAMETERS #################
    # List of comma delimited IP addresses (or hostnames) of ZABBIX servers. 
    # No spaces allowed. First entry is used for sending active checks.
    # Note that hostnames must resolve hostname->IP address and
    # IP address->hostname.
    Server=10.2.1.123
    # Server port for sending active checks
    ServerPort=10051
    # Unique hostname. Required for active checks.
    Hostname=aubreweigh
    # Listen port. Default is 10050
    ListenPort=10050
    # IP address to bind agent
    # If missing, bind to all available IPs
    ListenIP=10.2.1.94
    # Number of pre-forked instances of zabbix_agentd.
    # Default value is 5
    # This parameter must be between 1 and 16
    StartAgents=2
    # How often refresh list of active checks. 2 minutes by default.
    #RefreshActiveChecks=20
    # Disable active checks. The agent will work in passive mode listening server.
    # DisableActive=1
    #Enable remote commands for ZABBIX agent. By default remote commands disabled.
    EnableRemoteCommands=1
    # Specifies debug level
    # 0 - debug is not created
    # 1 - critical information
    # 2 - error information
    # 3 - warnings (default)
    # 4 - for debugging (produces lots of information)
    DebugLevel=5
    # Name of log file.
    # If not set, syslog will be used
    LogFile=c:\zabbix_agentd.log
    # Spend no more than Timeout seconds on processing
    # Must be between 1 and 30
    Timeout=5
    ####### USER-DEFINED MONITORED PARAMETERS #######
    # Format: UserParameter=<key>,<shell command>
    # Note that shell command must not return empty string or EOL only
    #UserParameter=system.test,who|wc -l
    ### Set of parameter for monitoring MySQL server (v3.23.42 and later)
    ### Change -u<username> and add -p<password> if required
    #UserParameter=mysql.ping,mysqladmin -uroot ping|grep alive|wc -l
    #UserParameter=mysql.uptime,mysqladmin -uroot status|cut -f2 -d":"|cut -f1 -d"T"
    #UserParameter=mysql.threads,mysqladmin -uroot status|cut -f3 -d":"|cut -f1 -d"Q"
    #UserParameter=mysql.questions,mysqladmin -uroot status|cut -f4 -d":"|cut -f1 -d"S"
    #UserParameter=mysql.slowqueries,mysqladmin -uroot status|cut -f5 -d":"|cut -f1 -d"O"
    #UserParameter=mysql.qps,mysqladmin -uroot status|cut -f9 -d":"
    #UserParameter=mysql.version,mysql -V
    PerfCounter = interrupts,"\Processor(0)\Interrupts/sec",60
    log file

    Code:
      1480:20080730:175643 zabbix_agentd started. ZABBIX 1.4.6.
      3900:20080730:175643 zabbix_agentd collector started
      2340:20080730:175643 zabbix_agentd active check started [10.2.1.123:10051]
      3900:20080731:110233 zabbix_agentd collector stopped
      3996:20080731:110237 zabbix_agentd started. ZABBIX 1.4.6.
      2348:20080731:110237 zabbix_agentd collector started
      1080:20080731:110237 zabbix_agentd active check started [10.2.1.123:10051]
      2348:20080731:110328 zabbix_agentd collector stopped
      3552:20080731:110334 zabbix_agentd started. ZABBIX 1.4.6.
       948:20080731:110334 zabbix_agentd collector started
      2616:20080731:110334 zabbix_agentd active check started [10.2.1.123:10051]
      2616:20080731:110334 In init_active_metrics()
      2616:20080731:110334 In refresh_metrics('10.2.1.123',10051)
      2616:20080731:110334 get_active_checks('10.2.1.123',10051)
       948:20080731:110334 In GetCounterName() [index:6]
      2616:20080731:110334 Sending [ZBX_GET_ACTIVE_CHECKS
    aubreweigh
    ]
      2616:20080731:110334 Before read
      2616:20080731:110334 In parse_list_of_checks() [agent.version:86400:0
    proc.num[]:240:0
    system.cpu.load[,avg1]:120:0
    system.cpu.util[,system,avg1]:60:0
    system.swap.size[,free]:300:0
    system.uname:3600:0
    vfs.fs.size[c:,free]:1200:0
    vfs.fs.size[c:,total]:86400:0
    vm.memory.size[free]:150:0
    vm.memory.size[total]:86400:0
    ZBX_EOF
    ]
      2616:20080731:110334 In disable_all_metrics()
      2616:20080731:110334 Parsed [agent.version:86400:0]
      2616:20080731:110334 In add_check('agent.version', 86400, 0)
      2616:20080731:110334 Parsed [proc.num[]:240:0]
      2616:20080731:110334 In add_check('proc.num[]', 240, 0)
      2616:20080731:110334 Parsed [system.cpu.load[,avg1]:120:0]
      2616:20080731:110334 In add_check('system.cpu.load[,avg1]', 120, 0)
      2616:20080731:110334 Parsed [system.cpu.util[,system,avg1]:60:0]
      2616:20080731:110334 In add_check('system.cpu.util[,system,avg1]', 60, 0)
      2616:20080731:110334 Parsed [system.swap.size[,free]:300:0]
      2616:20080731:110334 In add_check('system.swap.size[,free]', 300, 0)
      2616:20080731:110334 Parsed [system.uname:3600:0]
      2616:20080731:110334 In add_check('system.uname', 3600, 0)
      2616:20080731:110334 Parsed [vfs.fs.size[c:,free]:1200:0]
       948:20080731:110334 In GetCounterName() [index:238]
      2616:20080731:110334 In add_check('vfs.fs.size[c:,free]', 1200, 0)
      2616:20080731:110334 Parsed [vfs.fs.size[c:,total]:86400:0]
      2616:20080731:110334 In add_check('vfs.fs.size[c:,total]', 86400, 0)
      2616:20080731:110334 Parsed [vm.memory.size[free]:150:0]
      2616:20080731:110334 In add_check('vm.memory.size[free]', 150, 0)
      2616:20080731:110334 Parsed [vm.memory.size[total]:86400:0]
      2616:20080731:110334 In add_check('vm.memory.size[total]', 86400, 0)
      2616:20080731:110334 Parsed [ZBX_EOF]
      2616:20080731:110334 In process_active_checks('10.2.1.123',10051)
      2616:20080731:110334 For key [agent.version] received value [1.4.6]
       948:20080731:110334 In GetCounterName() [index:6]
       948:20080731:110334 In GetCounterName() [index:238]
      2616:20080731:110334 XML before sending [<req><host>YXVicmV3ZWlnaA==</host><key>YWdlbnQudmVyc2lvbg==</key><data>MS40LjY=</data></req>]
       948:20080731:110334 In GetCounterName() [index:44]
       948:20080731:110334 In GetCounterName() [index:2]
      2616:20080731:110334 OK
      2616:20080731:110334 For key [proc.num[]] received value [29]
      2616:20080731:110334 XML before sending [<req><host>YXVicmV3ZWlnaA==</host><key>cHJvYy5udW1bXQ==</key><data>Mjk=</data></req>]
      2616:20080731:110334 OK
      2616:20080731:110334 For key [system.cpu.load[,avg1]] received value [0.016667]
      2616:20080731:110334 XML before sending [<req><host>YXVicmV3ZWlnaA==</host><key>c3lzdGVtLmNwdS5sb2FkWyxhdmcxXQ==</key><data>MC4wMTY2Njc=</data></req>]
      2616:20080731:110334 OK
      2616:20080731:110334 For key [system.cpu.util[,system,avg1]] received value [1.650000]
      2616:20080731:110334 XML before sending [<req><host>YXVicmV3ZWlnaA==</host><key>c3lzdGVtLmNwdS51dGlsWyxzeXN0ZW0sYXZnMV0=</key><data>MS42NTAwMDA=</data></req>]
      2616:20080731:110334 OK
      2616:20080731:110334 For key [system.swap.size[,free]] received value [2315456512]
      2616:20080731:110334 XML before sending [<req><host>YXVicmV3ZWlnaA==</host><key>c3lzdGVtLnN3YXAuc2l6ZVssZnJlZV0=</key><data>MjMxNTQ1NjUxMg==</data></req>]
      2616:20080731:110334 OK
      2616:20080731:110334 For key [system.uname] received value [Windows AUBREWEIGH 5.1.2600 Windows XP Service Pack 2 Intel IA-32]
      2616:20080731:110334 XML before sending [<req><host>YXVicmV3ZWlnaA==</host><key>c3lzdGVtLnVuYW1l</key><data>V2luZG93cyBBVUJSRVdFSUdIIDUuMS4yNjAwIFdpbmRvd3MgWFAgU2VydmljZSBQYWNrIDIgSW50ZWwgSUEtMzI=</data></req>]
      2616:20080731:110334 OK
      2616:20080731:110334 For key [vfs.fs.size[c:,free]] received value [30691590144]
      2616:20080731:110334 XML before sending [<req><host>YXVicmV3ZWlnaA==</host><key>dmZzLmZzLnNpemVbYzosZnJlZV0=</key><data>MzA2OTE1OTAxNDQ=</data></req>]
      2616:20080731:110334 OK
      2616:20080731:110334 For key [vfs.fs.size[c:,total]] received value [39950184448]
      2616:20080731:110334 XML before sending [<req><host>YXVicmV3ZWlnaA==</host><key>dmZzLmZzLnNpemVbYzosdG90YWxd</key><data>Mzk5NTAxODQ0NDg=</data></req>]
      2616:20080731:110334 OK
      2616:20080731:110334 For key [vm.memory.size[free]] received value [595849216]
      2616:20080731:110334 XML before sending [<req><host>YXVicmV3ZWlnaA==</host><key>dm0ubWVtb3J5LnNpemVbZnJlZV0=</key><data>NTk1ODQ5MjE2</data></req>]
      2616:20080731:110334 OK
      2616:20080731:110334 For key [vm.memory.size[total]] received value [1063337984]
      2616:20080731:110334 XML before sending [<req><host>YXVicmV3ZWlnaA==</host><key>dm0ubWVtb3J5LnNpemVbdG90YWxd</key><data>MTA2MzMzNzk4NA==</data></req>]
      2616:20080731:110334 OK
      2616:20080731:110334 In get_min_nextcheck()
      2616:20080731:110334 Sleeping for 60 seconds
      2616:20080731:110434 In process_active_checks('10.2.1.123',10051)
      2616:20080731:110434 For key [system.cpu.util[,system,avg1]] received value [2.233333]
      2616:20080731:110434 XML before sending [<req><host>YXVicmV3ZWlnaA==</host><key>c3lzdGVtLmNwdS51dGlsWyxzeXN0ZW0sYXZnMV0=</key><data>Mi4yMzMzMzM=</data></req>]
      2616:20080731:110434 OK
      2616:20080731:110434 In get_min_nextcheck()
      2616:20080731:110434 Sleeping for 60 seconds
  • dantheman
    Senior Member
    • May 2006
    • 209

    #2
    I'm not sure what else to try/check here... if anyone else has any ideas, I'd really appreciate it. ...

    It just looks like the agent is not returning a value and just sitting, but I don't know how to kick start it to get it working again. If I run netstat -a on the windows machine I see zabbix waiting for a connection on port 10050, and then I also see an accumulation of connections on port 10050 from the zabbix server. So it looks like they are even taking a while to time out, if they are at all.

    THanks,
    Dan

    Comment

    • trikke
      Senior Member
      • Aug 2007
      • 140

      #3
      Hi dan,

      just to be shure, your hostname in zabbix has to be exactly the same as the "Hostname=" Parm in the Agents Configuration file.
      I see that u bind your agent to a specific IP. What does nslookup for that ip tells?? ( reverse lookup problem?).
      Else try to sniff the traffic on 10050 and 10051, thsi might give u some more hints,

      Greets
      Patrick

      Comment

      Working...