Problem with zabbix agent for windows

  • Alexei
    Founder, CEO
    Zabbix Certified Trainer
    Zabbix Certified Specialist
    Zabbix Certified Professional
    • Sep 2004
    • 5654

    #31
    Originally posted by oliverm
    If it doesn't affect the agent, then perhaps we are suffering from two different problems, as each agent with that problem also shows as not available on the Zabbix server page. Agents without that problem show as available.

    Could this be related at all? We can telnet to port 10050 on each of the agents from the Zabbix box, so communication isn't the issue.

    Olly
    I am pretty sure it does not affect functionality of the agent. Please could you try release 4496 or higher (32 bit version only, 64 bit agent was not recompiled) when it becomes available at http://www.zabbix.com/developers.php and report if it works.

    I would appreciate this very much!
    Alexei Vladishev
    Creator of Zabbix, Product manager
    New York | Tokyo | Riga
    My Twitter

    • wit
      Junior Member
      • Jul 2007
      • 8

      #32
      Originally posted by Alexei
      I am pretty sure it does not affect functionality of the agent. Please could you try release 4496 or higher (32 bit version only, 64 bit agent was not recompiled) when it becomes available at http://www.zabbix.com/developers.php and report if it works.

      I would appreciate this very much!
      Hi, Alexei.
      I had this problem too (Call to PdhCollectQueryData() failed: No data to return.).
      I have just installed Pre 1.4.2 (stable) 4499 on Windows 2003 SP1.
      The error no longer appears. Thank you.

      I will wait for the 64-bit version of the agent.

      • Andreas Bollhalder
        Senior Member
        Zabbix Certified Specialist
        • Apr 2007
        • 144

        #33
        I have tested the win32 agent of 1.4.2pre on W2k3 SP2 and it still stops reporting values to the ZABBIX server. This time, I was lucky enough to capture the relevant part of the agent log file.

        When everything is OK, the log shows
        Code:
          6908:20070802:140644 Before read
          6908:20070802:140644 In parse_list_of_checks('ZBX_EOF
        ')
          6908:20070802:140644 In disable_all_metrics()
          6908:20070802:140644 Parsed [ZBX_EOF]
          6908:20070802:140644 In process_active_checks('xxx.xxx.xxx.xxx',10051)
          6908:20070802:140644 In get_min_nextcheck()
          6908:20070802:140644 Sleeping for 60 seconds
          6672:20070802:140645 Processing request.
          6672:20070802:140645 In check_security()
          6672:20070802:140645 Requested [system.swap.size[,free]]
          6672:20070802:140645 Sending back [2702532608]
          6672:20070802:140645 Processing request.
          6672:20070802:140645 In check_security()
          6672:20070802:140645 Requested [vfs.fs.size[c:,pfree]]
          6672:20070802:140645 Sending back [42]
          6672:20070802:140646 Processing request.
          6672:20070802:140646 In check_security()
          6672:20070802:140646 Requested [vfs.fs.size[d:,pfree]]
          6672:20070802:140646 Sending back [45]
          6672:20070802:140647 Processing request.
          6672:20070802:140647 In check_security()
          6672:20070802:140647 Requested [vfs.fs.size[e:,pfree]]
          6672:20070802:140647 Sending back [98]
          6672:20070802:140648 Processing request.
          6672:20070802:140648 In check_security()
          6672:20070802:140648 Requested [vfs.fs.size[f:,pfree]]
          6672:20070802:140648 Sending back [86]
          6672:20070802:140652 Processing request.
          6672:20070802:140652 In check_security()
          6672:20070802:140652 Requested [vm.memory.size[free]]
          6672:20070802:140652 Sending back [458485760]
          6672:20070802:140652 Processing request.
          6672:20070802:140652 In check_security()
          6672:20070802:140652 Requested [agent.ping]
          6672:20070802:140652 Sending back [1]
          6672:20070802:140713 Processing request.
          6672:20070802:140713 In check_security()
          6672:20070802:140713 Requested [system.cpu.util[,,avg1]]
          6672:20070802:140713 Sending back [0.600000]
          6672:20070802:140721 Processing request.
          6672:20070802:140721 In check_security()
          6672:20070802:140721 Requested [vm.memory.size[free]]
          6672:20070802:140721 Sending back [458461184]
          6672:20070802:140722 Processing request.
          6672:20070802:140722 In check_security()
          6672:20070802:140722 Requested [agent.ping]
          6672:20070802:140722 Sending back [1]
          6672:20070802:140730 Processing request.
          6672:20070802:140730 In check_security()
          6672:20070802:140730 Requested [service_state[Microsoft Exchange Management]]
          6672:20070802:140730 Sending back [0]
          6672:20070802:140730 Processing request.
          6672:20070802:140730 In check_security()
          6672:20070802:140730 Requested [service_state[Microsoft Exchange Information Store]]
          6672:20070802:140730 Sending back [0]
          6672:20070802:140732 Processing request.
          6672:20070802:140732 In check_security()
          6672:20070802:140732 Requested [service_state[Microsoft Exchange MTA Stacks]]
          6672:20070802:140732 Sending back [0]
          6672:20070802:140733 Processing request.
          6672:20070802:140733 In check_security()
          6672:20070802:140733 Requested [service_state[Microsoft Exchange POP3]]
          6672:20070802:140733 Sending back [0]
          6672:20070802:140734 Processing request.
          6672:20070802:140734 In check_security()
          6672:20070802:140734 Requested [service_state[Microsoft Exchange Routing Engine]]
          6672:20070802:140734 Sending back [0]
          6672:20070802:140735 Processing request.
          6672:20070802:140735 In check_security()
          6672:20070802:140735 Requested [service_state[Microsoft Exchange System Attendant]]
          6672:20070802:140735 Sending back [0]
          6672:20070802:140736 Processing request.
          6672:20070802:140736 In check_security()
          6672:20070802:140736 Requested [service_state[Simple Mail Transfer Protocol (SMTP)]]
          6672:20070802:140736 Sending back [0]
          6672:20070802:140742 Processing request.
          6672:20070802:140742 In check_security()
          6672:20070802:140742 Requested [service_state[Backup Exec Remote Agent for Windows Servers]]
          6672:20070802:140742 Sending back [0]
          6672:20070802:140742 Processing request.
          6672:20070802:140742 In check_security()
          6672:20070802:140742 Requested [service_state[Symantec AntiVirus Client]]
          6672:20070802:140742 Sending back [0]
          6908:20070802:140744 In process_active_checks('xxx.xxx.xxx.xxx',10051)
          6908:20070802:140744 In get_min_nextcheck()
          6908:20070802:140744 Sleeping for 60 seconds
          6672:20070802:140744 Processing request.
          6672:20070802:140744 In check_security()
          6672:20070802:140744 Requested [system.cpu.util[,,avg1]]
          6672:20070802:140744 Sending back [1.150000]
          6672:20070802:140744 Processing request.
          6672:20070802:140744 In check_security()
          6672:20070802:140744 Requested [system.localtime]
          6672:20070802:140744 Sending back [1186056464]
          6672:20070802:140745 Processing request.
          6672:20070802:140745 In check_security()
          6672:20070802:140745 Requested [system.swap.size[,free]]
          6672:20070802:140745 Sending back [2702352384]
          6672:20070802:140745 Processing request.
          6672:20070802:140745 In check_security()
          6672:20070802:140745 Requested [vfs.fs.size[c:,pfree]]
          6672:20070802:140745 Sending back [42]
          6672:20070802:140746 Processing request.
          6672:20070802:140746 In check_security()
          6672:20070802:140746 Requested [vfs.fs.size[d:,pfree]]
          6672:20070802:140746 Sending back [45]
          6672:20070802:140747 Processing request.
          6672:20070802:140747 In check_security()
          6672:20070802:140747 Requested [vfs.fs.size[e:,pfree]]
          6672:20070802:140747 Sending back [98]
          6672:20070802:140748 Processing request.
          6672:20070802:140748 In check_security()
          6672:20070802:140748 Requested [vfs.fs.size[f:,pfree]]
          6672:20070802:140748 Sending back [86]
          6672:20070802:140752 Processing request.
          6672:20070802:140752 In check_security()
          6672:20070802:140752 Requested [vm.memory.size[free]]
          6672:20070802:140752 Sending back [458440704]
          6672:20070802:140752 Processing request.
          6672:20070802:140752 In check_security()
          6672:20070802:140752 Requested [agent.ping]
          6672:20070802:140752 Sending back [1]
          6672:20070802:140813 Processing request.
          6672:20070802:140813 In check_security()
          6672:20070802:140813 Requested [system.cpu.util[,,avg1]]
          6672:20070802:140813 Sending back [0.983333]
          6672:20070802:140821 Processing request.
          6672:20070802:140821 In check_security()
          6672:20070802:140821 Requested [vm.memory.size[free]]
          6672:20070802:140821 Sending back [458276864]
          6672:20070802:140823 Processing request.
          6672:20070802:140823 In check_security()
          6672:20070802:140823 Requested [agent.ping]
          6672:20070802:140823 Sending back [1]
          6672:20070802:140826 Processing request.
          6672:20070802:140826 In check_security()
          6672:20070802:140826 Requested [service_state[Time Synchronization]]
          6672:20070802:140826 Sending back [0]
          6672:20070802:140827 Processing request.
          6672:20070802:140827 In check_security()
          6672:20070802:140827 Requested [service_state[Terminal Services]]
          6672:20070802:140827 Sending back [0]
          6908:20070802:140844 In refresh_metrics('xxx.xxx.xxx.xxx',10051)
          6908:20070802:140844 get_active_checks('xxx.xxx.xxx.xxx',10051)
          6908:20070802:140844 Sending [ZBX_GET_ACTIVE_CHECKS
        chzhwi50.opti.net
        ]
        After the agent died, I only get
        Code:
          6908:20070802:141044 Before read
          6908:20070802:141044 In parse_list_of_checks('ZBX_EOF
        ')
          6908:20070802:141044 In disable_all_metrics()
          6908:20070802:141044 Parsed [ZBX_EOF]
          6908:20070802:141044 In process_active_checks('xxx.xxx.xxx.xxx',10051)
          6908:20070802:141044 In get_min_nextcheck()
          6908:20070802:141044 Sleeping for 60 seconds
          6908:20070802:141144 In process_active_checks('xxx.xxx.xxx.xxx',10051)
          6908:20070802:141144 In get_min_nextcheck()
          6908:20070802:141144 Sleeping for 60 seconds
          6908:20070802:141244 In refresh_metrics('xxx.xxx.xxx.xxx',10051)
          6908:20070802:141244 get_active_checks('xxx.xxx.xxx.xxx',10051)
          6908:20070802:141244 Sending [ZBX_GET_ACTIVE_CHECKS
        chzhwi50.opti.net
        ]
        In the host configuration, it shows
        Code:
        n_icmp-ping			02 Aug 17:37:34		1 		-
        n_tcp-pop3-reachable		02 Aug 14:09:27		1 		-
        n_tcp-rdp-reachable		02 Aug 14:08:43		1 		-
        n_tcp-smtp-reachable		02 Aug 14:09:35		1 		-
        s_mes-information-store-state	02 Aug 14:09:29		Running (0) 	-
        s_mes-management-state		02 Aug 14:09:29		Running (0) 	-
        s_mes-mta-stacks-state		02 Aug 14:09:31		Running (0) 	-
        s_mes-pop3-state		02 Aug 14:09:31		Running (0) 	-
        s_mes-routing-engine-state	02 Aug 14:09:32		Running (0) 	-
        s_mes-smtp-state		02 Aug 14:09:34		Running (0) 	-
        s_mes-system-attendant-state	02 Aug 14:09:33		Running (0) 	-
        s_rdp-state			02 Aug 14:08:25		Running (0) 	-
        s_sav-client-state		02 Aug 14:09:41		Running (0) 	-
        s_time-synchronization-state	02 Aug 14:08:24		Running (0) 	-
        s_vbe-remote-agent-state	02 Aug 14:09:40		Running (0) 	-
        v_cpu-usage			02 Aug 14:09:42		1.03 % 		+0.50 %
        v_localtime			02 Aug 14:09:42		1186056584 	+60
        v_memory-free			02 Aug 14:09:20		437.27 MB 	+92 KB
        v_partition-c:-free		02 Aug 14:08:44		42 % 		-
        v_partition-d:-free		02 Aug 14:08:45		45 % 		-
        v_partition-e:-free		02 Aug 14:08:47		98 % 		-
        v_partition-f:-free		02 Aug 14:08:47		86 % 		-
        v_swap-free			02 Aug 14:08:43		2.52 GB 	+280 KB
        v_zabbix-agent-ping		02 Aug 14:09:22		Up (1) 		-
        I can't tell what exactly happens to the agent. It seems to me that it tries to process the active checks a second time and can't succeed.
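
One way to check this against a debug log is to record when each ID in the first log column last wrote a line. The sketch below assumes the `PID:YYYYMMDD:HHMMSS message` layout seen in the excerpts above; the function name and the sample lines are illustrative and not part of Zabbix.

```python
import re
from datetime import datetime

# Assumed layout, taken from the excerpts above: "PID:YYYYMMDD:HHMMSS message".
LINE_RE = re.compile(r"^\s*(\d+):(\d{8}):(\d{6})\s+(.*)$")


def last_seen_per_pid(lines):
    """Return {pid: (timestamp, message)} for the last line each PID logged."""
    last = {}
    for line in lines:
        match = LINE_RE.match(line)
        if match is None:
            # Continuation lines (for example the tail of a multi-line
            # "Sending [...]" entry) carry no PID prefix and are skipped.
            continue
        pid, day, clock, message = match.groups()
        stamp = datetime.strptime(day + clock, "%Y%m%d%H%M%S")
        last[int(pid)] = (stamp, message)
    return last


sample = [
    "  6672:20070802:140827 Sending back [0]",
    "  6908:20070802:141044 Sleeping for 60 seconds",
    "  6908:20070802:141244 get_active_checks('xxx.xxx.xxx.xxx',10051)",
]

for pid, (stamp, message) in sorted(last_seen_per_pid(sample).items()):
    print(pid, stamp.isoformat(), message)
```

An ID whose last entry is much older than the others is a candidate for the part of the agent that stalled. In the excerpts above, 6672 no longer appears after the failure while 6908 keeps logging.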

        Any ideas ?

        Andreas
        Zabbix statistics
        Total hosts: 380 - Total items: 12190 - Total triggers: 4530 - Required server performance: 224.2

        • Andreas Bollhalder
          Senior Member
          Zabbix Certified Specialist
          • Apr 2007
          • 144

          #34
          My previous message got too long.

          The server is 1.4.1 and the log shows
          Code:
           30247:20070802:123324 Deleted 11082 records from history and trends
           30221:20070802:130743 Get value from agent failed. Error: ZBX_TCP_READ() failed [Connection reset by peer]
           30221:20070802:130743 Host [server]: first network error, wait for 15 seconds
           30221:20070802:130743 Parameter [system.swap.size[,free]] will be checked after 240 seconds on host [server]
           30223:20070802:131221 Get value from agent failed. Error: ZBX_TCP_READ() failed [Connection reset by peer]
           30223:20070802:131221 Host [server]: first network error, wait for 15 seconds
           30223:20070802:131221 Parameter [agent.ping] will be checked after 120 seconds on host [server]
           30247:20070802:133426 Executing housekeeper
           30247:20070802:133445 Deleted 42122 records from history and trends
           30221:20070802:140943 Get value from agent failed. Error: ZBX_TCP_READ() failed [Connection reset by peer]
           30221:20070802:140943 Host [server]: first network error, wait for 15 seconds
           30221:20070802:140943 Parameter [system.swap.size[,free]] will be checked after 240 seconds on host [server]
           30251:20070802:141003 Timeout while answering request
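
To reproduce the server's passive check by hand, a small client can send an item key to the agent and read the reply. This is only a sketch under assumptions: the agent listens on port 10050 (as mentioned earlier in the thread), accepts a key followed by a newline, and may prefix its reply with a `ZBXD\x01` header plus an 8-byte little-endian length. The function names and the host in the usage comment are hypothetical.

```python
import socket
import struct

HEADER = b"ZBXD\x01"  # assumed reply prefix; some agents may send plain text


def parse_reply(raw):
    """Strip the optional header and return the value as text."""
    start = len(HEADER) + 8
    if raw.startswith(HEADER) and len(raw) >= start:
        # The 8 bytes after the prefix are read as the payload length.
        (length,) = struct.unpack("<Q", raw[len(HEADER):start])
        raw = raw[start:start + length]
    return raw.decode("utf-8", errors="replace").strip()


def query_agent(host, key, port=10050, timeout=5.0):
    """Send one item key to the agent and return its reply."""
    with socket.create_connection((host, port), timeout=timeout) as conn:
        conn.sendall(key.encode("utf-8") + b"\n")
        chunks = []
        while True:
            chunk = conn.recv(4096)
            if not chunk:  # the agent closed the connection
                break
            chunks.append(chunk)
    return parse_reply(b"".join(chunks))


# Example (hypothetical host name):
# print(query_agent("agent.example.com", "agent.ping"))
```

If the agent drops the connection the way the server log suggests, the call would be expected to raise `ConnectionResetError` or time out instead of returning a value.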
          Andreas
          Last edited by Andreas Bollhalder; 03-08-2007, 07:55.

          • Andreas Bollhalder
            Senior Member
            Zabbix Certified Specialist
            • Apr 2007
            • 144

            #35
            Hello all

            I'm still having the problem with the agent and server of 1.4.2 stable.

            Any news on this?

            Andreas

            • swaterhouse
              Senior Member
              • Apr 2006
              • 268

              #36
              I am not seeing any issues like this on Win 2000 or Win 2003 (both 32-bit and 64-bit).

              Is this just one agent that has an issue?

              • Andreas Bollhalder
                Senior Member
                Zabbix Certified Specialist
                • Apr 2007
                • 144

                #37
                Yes, I have about 50 servers with the ZABBIX agent and the problem shows up on only one of them. It has occurred with every version I tried.

                Andreas

                • NB-beheer
                  Junior Member
                  • May 2007
                  • 11

                  #38
                  I installed the 1.4.2 Windows client. Still the same problem.

                  • Alexei
                    Founder, CEO
                    Zabbix Certified Trainer
                    Zabbix Certified Specialist
                    Zabbix Certified Professional
                    • Sep 2004
                    • 5654

                    #39
                    Originally posted by NB-beheer
                    I installed the 1.4.2 Windows client. Still the same problem.
                    What problem? The thread is too long to figure out what we are discussing...
                    Alexei Vladishev

                    • NB-beheer
                      Junior Member
                      • May 2007
                      • 11

                      #40
                      Originally posted by Alexei
                      What problem? The thread is too long to figure out what we are discussing...
                      Yeah, I was already looking for my previous post.
                      It's #17. The message on the server side is the same.
                      I've now turned on level 5 debugging on the clients. It had been reset by the installation of 1.4.2.
                      I'll post results when one of the clients hangs again. It's actually two servers out of ten that have this problem. All are Windows 2003.

                      • Alexei
                        Founder, CEO
                        Zabbix Certified Trainer
                        Zabbix Certified Specialist
                        Zabbix Certified Professional
                        • Sep 2004
                        • 5654

                        #41
                        So, you are talking about these messages, right?

                        Code:
                        3676:20070716:093952 Call to PdhCollectQueryData() failed: No data to return.
                        3676:20070716:093953 Call to PdhCollectQueryData() failed: No data to return.
                        3676:20070716:093954 Call to PdhCollectQueryData() failed: No data to return.
                        3676:20070716:093955 Call to PdhCollectQueryData() failed: No data to return.
                        I do not believe 1.4.2 agent still generates them...
                        Alexei Vladishev

                        • NB-beheer
                          Junior Member
                          • May 2007
                          • 11

                          #42
                          Originally posted by Alexei
                          So, you are talking about these messages, right?

                          Code:
                          3676:20070716:093952 Call to PdhCollectQueryData() failed: No data to return.
                          3676:20070716:093953 Call to PdhCollectQueryData() failed: No data to return.
                          3676:20070716:093954 Call to PdhCollectQueryData() failed: No data to return.
                          3676:20070716:093955 Call to PdhCollectQueryData() failed: No data to return.
                          I do not believe 1.4.2 agent still generates them...
                          Agreed. I don't see those anymore.
                          Looks like there were two separate problems. One is solved by 1.4.2.
                          I'll post logfiles as soon as it hangs again.

                          • NB-beheer
                            Junior Member
                            • May 2007
                            • 11

                            #43
                            See attachment from our Windows client.
                            At 0:34 the server sent an unreachable message.
                            Attached Files

                            • eger
                              Member
                              • Nov 2006
                              • 95

                              #44
                              I am still getting this on 1.4.2:

                              Code:
                              zabbix_agentd.exe [3264]: Error: Call to PdhCollectQueryData() failed: No data to return.
                              zabbix_agentd.exe [3264]: Error: Call to PdhCollectQueryData() failed: No data to return.
                              zabbix_agentd.exe [3264]: Error: Call to PdhCollectQueryData() failed: No data to return.
                              zabbix_agentd.exe [3264]: Error: Call to PdhCollectQueryData() failed: No data to return.
                              zabbix_agentd.exe [3264]: Error: Call to PdhCollectQueryData() failed: No data to return.
                              Any ideas?
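
To compare hosts or agent versions, it may help to count how often this message appears per process ID. The sketch below assumes the two line layouts posted in this thread; the function name and sample lines are illustrative, not part of Zabbix.

```python
import re
from collections import Counter

MESSAGE = "Call to PdhCollectQueryData() failed"

# Two layouts appear in this thread (both treated as assumptions here):
#   "3676:20070716:093952 Call to ..."               (agent log file)
#   "zabbix_agentd.exe [3264]: Error: Call to ..."   (as posted in #44)
PID_PATTERNS = (
    re.compile(r"^\s*(\d+):\d{8}:\d{6}\s"),
    re.compile(r"^\s*zabbix_agentd\.exe \[(\d+)\]:"),
)


def count_pdh_failures(lines):
    """Count PdhCollectQueryData() failure lines per PID."""
    counts = Counter()
    for line in lines:
        if MESSAGE not in line:
            continue
        for pattern in PID_PATTERNS:
            match = pattern.match(line)
            if match:
                counts[int(match.group(1))] += 1
                break
    return counts


sample = [
    " 3676:20070716:093952 Call to PdhCollectQueryData() failed: No data to return.",
    "3676:20070716:093953 Call to PdhCollectQueryData() failed: No data to return.",
    "zabbix_agentd.exe [3264]: Error: Call to PdhCollectQueryData() failed: No data to return.",
    "6672:20070802:140652 Sending back [1]",
]
print(count_pdh_failures(sample))
```

Running it against logs taken before and after an upgrade would show whether the message really stopped on a given host.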

                              • Frodo
                                Junior Member
                                • Jun 2007
                                • 25

                                #45
                                Yesterday I upgraded from 1.4.1 to 1.4.2 without any problems.
                                On my WIN2K3 servers with Zabbix agent 1.4.2, the PdhCollectQueryData() entries are no longer in the log file.

                                It seems to be fixed.

                                But the wrong swap size values from Windows are still present.
