Ad Widget

Collapse

Somebody help me !! URGENT!!

Collapse
X
 
  • Time
  • Show
Clear All
new posts
  • Bharathu
    Member
    • Dec 2005
    • 56

    #1

    Somebody help me !! URGENT!!

    Hi!!

    I'm using Zabbix 1.1 beta 4. The zabbix server was installed in Linux. I was not able to get my Windows agent communicate with my zabbix server. i always get this message in my zabbix_agentd.log.

    [09-Jan-2006 14:42:32] *************** Log file opened ****************
    [09-Jan-2006 14:42:32] Collector thread initialized successfully
    [09-Jan-2006 14:42:32] Zabbix Win32 Agent started
    [09-Jan-2006 14:42:53] Active checks [Cannot connect to [10.239.18.13:10051] [No error]]


    I am enclosing the configuration details of my server & agent.

    **********ZABBIX_SERVER.CONF *********

    Server=1
    StartSuckers=6
    StartTrappers=5
    ListenPort=10051
    SenderFrequency=30
    DisableHousekeeping=1
    DebugLevel=3
    Timeout=5
    LogFile=/tmp/zabbix_server.log

    DBHost=localhost
    DBName=zabbix
    DBUser=root

    #DBSocket=/tmp/mysql.sock

    ***********ZABBIX_AGENTD.CONF*****************

    Server=10.239.18.13
    ServerPort=10051
    Hostname=10.239.19.120
    ListenPort=10050
    #ListenIP=10.239.18.13
    StartAgents=5
    #DisableActive=1
    DebugLevel=3
    LogFile=C:\zabbix_agentd.log
    Timeout=3
    Last edited by Bharathu; 11-01-2006, 13:50. Reason: To REDUCE length !!
  • Nate Bell
    Senior Member
    • Feb 2005
    • 141

    #2
    I'll get the ball rolling. I haven't ever tried running a Zabbix Agent on a Win32 box. That said, I have used Windows extensively otherwise, and my first guess would be that your agent can't connect because a firewall running on Windows is blocking the port Zabbix uses.

    Nate

    Comment

    • cameronsto
      Senior Member
      • Oct 2005
      • 148

      #3
      Is there a firewall between the agent and server blocking requests to port 10051?

      -cameron

      Comment

      • elkor
        Senior Member
        • Jul 2005
        • 299

        #4
        Are you only using active checks for the windows box? regardless of a firewall blocking inboud 10051 traffic, if outbound (server -> agent) 10050 traffic is allowed "zabbix agent" type items should function regardless of this message being present in the agent's log file

        Comment

        • Bharathu
          Member
          • Dec 2005
          • 56

          #5
          No firewall on my PC

          Firstly thanks for ur replies....but i dont have any firewalls installed on my PC. Also i ran the agent in 2 modes i.e. "c:\zabbixw32 start" and also in "c:\zabbixw32 standalone". In both ways i get the same error in my log file. In my web interface i see 0 in the field of "number of values stored".

          When i ran my server on the linux box, it was creating 11 threads in total. But when i run the command "strace zabbix_server" i get this message....as follows.


          *****************************************
          rt_sigaction(SIGPIPE, {SIG_IGN}, {SIG_DFL}, 8) = 0
          socket(PF_FILE, SOCK_STREAM, 0) = 4
          fcntl64(4, F_GETFL) = 0x2 (flags O_RDWR)
          connect(4, {sa_family=AF_FILE, path="/var/lib/mysql/mysql.sock"}, 110) = 0
          setsockopt(4, SOL_IP, IP_TOS, [8], 4) = -1 EOPNOTSUPP (Operation not supported)setsockopt(4, SOL_SOCKET, SO_KEEPALIVE, [1], 4) = 0
          read(4, "9\0\0\0", 4) = 4
          read(4, "\n4.1.0-alpha-standard-log\0\333#\0\0t!"..., 57) = 57
          stat64("/usr/share/mysql/charsets/Index.xml", {st_mode=S_IFREG|0777, st_size=17147, ...}) = 0
          open("/usr/share/mysql/charsets/Index.xml", O_RDONLY|O_LARGEFILE) = 5
          read(5, "<?xml version=\'1.0\' encoding=\"ut"..., 17147) = 17147
          close(5) = 0
          write(4, "\22\0\0\1\215\240\0\0\0root\0\0zabbix\0", 22) = 22
          read(4, "\5\0\0\2", 4) = 4
          read(4, "\0\0\0\2\0", 5) = 5
          fcntl64(4, F_SETFL, O_RDWR|O_NONBLOCK) = 0
          read(4, 0x87d6ac8, 8192) = -1 EAGAIN (Resource temporarily unavailable)fcntl64(4, F_SETFL, O_RDWR) = 0
          write(4, "\7\0\0\0\2zabbix", 11) = 11
          read(4, "\5\0\0\1", 4) = 4
          read(4, "\0\0\0\2\0", 5) = 5
          fcntl64(4, F_SETFL, O_RDWR|O_NONBLOCK) = 0
          read(4, 0x87d6ac8, 8192) = -1 EAGAIN (Resource temporarily unavailable)fcntl64(4, F_SETFL, O_RDWR) = 0
          write(4, "\'\0\0\0\3select refresh_unsupported "..., 43) = 43
          read(4, "\1\0\0\1", 4) = 4
          read(4, "\1", 1) = 1
          read(4, "%\0\0\2", 4) = 4
          read(4, "\6config\23refresh_unsupported\3\4\0\0\1"..., 37) = 37
          read(4, "\1\0\0\3", 4) = 4
          read(4, "\376", 1) = 1
          read(4, "\4\0\0\4", 4) = 4
          read(4, "\003600", 4) = 4
          read(4, "\1\0\0\5", 4) = 4
          read(4, "\376", 1) = 1
          getuid32() = 0
          socket(PF_FILE, SOCK_STREAM, 0) = 5
          fcntl64(5, F_GETFL) = 0x2 (flags O_RDWR)
          fcntl64(5, F_SETFL, O_RDWR|O_NONBLOCK) = 0
          connect(5, {sa_family=AF_FILE, path="/var/run/nscd/socket"}, 110) = -1 ENOENT (No such file or directory)close(5) = 0
          socket(PF_FILE, SOCK_STREAM, 0) = 5
          fcntl64(5, F_GETFL) = 0x2 (flags O_RDWR)
          fcntl64(5, F_SETFL, O_RDWR|O_NONBLOCK) = 0
          connect(5, {sa_family=AF_FILE, path="/var/run/nscd/socket"}, 110) = -1 ENOENT (No such file or directory)
          close(5) = 0
          open("/etc/passwd", O_RDONLY) = 5
          fcntl64(5, F_GETFD) = 0
          fcntl64(5, F_SETFD, FD_CLOEXEC) = 0
          fstat64(5, {st_mode=S_IFREG|0644, st_size=2138, ...}) = 0
          mmap2(NULL, 4096, PROT_READ|PROT_WRITE, MAP_PRIVATE|MAP_ANONYMOUS, -1, 0) = 0xb7fe5000
          read(5, "root:x:0:0:root:/root:/bin/bash\n"..., 4096) = 2138
          close(5) = 0
          munmap(0xb7fe5000, 4096) = 0
          setgid32(502) = 0
          setuid32(502) = 0
          setresgid32(-1, 502, -1) = 0
          setresuid32(-1, 502, -1) = 0
          clone(child_stack=0, flags=CLONE_CHILD_CLEARTID|CLONE_CHILD_SETTID|SIGC HLD, child_tidptr=0xb7fe6708) = 27765
          File [/tmp/zabbix_server.pid] exists. Is this process already running ?
          --- SIGCHLD (Child exited) @ 0 (0) ---
          exit_group(0) = ?
          ******************************************

          I'm a total newbie in Linux stuffs....just 2 weeks into linux. I dont know how to make the head and tail of the strace o/p i got..... any ideas?
          waiting for replies guys......

          Regards,
          Bharathu

          Comment

          • elkor
            Senior Member
            • Jul 2005
            • 299

            #6
            Ok,

            one thing at a time. firstly you have your zabbix server

            next you have the box you're monitoring (your pc?) regardless of operating system.

            on the server, through the web interface you must first add the host to be monitored in the configuration/hosts section. When that is done you must add items that you want to monitor for that host in the configuration/items section. To be useful for alerts you should add triggers for the items as well in configuration/triggers but that can be done at a later date.. let's just verify you can collect data.

            Add an item that is checked somewhat frequently and will show activity.. such as cpu load or something. For simplicity sake, have it be of a "zabbix agent" type.

            then fire up the agent on the machine to be monitored, don't worry about the could not communicate with server messages for now. That message just means that the agent can't "call home" to the server to see if it has any active checks to make; it is 100% possible that the system could be working just fine and you still get that message in the agent's log.

            post back here and let us know what you've got.

            Comment

            • Bharathu
              Member
              • Dec 2005
              • 56

              #7
              Still in problem

              Hi

              I tried as you said. I added a host with "host name = 10.239.19.120" this machine has the agent installed and is working on win XP. I also selected "use IP" and entered the same IP of my host. I selected the port to 10051. I did not add anything from the templates. I clearly dont know wat to do with them...so didnt touch it.

              Next i added an ITEM with "description = cpu load", host = 10.239.19.120, type = zabbix agent, and left key and units fiels NULL. This is still not working....when i went to "Monitoring/Latest data"...... it is just showing a "-" for "cpu load" in the last check, last value & change fields.

              I found that it was showing "Cannot connect to [10.239.19.120] [Connection refused] " in Configuration/Host ( for my host --> 10.239.19.120)

              Comment

              • Alexei
                Founder, CEO
                Zabbix Certified Trainer
                Zabbix Certified SpecialistZabbix Certified Professional
                • Sep 2004
                • 5654

                #8
                Originally posted by Bharathu
                Next i added an ITEM with "description = cpu load", host = 10.239.19.120, type = zabbix agent, and left key and units fiels NULL.
                1. Item's key must not be empty!
                2. Check if ZABBIX server is listening port 10051 (netstat -an|grep 10051)
                3. Try to connect to ZABBIX server from the monitored host (telnet <server> 10051)
                Alexei Vladishev
                Creator of Zabbix, Product manager
                New York | Tokyo | Riga
                My Twitter

                Comment

                • Bharathu
                  Member
                  • Dec 2005
                  • 56

                  #9
                  item key

                  hi...
                  wat should i type in item key? for example for cpu load...wat should i take as the key?? And i cannot use telnet..... it does not work....
                  And i get this message when i use netstat command

                  [zabbix@localhost bin]$ netstat -an|grep 10051
                  tcp 0 0 0.0.0.0:10051 0.0.0.0:* LISTEN
                  regds,
                  Bharathu
                  Last edited by Bharathu; 10-01-2006, 16:17.

                  Comment

                  • Bharathu
                    Member
                    • Dec 2005
                    • 56

                    #10
                    server log!!

                    i have the following error in my server log file

                    ****************************************
                    030325:20060110:181048 Starting zabbix_server. ZABBIX 1.1beta4.
                    030327:20060110:181048 server #1 started [Alerter]
                    030329:20060110:181048 server #2 started [Timer]
                    030331:20060110:181048 server #3 started [ICMP pinger]
                    030333:20060110:181048 server #4 started [Escalator]
                    030335:20060110:181048 server #5 started [Poller. SNMP:OFF]
                    030337:20060110:181048 server #6 started [Trapper]
                    030339:20060110:181048 server #7 started [Trapper]
                    030335:20060110:181048 Cannot connect to [10.239.19.120] [Connection refused]
                    030335:20060110:181048 Host [10.239.19.120] will be checked after [60] seconds
                    030341:20060110:181048 server #8 started [Trapper]
                    030346:20060110:181048 server #9 started [Trapper]
                    030348:20060110:181048 server #10 started [Trapper]
                    030325:20060110:181048 server #0 started [Housekeeper]
                    030335:20060110:181148 Cannot connect to [10.239.19.120] [Connection refused]
                    030335:20060110:181148 Host [10.239.19.120] will be checked after [60] seconds
                    030335:20060110:181248 Cannot connect to [10.239.19.120] [Connection refused]
                    030335:20060110:181248 Host [10.239.19.120] will be checked after [60] seconds
                    030335:20060110:182248 Cannot connect to [10.239.19.120] [Connection refused]
                    030335:20060110:182248 Cannot connect to [10.239.19.120] [Connection refused]

                    Comment

                    • Bharathu
                      Member
                      • Dec 2005
                      • 56

                      #11
                      Please Help

                      one more thing ....i could not configure the zabbix server with "net-snmp"support as it gave me errors. So i did not use the option "--with-net-snmp" while configuring the server.
                      Please send me some valuable tips.... thanking u all in anticipation
                      Bharathu
                      Last edited by Bharathu; 11-01-2006, 13:54.

                      Comment

                      • elkor
                        Senior Member
                        • Jul 2005
                        • 299

                        #12
                        documentation is here for item key syntax. The name is simply a label, it is the key that tells the server what to actually monitor.

                        for cpu load the key is system.cpu.load.

                        I know the documentation is sparse, but as soon as you get connectivity I HIGHLY suggest you spend some time going through it and working with the program in detail or it will be a very rocky road

                        Comment

                        • Bharathu
                          Member
                          • Dec 2005
                          • 56

                          #13
                          thanks

                          hi
                          Thanks for the info. I will surely go through the documentation once again as u said. But can u please help me in checking whether my configuration files (server & agent) are correct?? I have attached them in my first post. Also i really dont understand how to make the s/w run in polling mode or active check mode. I heard that active check is the best...please can u help me by just giving an example on how to configure the active check mode and the polling mode. I'm getting confused with terms "server port", "listen port", "listen ip" etc... i feel i am going wrong somewhere here ....which is leading to my problems...
                          thanking u all in anticipation

                          Bharathu
                          Last edited by Bharathu; 11-01-2006, 06:20.

                          Comment

                          • Bharathu
                            Member
                            • Dec 2005
                            • 56

                            #14
                            can anyone check my config file?

                            please... can u check the config file i attached and tell me if i have them right!!! Please guys anyone with any ideas? I am still not able to record any values!!!
                            Last edited by Bharathu; 11-01-2006, 13:51. Reason: no replies!!

                            Comment

                            • Bharathu
                              Member
                              • Dec 2005
                              • 56

                              #15
                              telnet problem

                              hi i did telnet to my linux server from the windows host(telnet 10.239.18.13 10051)... and then immediately i used the netstat command on my linux box to see (netstat -an |grep 10.239.19.120 )
                              Then i got this message....
                              tcp 0 0 10.239.18.13:10051 10.239.19.120:4962 ESTABLISHED

                              later after some time it goes into this state as below

                              tcp 0 0 10.239.18.13:10051 10.239.19.120:4962 TIME_WAIT

                              Dont know why this is happening? How and why it is going to TIME_WAIT state ?? need urgent help..please !!!!!

                              Comment

                              Working...