Ad Widget

Collapse

Orabbix:"Another way to monitor Oracle reloaded"

Collapse
X
 
  • Time
  • Show
Clear All
new posts
  • dalle
    Senior Member
    Zabbix Certified Specialist
    • Mar 2009
    • 402

    #151
    Originally posted by brian
    I don't think it is possible, as they are production servers, but if you let me know what to test, I will do it.
    Ok,
    How many java processes there are? I mean there are other java application running?
    There is a firewall installed? ( May be that the object that send response to Zabbix have problem if packet are dropped)
    Are you sure that is Orabbix java process that consume all your cpu? i mean have you verified with a Top? (because may be that It's not directly Orabbix that use all that CPU but it's just the cause)
    It's hard to debug because on log there aren't error at all. I see that Orabix retrieve all the needed information and sleep for 5 minute, and run so again. I've see that there are some problem to connection on RDBMS may be that there is a firewall between Orabbix and Oracle?
    Andrea Dalle Vacche
    website:http://www.smartmarmot.com/
    e-mail:
    Author of:Mastering Zabbix Book - second edition
    Zabbix Network Monitoring Essentials

    Comment

    • brian
      Member
      • Dec 2009
      • 45

      #152
      Thanks for your helping attitude. Tonight after business hours I will do some testing with orabbix and I will report you back.

      Comment

      • bhiggens
        Junior Member
        • Oct 2010
        • 2

        #153
        Startup Issue with Instance Down

        Here is an issue I encountered when testing Orabbix 1.1.0-RC2. I think I encountered the same issue with previous versions as well.

        When I start Orabbix and one of the database instances is down, the below error is recorded in the log file
        2010-11-19 13:31:16,240 [main] ERROR Orabbix - Error on Configurator for database %My DBName% -->Io exception: The Network Adapter could not establish the connection
        and this error is returned to the prompt at which point orabbix terminates.
        java.lang.NullPointerException at main.main(main.java:107)


        Aside from that, RC2 has been performing just fine.

        Brian

        Comment

        • jvalenzani
          Member
          Zabbix Certified Specialist
          • Sep 2010
          • 53

          #154
          SQLException with 1.1.0 RC2

          I'm testing the rc2 on a development server with SuSE10. Server is receiving empty data and Orabbix is throwing this exception:

          2010-11-24 13:11:00,907 [pool-1-thread-51] ERROR Orabbix - Error on DBEnquiry on query=waits_other on database=XXXXX Error returned is java.sql.SQLException: Connection oracle.jdbc.driver.T4CConnection@19a0c7c is closed.

          Any idea on what's happening?

          Thanks.

          Comment

          • dalle
            Senior Member
            Zabbix Certified Specialist
            • Mar 2009
            • 402

            #155
            Originally posted by jvalenzani
            I'm testing the rc2 on a development server with SuSE10. Server is receiving empty data and Orabbix is throwing this exception:

            2010-11-24 13:11:00,907 [pool-1-thread-51] ERROR Orabbix - Error on DBEnquiry on query=waits_other on database=XXXXX Error returned is java.sql.SQLException: Connection oracle.jdbc.driver.T4CConnection@19a0c7c is closed.

            Any idea on what's happening?

            Thanks.
            May be that there is a firewall between Orabbix and Oracle?
            Can you send me the full log? here on private message is the same as you want
            Cheers

            Andrea
            Andrea Dalle Vacche
            website:http://www.smartmarmot.com/
            e-mail:
            Author of:Mastering Zabbix Book - second edition
            Zabbix Network Monitoring Essentials

            Comment

            • jvalenzani
              Member
              Zabbix Certified Specialist
              • Sep 2010
              • 53

              #156
              They are on the same server.

              I'll send you a PM with full log, but it's pretty similar.

              Comment

              • dalle
                Senior Member
                Zabbix Certified Specialist
                • Mar 2009
                • 402

                #157
                Originally posted by jvalenzani
                They are on the same server.

                I'll send you a PM with full log, but it's pretty similar.
                Please try to replace with this jar
                probably I've introduce a logical error i'm not sure please try with this jar in attach and let me know
                Attached Files
                Andrea Dalle Vacche
                website:http://www.smartmarmot.com/
                e-mail:
                Author of:Mastering Zabbix Book - second edition
                Zabbix Network Monitoring Essentials

                Comment

                • jvalenzani
                  Member
                  Zabbix Certified Specialist
                  • Sep 2010
                  • 53

                  #158
                  It works.

                  thanks for your help.

                  Comment

                  • jvalenzani
                    Member
                    Zabbix Certified Specialist
                    • Sep 2010
                    • 53

                    #159
                    Now Server is throwing some errors, but getting data.
                    Code:
                      6717:20101124:184320.547 Item [AEDESA2:uptime] error: Not supported by Zabbix Agent
                      6722:20101124:184459.173 Item [AEDESA2:archive] error: Type of received value [] is not suitable for value type [Numeric (float)]
                      6722:20101124:184923.424 Item [AEDESA2:archive] error: Type of received value [] is not suitable for value type [Numeric (float)]
                      6734:20101124:185011.214 Executing housekeeper
                      6734:20101124:185015.172 Deleted 0 records from history and trends
                      6715:20101124:185243.693 Item [AEDESA2:perf_counter[\LogicalDisk(C:)\% Free Space]] error: Not supported by Zabbix Agent
                      6718:20101124:185244.693 Item [AEDESA2:perf_counter[\LogicalDisk(D:)\% Free Space]] error: Not supported by Zabbix Agent
                      6719:20101124:185245.693 Item [AEDESA2:perf_counter[\Paging File(_Total)\% Usage]] error: Not supported by Zabbix Agent
                      6718:20101124:185320.752 Item [AEDESA2:uptime] error: Not supported by Zabbix Agent
                      6726:20101124:185409.299 Item [AEDESA2:archive] error: Type of received value [] is not suitable for value type [Numeric (float)]
                      6724:20101124:185910.117 Item [AEDESA2:archive] error: Type of received value [] is not suitable for value type [Numeric (float)]

                    Comment

                    • dalle
                      Senior Member
                      Zabbix Certified Specialist
                      • Mar 2009
                      • 402

                      #160
                      Originally posted by jvalenzani
                      Now Server is throwing some errors, but getting data.
                      Code:
                        6717:20101124:184320.547 Item [AEDESA2:uptime] error: Not supported by Zabbix Agent
                        6722:20101124:184459.173 Item [AEDESA2:archive] error: Type of received value [] is not suitable for value type [Numeric (float)]
                        6722:20101124:184923.424 Item [AEDESA2:archive] error: Type of received value [] is not suitable for value type [Numeric (float)]
                        6734:20101124:185011.214 Executing housekeeper
                        6734:20101124:185015.172 Deleted 0 records from history and trends
                        6715:20101124:185243.693 Item [AEDESA2:perf_counter[\LogicalDisk(C:)\% Free Space]] error: Not supported by Zabbix Agent
                        6718:20101124:185244.693 Item [AEDESA2:perf_counter[\LogicalDisk(D:)\% Free Space]] error: Not supported by Zabbix Agent
                        6719:20101124:185245.693 Item [AEDESA2:perf_counter[\Paging File(_Total)\% Usage]] error: Not supported by Zabbix Agent
                        6718:20101124:185320.752 Item [AEDESA2:uptime] error: Not supported by Zabbix Agent
                        6726:20101124:185409.299 Item [AEDESA2:archive] error: Type of received value [] is not suitable for value type [Numeric (float)]
                        6724:20101124:185910.117 Item [AEDESA2:archive] error: Type of received value [] is not suitable for value type [Numeric (float)]
                      Could you try with this Query file in attach?
                      This should solve your issue with Archivelog.
                      I have no clue about uptime.... it's strange that you have such a problem.
                      please let me know
                      Attached Files
                      Andrea Dalle Vacche
                      website:http://www.smartmarmot.com/
                      e-mail:
                      Author of:Mastering Zabbix Book - second edition
                      Zabbix Network Monitoring Essentials

                      Comment

                      • dalle
                        Senior Member
                        Zabbix Certified Specialist
                        • Mar 2009
                        • 402

                        #161
                        Orabbix 1.1.0-RC3 released

                        I've released the last release of orabbix 1.1.0-RC3 with the last minor bug fixed.
                        On this release there is installation instructions on pdf.

                        more info are available here
                        the software is available for download on Sourceforge such as on my website
                        Andrea
                        Andrea Dalle Vacche
                        website:http://www.smartmarmot.com/
                        e-mail:
                        Author of:Mastering Zabbix Book - second edition
                        Zabbix Network Monitoring Essentials

                        Comment

                        • brian
                          Member
                          • Dec 2009
                          • 45

                          #162
                          Dear Dalle,
                          Answers to your questions:

                          Ok,
                          How many java processes there are? I mean there are other java application running?
                          ps ax | grep java shows only 1 line:
                          15951 pts/1 Sl 0:01 java -Dlog4j.configuration=./conf/log4j.properties -jar orabbix-1.0.4.jar ./conf/config.props

                          There is a firewall installed? ( May be that the object that send response to Zabbix have problem if packet are dropped)
                          No, we do not have any firewall on the oracle and on the zabbix server. They are on the same lan and same switch.


                          Are you sure that is Orabbix java process that consume all your cpu? i mean have you verified with a Top? (because may be that It's not directly Orabbix that use all that CPU but it's just the cause)
                          No. It is not Zabbix server which is loaded, it is the Oracle server. On the zabbix server the load is normal. On Oracle server the process that eats all CPU is "oracle".

                          When I start Orabbix on the zabbix server I can see that on the oracle server cpu usage is going higher: One "oracle" process starts to use the first core of the CPU and oracle process shows 100% on the line in "top". As time passes another oracle processes go up to 100% until 4 oracle processes sitting on the top of the list with 100% CPU usage. See attached screenshot below.
                          Server is a Dell 2950 Quad-Core Xeon with 8GB RAM.
                          Attached Files

                          Comment

                          • brian
                            Member
                            • Dec 2009
                            • 45

                            #163
                            I also tried version 1.1.0RC3 and the result is the same.

                            Comment

                            • dalle
                              Senior Member
                              Zabbix Certified Specialist
                              • Mar 2009
                              • 402

                              #164
                              Originally posted by brian
                              I also tried version 1.1.0RC3 and the result is the same.
                              Brian,
                              probably I've understood what can cause this issue.
                              please try to remove
                              audit.Query from queryfile or just remove audit from
                              Query.props
                              now you see inside Query.props
                              QueryList=archive,audit,dbblockgets,....etc....
                              you should try to remove audit
                              so:
                              QueryList=archive,dbblockgets,...etc...
                              and after restart Orabbix.
                              I say that because if you have a lot of login stored may be that auditing quey is your trouble (and may be that you have some full tables scan on aud$ or sys.dba_audit_session)
                              let me know if this solve.
                              Andrea
                              Last edited by dalle; 29-11-2010, 16:29.
                              Andrea Dalle Vacche
                              website:http://www.smartmarmot.com/
                              e-mail:
                              Author of:Mastering Zabbix Book - second edition
                              Zabbix Network Monitoring Essentials

                              Comment

                              • brian
                                Member
                                • Dec 2009
                                • 45

                                #165
                                Originally posted by dalle
                                Brian,
                                probably I've understood what can cause this issue.
                                please try to remove
                                audit.Query from queryfile
                                Andrea
                                Thanks for the advice. I tried, but it did not help, CPU load is going higher and higher.

                                But your advice gave me an idea. I disabled all but one (Hit ratio) query in the list and after I started orabbix. CPU is normal and I am getting "Hit ratio" data in zabbix.
                                I will enable the queries one-by-one, and I will let you know when the cpu starts to burn.

                                Comment

                                Working...