Many Servers are unreachable for more than 5 minutes

  • moneynut
    Member
    • Mar 2014
    • 37

    #31
    Originally posted by tchjts1
    We have asked you for information (that you have not provided) to try and help you out. As I have mentioned in this thread, these graphs are going to tell a lot about your setup and what may need tweaking.



    Provide a 24 hour view of Zabbix Internal process busy graph, Zabbix data gathering process busy graph and Zabbix Cache usage graph.

    I don't mean to be curt, but you ask "what's next" when you are disregarding troubleshooting steps we have asked for way back in this thread.
    I didn't intentionally disregard it. I've been on other forums, quite busy with Zabbix, working with others to figure out the issue. I'll attach it in 5 minutes; I forgot to attach it to this post the other day.

    Comment

    • moneynut
      Member
      • Mar 2014
      • 37

      #32
      Actually, there is no data in the data gathering graphs or the other graphs.

      I've attached it anyway. It looks the same for all graphs.
      Last edited by moneynut; 26-03-2014, 20:49.

      Comment

      • moneynut
        Member
        • Mar 2014
        • 37

        #33
        Here is the 7-day graph.
        Last edited by moneynut; 26-03-2014, 20:49.

        Comment

        • moneynut
          Member
          • Mar 2014
          • 37

          #34
          Sorry for the spam. Unable to attach other graphs.

          Here is the link to Internal and cache graphs.


          Comment

          • aib
            Senior Member
            • Jan 2014
            • 1615

            #35
            Does your server actually work, or has it ever worked?

            Could you check it with this command:
            Code:
            # tail -f /var/log/zabbix/zabbix_server.log
            and tell us if anything shows up in the log file?
            Sincerely yours,
            Aleksey

            Comment

            • moneynut
              Member
              • Mar 2014
              • 37

              #36
              Originally posted by aib
              Does your server actually work, or has it ever worked?

              Could you check it with this command:
              Code:
              # tail -f /var/log/zabbix/zabbix_server.log
              and tell us if anything shows up in the log file?
              It works fine. It just sends false alerts. But yes, it does alert if a host really goes down, and for CPU, disk space, etc. I just don't know why there were no graphs.

              Log tail:

              17019:20140321:121657.881 Zabbix agent item "net.if.in[Microsoft ISATAP Adapter #2]" on host "HOST-008" failed: first network error, wait for 15 seconds
              17023:20140321:121721.897 resuming Zabbix agent checks on host "BSC-008": connection restored
              17022:20140321:121754.057 resuming Zabbix agent checks on host "HOST-128": connection restored
              17017:20140321:121824.945 Zabbix agent item "net.if.in[WAN Miniport (IPv6)-QoS Packet Scheduler-0000]" on host "HOST-155" failed: first network error, wait for 15 seconds
              17022:20140321:121854.705 resuming Zabbix agent checks on host "HOST-102": connection restored
              17022:20140321:121855.035 resuming Zabbix agent checks on host "HOST-098": connection restored
              17022:20140321:121855.361 resuming Zabbix agent checks on host "QA-SERVER": connection restored
              17022:20140321:121905.778 resuming Zabbix agent checks on host "HOST-222": connection restored
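
For anyone reading a longer log of this kind, here is a minimal Python sketch that counts "first network error" and "connection restored" messages per host, which may help show whether a few hosts are flapping or the problem is spread across all of them. It assumes the `PID:YYYYMMDD:HHMMSS.mmm message` line layout seen in the excerpt above; the function name `summarize` and both regular expressions are illustrative, not part of Zabbix.

```python
import re
from collections import Counter

# Assumed line layout, taken from the excerpt above:
#   PID:YYYYMMDD:HHMMSS.mmm message
LINE_RE = re.compile(r'^\s*(\d+):(\d{8}):(\d{6}\.\d{3})\s+(.*)$')
# The host name is the quoted string that follows 'on host'.
HOST_RE = re.compile(r'on host "([^"]+)"')


def summarize(lines):
    """Count network-error and connection-restored messages per host.

    Returns two Counters: (errors, restored). Lines that do not match
    the assumed layout, or that mention no host, are skipped.
    """
    errors, restored = Counter(), Counter()
    for line in lines:
        match = LINE_RE.match(line)
        if not match:
            continue
        message = match.group(4)
        host = HOST_RE.search(message)
        if not host:
            continue
        if "first network error" in message:
            errors[host.group(1)] += 1
        elif "connection restored" in message:
            restored[host.group(1)] += 1
    return errors, restored


if __name__ == "__main__":
    # Two lines copied from the excerpt above.
    sample = [
        '17019:20140321:121657.881 Zabbix agent item "net.if.in[Microsoft ISATAP Adapter #2]" '
        'on host "HOST-008" failed: first network error, wait for 15 seconds',
        '17023:20140321:121721.897 resuming Zabbix agent checks on host "BSC-008": connection restored',
    ]
    errors, restored = summarize(sample)
    print("errors:", dict(errors))
    print("restored:", dict(restored))
```

To run it against a real log, pass an open file object instead of the sample list, for example `summarize(open("/var/log/zabbix/zabbix_server.log"))`.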
              Last edited by moneynut; 21-03-2014, 20:32.

              Comment

              • moneynut
                Member
                • Mar 2014
                • 37

                #37
                Originally posted by aib
                .
                Is there anything else I can upload that might help us to troubleshoot?

                Comment

                • tchjts1
                  Senior Member
                  • May 2008
                  • 1605

                  #38
                  Originally posted by moneynut
                  Actually, there is no data in the data gathering graphs or the other graphs.

                  I've attached it anyway. It looks the same for all graphs.
                  Look at the items for your Zabbix server and specifically the items that belong to Template App Zabbix Server. Is there any indication of them "Not supported" or "disabled"?
                  Attached Files

                  Comment

                  • moneynut
                    Member
                    • Mar 2014
                    • 37

                    #39
                    Originally posted by tchjts1
                    Look at the items for your Zabbix server and specifically the items that belong to Template App Zabbix Server. Is there any indication of them "Not supported" or "disabled"?
                    No. Everything is enabled and there are no errors.

                    Comment

                    • aib
                      Senior Member
                      • Jan 2014
                      • 1615

                      #40
                      Silly question: could you tell me a little about the "History Storage Period" for any of the items on the Zabbix Server host?
                      Sorry for asking, but I once had a case where the storage period was set to 0 and Zabbix didn't save any data for that item.
                      Sincerely yours,
                      Aleksey

                      Comment

                      • moneynut
                        Member
                        • Mar 2014
                        • 37

                        #41
                        Originally posted by aib
                        Silly question: could you tell me a little about the "History Storage Period" for any of the items on the Zabbix Server host?
                        Sorry for asking, but I once had a case where the storage period was set to 0 and Zabbix didn't save any data for that item.
                        History Storage period (in Days) = 7

                        Comment

                        • tchjts1
                          Senior Member
                          • May 2008
                          • 1605

                          #42
                          Originally posted by moneynut
                          No. Everything is enabled and there are no errors.
                          Are you looking at the items via Configuration --> Host --> Items, or are you looking at Configuration --> Templates --> Items ?

                          You need to look at Host --> Items.

                          And if that is where you see they are enabled and with no errors, then what are you seeing under Monitoring --> Latest Data --> <Zabbix Server> then under the category of "Zabbix Server" ?

                          If they are reporting correctly, you should see data there similar to this:
                          Attached Files

                          Comment

                          • moneynut
                            Member
                            • Mar 2014
                            • 37

                            #43
                            Originally posted by tchjts1
                            Are you looking at the items via Configuration --> Host --> Items, or are you looking at Configuration --> Templates --> Items ?

                            You need to look at Host --> Items.

                            And if that is where you see they are enabled and with no errors, then what are you seeing under Monitoring --> Latest Data --> <Zabbix Server> then under the category of "Zabbix Server" ?

                            If they are reporting correctly, you should see data there similar to this:
                            I'll update this post tomorrow (after 24 hours). I managed to fix the Zabbix graph monitoring, and I'll upload all the Zabbix server graphs.

                            Comment

                            • moneynut
                              Member
                              • Mar 2014
                              • 37

                              #44
                              Here are the graphs. It looks bad. FYI, server performance (RAM and CPU) is normal.
                              Attached Files

                              Comment

                              • tchjts1
                                Senior Member
                                • May 2008
                                • 1605

                                #45
                                Originally posted by moneynut
                                Here are the graphs. It looks bad. FYI, server performance (RAM and CPU) is normal.
                                It might look bad, but the good news is that now you know what needs to be adjusted. On your Zabbix server in zabbix_server.conf:

                                Increase:
                                StartPollers=xx (Increase by 10 or 15 at a time until stable)
                                StartPollersUnreachable=xx (try 10 or 15)
                                StartDiscoverers=2
                                TrendCacheSize=64M

                                Leave StartDBSyncers=4 (Do not change)

                                Make sure none of the above lines (if changed from the default) is preceded by a comment #, or the server will use the default setting instead. Restart your Zabbix server process. Give it 10 or 15 minutes, then check your graphs again. Adjust the settings further as necessary.

                                When I adjust any of those settings, I prefer to leave the default line in place and put my new value onto a new line such as this:

                                ### Option: StartPollers
                                # Number of pre-forked instances of pollers.
                                #
                                # Mandatory: no
                                # Range: 0-1000
                                # Default:
                                # StartPollers=5
                                StartPollers=350
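
As a quick way to confirm that an edited line is really uncommented, here is a minimal Python sketch that lists the active (not `#`-prefixed) `Name=Value` lines in a config file. The function name `active_settings` is made up for illustration and is not a Zabbix tool. The sketch only reports what is uncommented in the file; it does not know the server's built-in defaults and does not decide which value wins if a parameter appears twice.

```python
def active_settings(conf_text):
    """Return {parameter: [values]} for every uncommented Name=Value line.

    Values are kept in file order, so a parameter that appears on more
    than one active line shows up with several values and can be
    reviewed by hand.
    """
    settings = {}
    for raw_line in conf_text.splitlines():
        line = raw_line.strip()
        # Skip blank lines and comments, including commented-out defaults.
        if not line or line.startswith("#"):
            continue
        name, sep, value = line.partition("=")
        if not sep:
            continue  # not a Name=Value line
        settings.setdefault(name.strip(), []).append(value.strip())
    return settings


if __name__ == "__main__":
    # The snippet from the post above: the default stays commented out
    # and the new value sits on its own active line.
    sample = """\
### Option: StartPollers
# Number of pre-forked instances of pollers.
#
# Mandatory: no
# Range: 0-1000
# Default:
# StartPollers=5
StartPollers=350
"""
    for name, values in active_settings(sample).items():
        print(name, "=", ", ".join(values))
```

Any parameter missing from the result is still commented out (or absent), so the server would fall back to its default for it.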

                                Comment