Ad Widget

Collapse

Multiple cross emails from actions upgrade from 3.0 to 4.0.7 : broken

Collapse
X
 
  • Time
  • Show
Clear All
new posts
  • zabbixfk
    Senior Member
    • Jun 2013
    • 256

    #1

    Multiple cross emails from actions upgrade from 3.0 to 4.0.7 : broken

    Hello,

    I upgraded from 3.0.13 to 4.0.7, and now it seems actions is broken. Please help.
    For a single action, i am seeing more than 30 emails sent (i have configured only two from user group, each of usergroup has one id so two email should go, works on 3.0. but not here)and it seems its mixing up body from another trigger and sending email here. This is broken big time, please help .....
    There was no changes on trigger, it was a blind db upgrade - i mean i started db and installed new binaries and db got upgraded.
    I upgraded because 4.0.7 is LTS. here is my config

    Code:
    zabbix_server (Zabbix) 4.0.7
    Revision 92831 18 April 2019, compilation time: Jun  3 2019 15:59:42
    
    Copyright (C) 2019 Zabbix SIA
    License GPLv2+: GNU GPL version 2 or later <http://gnu.org/licenses/gpl.html>.
    This is free software: you are free to change and redistribute it according to
    the license. There is NO WARRANTY, to the extent permitted by law.
    
    This product includes software developed by the OpenSSL Project
    for use in the OpenSSL Toolkit (http://www.openssl.org/).
    
    Compiled with OpenSSL 1.0.2k-fips  26 Jan 2017
    Running with OpenSSL 1.0.2k-fips  26 Jan 2017

    Trigger,

    Code:
    Name : Low memory on {HOST.NAME}
    Severity : Warning
    Expression : (({DC-Windows-Servers:vm.memory.size[free].last(180)} * 100 ) / {DC-Windows-Servers:vm.memory.size.last(180)} )<5
    OK event generation : Expression ( auto selected after upgrade ,basically this wasn't there on 3.0, so i don't know about this)
    PROBLEM event generation mode : single  ( auto selected after upgrade ,basically this wasn't there on 3.0, so i don't know about this)
    OK event closes : All problems :  ( auto selected after upgrade ,basically this wasn't there on 3.0, so i don't know about this)
    rest all are blank.

    Actions:

    Code:
    Name : Low Memory On Windows Server :{HOST.NAME}
    Conditions : Label  A , Name  Trigger equals [I]DC-Windows-Servers: Low memory on DC-Windows-Servers [/I]
    New condition : Blank
    
    Operations  :
    Default operation step duration : 1m (Don't know what's this, this wasn't there on 3.0)
    Default subject : {EVENT.NAME}
    Default message :  
    HostName : {HOST.NAME}
    IP : {HOST.IP}
    Status : {TRIGGER.STATUS}
    
    
    1. {ITEM.NAME1}  - {ITEM.VALUE1}
    2. {ITEM.NAME2}  - {ITEM.VALUE2}
    
    Note: this is a system generated email. Please do not reply to this email.
    
    
    Recovery Operations:
    Default subject : {TRIGGER.STATUS}: {EVENT.NAME}
    Default message :
    Trigger: {EVENT.NAME}
    Trigger status: {TRIGGER.STATUS}
    Trigger severity: {TRIGGER.SEVERITY}
    Trigger URL: {TRIGGER.URL}
    
    Item values:
    1. {ITEM.NAME1} ({HOST.NAME1}:{ITEM.KEY1}): {ITEM.VALUE1}
    2. {ITEM.NAME2} ({HOST.NAME2}:{ITEM.KEY2}): {ITEM.VALUE2}
    3. {ITEM.NAME3} ({HOST.NAME3}:{ITEM.KEY3}): {ITEM.VALUE3}
    
    Operations : Blank
    
    
    Update Operations:
    Default subject : Acknowledged: {EVENT.NAME}
    Default message : {USER.FULLNAME} acknowledged problem at {ACK.DATE} {ACK.TIME} with the following message:
    {ACK.MESSAGE}
    
    Current problem status is {EVENT.STATUS}
    
    Operations : Blank.
    Actions When clicked on a particular event , looks like evets are grouped unlike they are previous where displayed one by one per trigger :
    HTML Code:
    Actions   	     	 		 			Step 			Time 			User/Recipient 			Action 			Message/Command 			Status 			Info 		 		 			1 			2019-06-04 19:25:19 			[email protected] 			
    			  			Low memory on P-DC2
    
    
    
    
    			HostName : P-DC2
    
    			IP : 172.x.x.62
    
    			Status: PROBLEM
    
    
    
    
    			1. Free Memory - 102.39 MB
    
    			2. Total Memory - 10 GB
    
    
    
    
    			Note: this is a system generated email. Please do not reply this email 			Sent 			
    			  		 		 			1 			2019-06-04 19:25:19 			[email protected] 			
    			  			Low memory on P-DC2
    
    
    
    
    			Host : P-DC2
    
    			IP : 172.x.x.62
    
    			Status : PROBLEM
    
    
    
    
    			1. Free Memory - 102.39 MB
    
    
    
    
    			Note: This is system generated email Please do not reply to this email 			Sent 			
    			  		 		 			1 			2019-06-04 19:25:19 			[email protected] 			
    			  			Low memory on P-DC2
    
    
    
    
    			HostName : P-DC2
    
    			IP : 172.x.x.62
    
    			Status: PROBLEM
    
    
    
    
    			1. Free Memory - 102.39 MB
    
    			2. Total Memory - 10 GB
    
    
    
    
    			Note: this is a system generated email. Please do not reply this email 			Sent 			
    			  		 		 			1 			2019-06-04 19:25:19 			[email protected] 			
    			  			Low memory on P-DC2
    
    
    
    
    			HostName : P-DC2
    
    			IP : 172.x.x.62
    
    			Status: PROBLEM
    
    
    
    
    			TrunkUtilisation Details -
    
    
    
    
    			1. Free Memory : 102.39 MB
    
    			2. Total Memory : 10 GB
    
    			3. *UNKNOWN* : *UNKNOWN*
    
    			4. *UNKNOWN* : *UNKNOWN*
    
    			5. *UNKNOWN* : *UNKNOWN*
    
    			6. *UNKNOWN* : *UNKNOWN*
    
    			7. *UNKNOWN* : *UNKNOWN*
    
    			8. *UNKNOWN* : *UNKNOWN*
    
    
    
    
    			Note: This is a system generated email. Please do not reply to this email. 			Sent 			
    			  		 		 			1 			2019-06-04 19:25:19 			[email protected] 			
    			  			Low memory on P-DC2
    
    
    
    
    			HostName : P-DC2
    
    			IP : 172.x.x.62
    
    			Status: PROBLEM
    
    
    
    
    			1. Free Memory : 102.39 MB
    
    
    
    
    			Note: This is a system generated email. Please do not reply to this email. 			Sent 			
    			  		 		 			1 			2019-06-04 19:25:19 			[email protected] 			
    			  			Low memory on P-DC2
    
    
    
    
    			HostName : P-DC2
    
    			IP : 172.x.x.62
    
    			Status: PROBLEM
    
    
    
    
    			1. Free Memory : 102.39 MB
    
    
    
    
    			Note: This is a system generated email. Please do not reply to this email. 			Sent 			
    			  		 		 			1 			2019-06-04 19:25:19 			[email protected] 			
    			  			Low memory on P-DC2
    
    
    
    
    			Host : P-DC2
    
    			IP : 172.x.x.62
    
    			Status: PROBLEM
    
    
    
    
    			1. Free Memory - 102.39 MB
    
    
    
    
    			Note: This is system generated email. Please do not reply to this email. 			Sent 			
    			  		 		 			1 			2019-06-04 19:25:19 			[email protected] 			
    			  			Low memory on P-DC2
    
    
    
    
    			Host : P-DC2
    
    			IP : 172.x.x.62
    
    			Status: PROBLEM
    
    
    
    
    			1. Free Memory - 102.39 MB
    
    
    
    
    			Note: This is system generated email. Please do not reply to this email. 			Sent 			
    			  		 		 			1 			2019-06-04 19:25:19 			[email protected] 			
    			  			Low memory on P-DC2
    
    
    
    
    			Host : P-DC2
    
    			IP : 172.x.x.62
    
    			Status: PROBLEM
    
    
    
    
    			1. Free Memory : 102.39 MB
    
    
    
    
    			Note: This is a system generated email. Please do not reply to this email. 			Sent 			
    			  		 		 			1 			2019-06-04 19:25:19 			[email protected] 			
    			  			Low memory on P-DC2
    
    
    
    
    			Host : P-DC2
    
    			IP : 172.x.x.62
    
    			Status: PROBLEM
    
    
    
    
    			1. Free Memory : 102.39 MB
    
    
    
    
    			Note: This is a system generated email. Please do not reply to this email. 			Sent 			
    			  		 		 			1 			2019-06-04 19:25:19 			[email protected] 			
    			  			Low memory on P-DC2 : 172.x.x.62 : P-DC2:vm.memory.size[free]
    
    
    
    
    			Host : P-DC2
    
    			IP : 172.x.x.62
    
    			Status : PROBLEM
    
    
    
    
    			1. Free Memory - 102.39 MB
    
    
    
    
    			Note: This is system generated email. Please do not reply to this message. 			Sent 			
    			  		 		 			1 			2019-06-04 19:25:19 			[email protected] 			
    			  			Low memory on P-DC2 : 172.x.x.62 : P-DC2:vm.memory.size[free]
    
    
    
    
    			Host : P-DC2
    
    			IP : 172.x.x.62
    
    			Status : PROBLEM
    
    
    
    
    			1. Free Memory - 102.39 MB
    
    
    
    
    			Note: This is system generated email. Please do not reply to this message. 			Sent 			
    			  		 		 			1 			2019-06-04 19:25:19 			[email protected] 			
    			  			Low memory on P-DC2
    
    
    
    
    			Host : P-DC2
    
    			IP : 172.x.x.62
    
    			Status : PROBLEM
    
    
    
    
    			1. Free Memory - 102.39 MB
    
    
    
    
    			Note: This is system generated email Please do not reply to this email 			
    
    
    Event list [previous 20]   	     	 		 			Time 			Recovery time 			Status 			Age 			Duration 			Ack 			Actions 		 		 			2019-06-04 19:25:16 			
    			  			PROBLEM 			1m 44s 			1m 44s 			No 			25 actions 		 		 			2019-06-04 18:35:16 			2019-06-04 18:55:16 			RESOLVED 			51m 44s 			20m 			No 			33 actions 		 		 			2019-06-04 18:30:16 			2019-06-04 18:55:16 			RESOLVED 			56m 44s 			25m 			No 			
    			  		 		 			2019-06-04 18:25:16 			2019-06-04 18:55:16 			RESOLVED 			1h 1m 44s 			30m 			No 			
  • zabbixfk
    Senior Member
    • Jun 2013
    • 256

    #2
    Can someone reply to this.... after upgrade, for every trigger we are seeing too many number of emails ( though configured is two).
    This upgrade seems broken....

    Thanks,

    Comment

    • zabbixfk
      Senior Member
      • Jun 2013
      • 256

      #3
      Can you someone help me figure out what's wrong with the upgrade here... if this is the case i can't depend on zabbix anymore !!!!, just a upgrade and it broke actions....

      Thanks,

      Comment

      • vso
        Zabbix developer
        • Aug 2016
        • 190

        #4
        Please post some information about triggers that fired and action

        Comment

        • zabbixfk
          Senior Member
          • Jun 2013
          • 256

          #5
          Thanks for replying @vso.
          I have already added trigger details, here it is once again.
          Trigger,
          Code:
          Name : Low memory on {HOST.NAME}
          Severity Warning
          Expression : [B](({DC-Windows-Servers:vm.memory.size[free].last(180)} * 100 ) / {DC-Windows-Servers:vm.memory.size.last(180)} )<5[/B]
          OK event generation : Expression [B](selected auto after upgrading from 3.0, this field wasn't there earlier)[/B]
          PROBLEM event generation mode : single [B](selected auto after upgrading from 3.0, this field wasn't there earlier)[/B]
          OK event closes : All problems [B](selected auto after upgrading from 3.0, this field wasn't there earlier)[/B]
          Tags  : none
          Allow manual close : unchecked [B](selected blank/unchecked after upgrading from 3.0, this field wasn't there earlier)[/B]
          URL : none
          Description : If the Free memory is below 5%
          Enabled : Selected.
          Dependencies : None
          Actions :

          Code:
          [B][I]Action Tab consists[/I][/B]
          Name : Low Memory On Windows Server :{HOST.NAME}
          Conditions : Label A , Name Trigger equals [I]DC-Windows-Servers: Low memory on DC-Windows-Servers [/I]
          New condition : Blank
          
          Operations  :
          Default operation step duration : 1m (Don't know what's this, this wasn't there on 3.0)
          Default subject : {EVENT.NAME}
          Default message :  
                                  HostName : {HOST.NAME}
                                  IP : {HOST.IP}
                                  Status : {TRIGGER.STATUS}  
          
                                 1. {ITEM.NAME1}  - {ITEM.VALUE1}
                                 2. {ITEM.NAME2}  - {ITEM.VALUE2}  
          
                                 Note: this is a system generated email. Please do not reply to this email.  
          
          Pause operations for suppressed problems : Unchecked [B](auto populated after upgrading from 3.0, this field wasn't there earlier)[/B]
          
          Operations :
                             Steps - 1, Details : [B]Send message to user groups: [/B]Email_DL_Compute, Email_DL_NOC via Email , Start in: Immediately , Duration : Default.  
          
          [I][B]Recovery Operations tab consists : [/B][/I][B](auto populated after upgrading from 3.0, this field wasn't there earlier)[/B]
          Default subject : {TRIGGER.STATUS}: {EVENT.NAME}
          Default message :
                                        Trigger: {EVENT.NAME}
                                        Trigger status: {TRIGGER.STATUS}
                                        Trigger severity: {TRIGGER.SEVERITY}
                                        Trigger URL: {TRIGGER.URL}  
                                        Item values:
                                                  1. {ITEM.NAME1} ({HOST.NAME1}:{ITEM.KEY1}): {ITEM.VALUE1}
                                                  2. {ITEM.NAME2} ({HOST.NAME2}:{ITEM.KEY2}): {ITEM.VALUE2}
                                                  3. {ITEM.NAME3} ({HOST.NAME3}:{ITEM.KEY3}): {ITEM.VALUE3}  
          
          Operations : Blank  
          
          [I][B]Update Operations Tab consists: [/B][/I][B](auto populated after upgrading from 3.0, this field wasn't there earlier)[/B]
          Default subject : Acknowledged: {EVENT.NAME}
          Default message : {USER.FULLNAME} acknowledged problem at {ACK.DATE} {ACK.TIME} with the following message: {ACK.MESSAGE}  Current problem status is {EVENT.STATUS}
          Operations : Blank.
          Email_DL_Compute consists one user with groups access to Servers group, where this template is tied to those groups hosts.
          Email_DL_NOC consists one user with groups access to most of the groups in system.

          SO in 3.0, whenever theres this trigger fired, it creates PROBLEM event and email goes to two groups i.e two users ( each group with one user). And same case with OK event ( email sent to two groups). Each event has only two emails - that is what its suppose to do.
          But in 4.0, several actions happening, i.e several emails going for these two groups with different actions body ( different as in i have lot of other actions in system and their body is getting sent), basically for this action about 33 emails sent.

          Another issue: Earlier ( in 3.0), we could see when clicked on Monitoring - trigger - all the events for that trigger, but now only one tab called problems, and i am not able to see older problems, i tried all the combinations on the filters with this hostname which has low memory issue, or selecting this server hostgroup or resetting all the filters but its not displaying anything.

          I have about 7K devices with more than 600 triggers and for each trigger or most of the combinations of triggers there's email action. I have disabled actions for now as emails flooding ,
          I am just stuck with this upgrade. I really need some help.

          Any pointers are greatly helpful.

          Thanks
          Last edited by zabbixfk; 06-06-2019, 19:52.

          Comment

          • vso
            Zabbix developer
            • Aug 2016
            • 190

            #6
            Action can fire multiple times only if condition is met, it is best to make sure that there are only one trigger that meet following condition:
            Name Trigger equals DC-Windows-Servers: Low memory on DC-Windows-Servers

            You can also add another condition and limiting to desired host

            Comment

            • zabbixfk
              Senior Member
              • Jun 2013
              • 256

              #7
              Thanks for the reply @vso.
              Yes, i understand. But, there's only one trigger for all the windows hosts i.e via template.
              I just don't understand, something which is working fine in 3.0, i.e sending only two emails, is sending now 30+ emails. In the first thread if you see, i have also added sample actions, there, it is sending with more fields, and some linkUtilisation action body ( this definitely from different trigger. Not from DC-Windows-Servers, but from anther action).

              And let me tell you, this is not only happening to one particular template, for all the actions i am seeing multiple emails !!!!!! So definitely nothing to with this action or template - something wrong at code level.

              Here are the screenshots of the action and other configs, but it seems i am no logner able to locate that problem anymore, in the problems page its not getting displayed now.

              Is this something to do with this bug?



              Thanks

              Comment

              • vso
                Zabbix developer
                • Aug 2016
                • 190

                #8
                Please open action and provide screenshot what are the unknown values ?

                Comment

                • zabbixfk
                  Senior Member
                  • Jun 2013
                  • 256

                  #9
                  Thanks for the reply @vso.
                  Whatever is coming n UNKOWN is from different action not from this and that is what i am saying cross action mails and multiple mails for single action.
                  Here is the screenshot for the action.

                  Comment

                  • zabbixfk
                    Senior Member
                    • Jun 2013
                    • 256

                    #10
                    See, i enabled actions, and i am seeing the number of emails going up and up rising...
                    Some cross actions happening here... i don't know how and why is this problem. I had to disable all the aciton in order to minimise the issue.
                    Please help.

                    Comment

                    • zabbixfk
                      Senior Member
                      • Jun 2013
                      • 256

                      #11
                      If you see closely, one of the trigger - Low memory - for example, in addition to matching its own body, its matching most of the other action bodies and applying action on it. i.e sending email. You can refer last thread sceenshots.

                      Looks like its either this bug or some other bug. Basically, when i enable all the actions, every event will match with most of the actions and sending emails. Action / Email body is getting filled with random action body. I am able to reproduce this with what i have. If you want, i can provide my trigger and action dumps or any other details required , but, this is something very bad for a LTS release. All i did was upgrading from 3.0 to 4.0 i.e upgrade by running zabbix_server after compiling 4.0 version - yes i do compile, doesn't rely on rpms - i have been doing this since 2.0.x versions. - it did upgrade the database and started monitoring too - though i haven't enabled all the devices for monitoring - for testing i enabled some and i am encountering this error.

                      Can someone from zabbix team look into this? Or any others - any pointers are greatly helpful.

                      Thanks
                      Last edited by zabbixfk; 08-06-2019, 07:45.

                      Comment

                      • vso
                        Zabbix developer
                        • Aug 2016
                        • 190

                        #12
                        It would be best to localize issue and leave one action enabled, are you saying that adding trigger condition in action results in action being fired for all other triggers ?

                        Comment

                        • zabbixfk
                          Senior Member
                          • Jun 2013
                          • 256

                          #13
                          - When i have only one action enabled, i.e example low memory , i am getting only two emails - which is what that action defined and its working as expected.
                          Now, when i start enabling all the actions, ( have about 250+) for different triggers , each action which is fired is sending more than 30 emails. i.e
                          - most of the email body is not from desired action.
                          - It seems its pulling action body from some other actions defined.
                          - For example, for low memory acton, its pulling body from linkUtilisation action and values are filled as UNKNOWN.
                          - All the emails received as part of this have heading/body on the email says low memory.
                          - This is not only happening for low memory but also for all other triggers, if you look at my screenshots, triggers such as , zabbix process more than 75% busy, DHCP scope issues etc also firing more than 30 emails. And this is unaccepted as these actions have only two address configured to send emails.

                          And i can't keep only one action enabled, i have to enable all the actions while rolling out for production.

                          Really stuck on this upgrade part, need some inputs/help. Thanks in advance.

                          Comment

                          • zabbixfk
                            Senior Member
                            • Jun 2013
                            • 256

                            #14
                            *bump* - can someone from zbx team or others look at this? Since this is happening to a LTS build, i feel this is quite serious issue.

                            Comment

                            • vso
                              Zabbix developer
                              • Aug 2016
                              • 190

                              #15
                              If you think that there is a bug then it’s better to create a bug report, currently it’s unclear how to reproduce the issue.

                              Please note that actions only fire after trigger is fired, what possibly is happening is that trigger fires and then multiple actions get executed for that trigger because action conditions match, is that the case ? Are those 30 mails actually 30 different actions ?

                              Comment

                              Working...