I've done some looking around on this forum and I can't seem to find anyone else having the same problem as me.
I recently installed Zabbix 2.2 server and its agents on about 80 servers. After adding the agents over a the course of a week or so, I was notified by our security team that the firewall between the zabbix server and its agents had a session count over 100,000 and this was causing impacts and degradation for everything passing through that firewall (this is up from the normal 25,000 sessions on it). After they blocked zabbix and I took it down, the firewall was fine back to the normal session count.
One of the firewall administrators suggested connections were not getting closed by Zabbix and maybe no FIN was being sent. I brought Zabbix back up for a short amount of time and watched the session count grow on the firewall by 30k more in only about 25 minutes time. During that time I observed both the Zabbix server and agent servers had a high number of TIME_WAIT connections existing. After reading up a bit more about TIME_WAIT, supposedly the connection is technically already closed at that point. So it appears to me Zabbix is closing the connections.
Does anybody have any clue why the firewall might be keeping the sessions open instead of closing them? It seems like they are only getting closed on the firewall when they hit a timeout 30 minute value. Even after I stopped Zabbix server and observed the TIME_WAIT connections disappear pretty quickly, the firewall did not drop the Zabbix sessions for about 30 minutes.
Really need some help for this one. Maybe there's some sort of firewall setting that needs to be adjusted that security guru on here might know? I put a lot of work into getting Zabbix 2.2.1 implemented and configured for our systems and now I'm getting lots of flak about it bringing our firewall to its knees and will have to ditch it if no resolution can be found.
I recently installed Zabbix 2.2 server and its agents on about 80 servers. After adding the agents over a the course of a week or so, I was notified by our security team that the firewall between the zabbix server and its agents had a session count over 100,000 and this was causing impacts and degradation for everything passing through that firewall (this is up from the normal 25,000 sessions on it). After they blocked zabbix and I took it down, the firewall was fine back to the normal session count.
One of the firewall administrators suggested connections were not getting closed by Zabbix and maybe no FIN was being sent. I brought Zabbix back up for a short amount of time and watched the session count grow on the firewall by 30k more in only about 25 minutes time. During that time I observed both the Zabbix server and agent servers had a high number of TIME_WAIT connections existing. After reading up a bit more about TIME_WAIT, supposedly the connection is technically already closed at that point. So it appears to me Zabbix is closing the connections.
Does anybody have any clue why the firewall might be keeping the sessions open instead of closing them? It seems like they are only getting closed on the firewall when they hit a timeout 30 minute value. Even after I stopped Zabbix server and observed the TIME_WAIT connections disappear pretty quickly, the firewall did not drop the Zabbix sessions for about 30 minutes.
Really need some help for this one. Maybe there's some sort of firewall setting that needs to be adjusted that security guru on here might know? I put a lot of work into getting Zabbix 2.2.1 implemented and configured for our systems and now I'm getting lots of flak about it bringing our firewall to its knees and will have to ditch it if no resolution can be found.
Comment