Ad Widget

**tim.mooney** · 17-12-2025, 06:02

When everything is stable and working well and there have been no security vulnerabilities that impact your environment, it's fine to stick with an old version.

When you're having trouble, though, one of the first things most vendors will tell you to do is make sure you're running the latest version (in your series).

Are all the client systems that are having problems on VM hosts? Or are the problems impacting both physical hosts and VMs?

Have you left a 'tcpdump' or 'wireshark' network packet trace active during the time when the issues happen? I'm just wondering if that will give you any insight into where the problem may be.

**Mayhem** · 18-12-2025, 23:30

Originally posted by tim.mooney

When everything is stable and working well and there have been no security vulnerabilities that impact your environment, it's fine to stick with an old version.

When you're having trouble, though, one of the first things most vendors will tell you to do is make sure you're running the latest version (in your series).

Are all the client systems that are having problems on VM hosts? Or are the problems impacting both physical hosts and VMs?

Have you left a 'tcpdump' or 'wireshark' network packet trace active during the time when the issues happen? I'm just wondering if that will give you any insight into where the problem may be.

Hey,

Unfortunately, due to corporate I am not allowed to upgrade right now. They are trying to standardize and consolidate some programs and companies they've acquired into fewer instances, and we are already the most modern. But I will say it was working OK for a year before we ran into the issue. While we have continued to ingest additional servers in, it's not been large amounts after the initial spin up of the app.

Everything we have ingested right now is a VM. There were only a few bare metal servers remaining and we've decommissioned them. I did check with our virtualisation team but they're not seeing anything on their side.

As for the tcpdump and wireshark idea, I haven't done so again, everything is very restricted and silo'd in my company. As a result, I can't run those myself on any of the servers. I will need to reach out to my Unix team to see if they can set that up. I've also asked them to pull the message log from the server.

A couple of things I've noticed since I posted:

One, this issue is happening more frequently than expected. We see the error message about not being able to connect several times a day, but only very briefly, so they don't fire alerts like they do at night. As well. do have some outstanding queue items pretty consistently under Zabbix Agent Active, so I was thinking that maybe there is a constantly low level issue that get exacerbated when the system backs up and additional load is placed on it. Though I still haven't really seen anything new in the zabbix server logs.

I was thinking of increasing some of the capacity options a little bit in the config to see if that helps.

Ad Widget

Zabbix Alert Storm Issues

Zabbix Alert Storm Issues

Comment

Comment