Issues with active agents after proxy update to 7.0.18

  • troffasky
    Senior Member
    • Jul 2008
    • 567

    #1

    Issues with active agents after proxy update to 7.0.18


    Updated zabbix-proxy from 7.0.15 to 7.0.18.
    Since then, the proxy logs have been spammed with:

    Code:
    16382:20250829:102418.404 failed to accept an incoming connection: from 172.31.252.16: reading first byte from connection failed: [11] Resource temporarily unavailable
    16380:20250829:102418.663 failed to accept an incoming connection: from 172.31.252.81: reading first byte from connection failed: [11] Resource temporarily unavailable
    16379:20250829:102418.665 failed to accept an incoming connection: from 172.31.252.12: reading first byte from connection failed: [11] Resource temporarily unavailable
    16381:20250829:102418.771 failed to accept an incoming connection: from 172.31.252.38: reading first byte from connection failed: [11] Resource temporarily unavailable
    16383:20250829:102418.993 failed to accept an incoming connection: from 172.31.252.70: reading first byte from connection failed: [11] Resource temporarily unavailable
    16382:20250829:102422.439 failed to accept an incoming connection: from 172.31.252.49: reading first byte from connection failed: [11] Resource temporarily unavailable
    16380:20250829:102422.666 failed to accept an incoming connection: from 172.31.252.82: reading first byte from connection failed: [11] Resource temporarily unavailable
    16379:20250829:102422.669 failed to accept an incoming connection: from 172.31.252.64: reading first byte from connection failed: [11] Resource temporarily unavailable
    16381:20250829:102422.776 failed to accept an incoming connection: from 172.31.252.58: reading first byte from connection failed: [11] Resource temporarily unavailable
    16383:20250829:102422.997 failed to accept an incoming connection: from 172.31.252.32: reading first byte from connection failed: [11] Resource temporarily unavailable

    Metrics are making it through, but with many gaps, i.e. the active agent availability item is flapping.

    It does not seem to be a resource issue on the proxy: load average is 0.7 and 0 B of swap is in use.
    The proxy config file was not changed as part of the upgrade; its timestamp is 10 months old.

    If I google this, almost every post is about agent encryption. That does not apply here; no encryption is in use.
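
    For what it's worth, errno [11] is EAGAIN, so the accepted connection apparently produced no data before the read gave up. A rough sketch for seeing how bursty these errors are (the log path is an assumption; adjust to your LogFile setting):

    Code:
    # Count the "failed to accept" errors per minute straight from the proxy log
    grep 'failed to accept an incoming connection' /var/log/zabbix/zabbix_proxy.log \
        | awk -F: '{print $2 ":" substr($3,1,4)}' | sort | uniq -c | tail -n 20
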
  • troffasky
    Senior Member
    • Jul 2008
    • 567

    #2
    The proxy is also struggling to talk to the server; proxy poller utilisation is stuck at >98%.
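
    If the bottleneck is the server-side "proxy poller" processes (the ones that poll a passive proxy), the usual knobs are the internal busy-rate item and the process count in zabbix_server.conf; a sketch only, with an example value:

    Code:
    # Internal item on the Zabbix server showing proxy poller busyness
    zabbix[process,proxy poller,avg,busy]

    # Number of proxy poller processes in zabbix_server.conf (example value)
    StartProxyPollers=5
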

    • markfree
      Senior Member
      • Apr 2019
      • 868

      #3
      Have you checked your configuration file?
      Some updates push new configuration files with minor changes. Depending on the package manager, the application may start using the new file instead of the old one.
      It's also possible that your proxy configuration needs some process tuning to better handle the load.
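
      One quick way to check whether the package manager left a newer config variant next to the old one (paths and suffixes are the usual RPM/DEB conventions, not anything specific to this setup):

      Code:
      # Look for config variants dropped by the package manager during the upgrade
      ls -l /etc/zabbix/zabbix_proxy.conf*
      find /etc/zabbix -name '*.rpmnew' -o -name '*.dpkg-dist'

      # If one exists, diff it against the running config
      diff -u /etc/zabbix/zabbix_proxy.conf /etc/zabbix/zabbix_proxy.conf.rpmnew
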

      • troffasky
        Senior Member
        • Jul 2008
        • 567

        #4
        The proxy config file was not changed as part of the upgrade; its timestamp is 10 months old.

        • troffasky
          Senior Member
          • Jul 2008
          • 567

          #5
          This just "fixed itself" 18h later. It *definitely* started right after upgrading from 7.0.15 to 7.0.18.

          • Semicolon
            Junior Member
            • Sep 2025
            • 3

            #6
            I also see this behavior, and it also "fixed itself." But then it went back to being busted again; this time just about all agents are reporting disconnected and down for over 30 hours.

            I went from 7.0.17 to 7.0.18.

            • troffasky
              Senior Member
              • Jul 2008
              • 567

              #7
              This is pretty weird. I know you're all probably thinking "networking issue, 100%", but I am convinced it's the application. The issue did not come back after restarting the service, though.
              I have Smokeping running on the same host, and there was no packet loss to any of the agents, nor any increase in ping time or jitter.
              The agents connect over a WireGuard tunnel that terminates on the Zabbix proxy host.
              The Zabbix server connects to the Zabbix proxy through a more conventional L3 route through the datacentre firewall.
              I cannot replicate the issue by restarting the service, so I'm not sure what useful information I can add for troubleshooting.

              • Markku
                Senior Member
                Zabbix Certified Specialist, Zabbix Certified Professional, Zabbix Certified Expert
                • Sep 2018
                • 1781

                #8
                My guess is some kind of TCP resource starvation: too many TCP connections coming to the proxy or something like that.

                Look at the various tools that give you some idea of the TCP socket situation; I'd start with the plain "ss" command. And/or try increasing the various kernel-level TCP socket settings.

                From the above (the use of a WireGuard tunnel) I understand that your proxy is reachable from the internet, so it may just be noise from the internet causing this. Or it could be something else; it's hard to say without knowing all the surrounding details.

                If your skills allow, use Wireshark+sshdump or tcpdump to see the traffic and figure out what's happening.

                Markku
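
                A minimal sketch of the checks suggested above, assuming the proxy listens on the default port 10051 (the sysctl line only reads the current values; tune with care if you change anything):

                Code:
                # Listening socket and accept-queue state for the proxy's trapper port
                ss -ltn 'sport = :10051'
                ss -s                                # overall socket summary
                netstat -s | grep -i listen          # listen-queue overflow / drop counters

                # Kernel-level TCP socket settings mentioned above (read-only here)
                sysctl net.core.somaxconn net.ipv4.tcp_max_syn_backlog
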

                • troffasky
                  Senior Member
                  • Jul 2008
                  • 567

                  #9
                  Interesting, I've never used the ss command before. It would be good as a Zabbix agent metric.
                  I don't recall anything in journalctl about running out of TCP sockets, and the issue didn't go away after a reboot.
                  Like I said, I can't reproduce it, so I can't troubleshoot this any further, but I will look at that ss command if it happens again.
                  FWIW, I had no issues SSHing in or browsing the web interface.
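
                  If you do want ss exposed as an agent metric, a minimal UserParameter sketch (the key name is made up, and the "ss -s" output format can vary between versions):

                  Code:
                  # e.g. /etc/zabbix/zabbix_agentd.d/tcp_sockets.conf (usual include directory)
                  # Total TCP sockets as reported by "ss -s"
                  UserParameter=custom.tcp.total,ss -s | awk '/^TCP:/ {print $2}'
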

                  • Markku
                    Senior Member
                    Zabbix Certified Specialist, Zabbix Certified Professional, Zabbix Certified Expert
                    • Sep 2018
                    • 1781

                    #10
                    If the issue recurs, you may also want to check the proxy metrics and see whether you need to increase the number of trappers on the proxy (unless a lot of them are already configured).

                    Markku
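
                    For reference, the trapper count and the internal busy-rate item are both standard names; the number below is only an example:

                    Code:
                    # zabbix_proxy.conf - number of pre-forked trapper processes (example value)
                    StartTrappers=10

                    # Internal item to graph trapper busyness on the proxy
                    zabbix[process,trapper,avg,busy]
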

                    • troffasky
                      Senior Member
                      • Jul 2008
                      • 567

                      #11
                      This metric is collected and you cannot even tell where the issue begins (1) and ends (2):
                      [Attached graph: image.png]

                      • Semicolon
                        Junior Member
                        • Sep 2025
                        • 3

                        #12
                        It came back for me, and lasted for nearly a week. Then it self-resolved three days ago.
                        It just came back an hour ago.

                        I can confirm that it is not a TCP ephemeral port exhaustion issue or some other TCP resource issue.
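
                        For anyone wanting to rule that out the same way, a rough sketch comparing ports in use against the configured ephemeral range:

                        Code:
                        # Configured ephemeral port range
                        cat /proc/sys/net/ipv4/ip_local_port_range

                        # Sockets currently holding ports
                        ss -tan state established | wc -l
                        ss -tan state time-wait | wc -l
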

                        • Markku
                          Senior Member
                          Zabbix Certified Specialist, Zabbix Certified Professional, Zabbix Certified Expert
                          • Sep 2018
                          • 1781

                          #13
                          Originally posted by Semicolon
                          It came back for me, and lasted for nearly a week. Then it self-resolved three days ago.
                          It just came back an hour ago.

                          I can confirm that it is not a TCP ephemeral port exhaustion issue or some other TCP resource issue.
                          How do the TCP sessions look in a packet capture?

                          Markku
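
                          A simple capture along those lines, assuming the default trapper port 10051; the resulting file can be opened in Wireshark afterwards:

                          Code:
                          # Capture Zabbix traffic on the proxy into a file for later analysis
                          tcpdump -i any -w /tmp/zabbix_trapper.pcap 'tcp port 10051'
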

                          • Semicolon
                            Junior Member
                            • Sep 2025
                            • 3

                            #14
                            Originally posted by Markku

                            How do the TCP sessions look in a packet capture?

                            Markku
                            It repaired itself this morning; I will capture the next time it starts to fail.
