I have 7.0.6 installed on Ubuntu 24.04.01 on HyperV, it's a new-ish install (12 days old) and very small (19 hosts), so negligible processing load, using Postgresql 16.
In the wee hours it seemed to stop processing items for 20 +/- minutes, which caused a flurry of unavailable alerts. I think this queue issue is the cause of the unavailable (by ping) errors, but I am baffled as to the cause of the processing issue.
I've checked the syslog, zabbix log, postgresql log and see nothing unusual in them, and no discontinuities.
I've checked the hyperv logs and the windows logs for the hypervisor. Nothing unusual there. My initial thought was some kind of freeze, e.g. from a backup snapshot, but no sign of that. Time server logs show no jump or adjustment in time on either the guest or hypervisor, though my gut tells me this is some kind of time jump. There was no reboot, no restart of postgresql or zabbix server.
The Postgresql server is on the same guest, and only used for zabbix and netdisco, so it's not like an external database hung. And for bureaucracy reasons I haven't started backups of zabbix or postgresql so not related to anything like that. No zabbix proxy or external aspects at all. The hypervisor is a dedicated management server at the moment running nothing else but a domain controller.
No action was taken to resolve the issue (everyone was asleep), it just fixed itself. It only happened the once. I've got zabbix running in a half dozen clients in similar configurations and never seen this (though this is the only production 7.x version so far, other than my home).
Does anyone have suggestions of things to check? What could be the cause?
Linwood
In the wee hours it seemed to stop processing items for 20 +/- minutes, which caused a flurry of unavailable alerts. I think this queue issue is the cause of the unavailable (by ping) errors, but I am baffled as to the cause of the processing issue.
I've checked the syslog, zabbix log, postgresql log and see nothing unusual in them, and no discontinuities.
I've checked the hyperv logs and the windows logs for the hypervisor. Nothing unusual there. My initial thought was some kind of freeze, e.g. from a backup snapshot, but no sign of that. Time server logs show no jump or adjustment in time on either the guest or hypervisor, though my gut tells me this is some kind of time jump. There was no reboot, no restart of postgresql or zabbix server.
The Postgresql server is on the same guest, and only used for zabbix and netdisco, so it's not like an external database hung. And for bureaucracy reasons I haven't started backups of zabbix or postgresql so not related to anything like that. No zabbix proxy or external aspects at all. The hypervisor is a dedicated management server at the moment running nothing else but a domain controller.
No action was taken to resolve the issue (everyone was asleep), it just fixed itself. It only happened the once. I've got zabbix running in a half dozen clients in similar configurations and never seen this (though this is the only production 7.x version so far, other than my home).
Does anyone have suggestions of things to check? What could be the cause?
Linwood
Comment