Ad Widget

**kloczek** · 02-07-2018, 09:20

Zabbix processes did not restarts themselves.

**TrevorD** · 03-07-2018, 02:35

What are you referring to kloczek ? Are you referring to the events in the log file ? below is another example of Zabbix server stopping and I and I had to restart zabbix server the following morning.

21528:20180627:001739.285 [file:vmware.c,line:86] zbx_mem_malloc(): out of memory (requested 156 bytes)
21528:20180627:001739.285 [file:vmware.c,line:86] zbx_mem_malloc(): please increase VMwareCacheSize configuration parameter
21488:20180627:001739.887 One child process died (PID:21528,exitcode/signal:1). Exiting ...
21488:20180627:001741.952 syncing history data...
21488:20180627:001741.959 item "4c4c4544-0035-4810-8036-c6c04f543932:vmware.hv.status[{$URL},{HOST.HOST}]" became not supported: Unknown hypervisor uuid.
21488:20180627:001741.959 item "4c4c4544-0043-5010-8037-b6c04f523432:vmware.hv.datastore.size[{$URL},{HOST.HOST},T2_VMDatastore7_R]" became not supported: Unknown hypervisor uuid.
21488:20180627:001741.960 item "4c4c4544-0052-4b10-8044-b8c04f523432:vmware.hv.datastore.size[{$URL},{HOST.HOST},T2_VMDatastore14_R,pfree]" became not supported: Unknown hypervisor uuid.
21488:20180627:001741.960 item "4c4c4544-0035-4b10-8034-c6c04f543932:vmware.hv.datastore.read[{$URL},{HOST.HOST},T1_VMDatastore9_NR,latency]" became not supported: Unknown hypervisor uuid.
21488:20180627:001741.960 item "4c4c4544-0035-4410-8035-c6c04f543932:vmware.hv.datastore.read[{$URL},{HOST.HOST},T1_VMDatastore1_R,latency]" became not supported: Unknown hypervisor uuid.
21488:20180627:001741.960 item "4c4c4544-0035-4410-8035-c6c04f543932:vmware.hv.datastore.write[{$URL},{HOST.HOST},T1_VMDatastore11_NR,latency]" became not supported: Unknown hypervisor uuid.
21488:20180627:001741.960 item "4c4c4544-0035-4810-8036-c6c04f543932:vmware.hv.datastore.size[{$URL},{HOST.HOST},T2_VMDatastore13_NR]" became not supported: Unknown hypervisor uuid.
21488:20180627:001741.969 syncing history data done
21488:20180627:001741.969 syncing trend data...
21488:20180627:001743.341 syncing trend data done
21488:20180627:001743.342 Zabbix Server stopped. Zabbix 3.4.7 (revision 77720).
835:20180627:082300.482 Starting Zabbix Server. Zabbix 3.4.7 (revision 77720).
835:20180627:082300.488 ****** Enabled features ******
835:20180627:082300.488 SNMP monitoring: YES
835:20180627:082300.488 IPMI monitoring: YES
835:20180627:082300.488 Web monitoring: YES
835:20180627:082300.488 VMware monitoring: YES

Below are the zabbix stats before adding vcenter.

Stats after adding vcenter

Are there settings I should optimize in the config file for the above NVP and number of items ?

Thanks

Trev

**Atsushi** · 03-07-2018, 03:26

The log is output as follows, but has the VMwareCacheSize setting been adjusted?

Code:

21528:20180627:001739.285 [file:vmware.c,line:86] zbx_mem_malloc(): out of memory (requested 156 bytes)
21528:20180627:001739.285 [file:vmware.c,line:86] zbx_mem_malloc(): please increase VMwareCacheSize configuration parameter

**TrevorD** · 03-07-2018, 03:51

Hey Atsushi,

Yes this config has been updated as per below:

### Option: VMwareCacheSize
# Size of VMware cache, in bytes.
# Shared memory size for storing VMware data.
# Only used if VMware collectors are started.
#
# Mandatory: no
# Range: 256K-2G
# Default:
VMwareCacheSize=2048M

**TrevorD** · 03-07-2018, 07:24

I have just added VCenter server back into Zabbix this morning so I will keep my eye on it to see if the issue occurs again. Zabbix server seems to stop at around 17:00 - 18:00 and I have to restart the following morning.

CPU and memory stats are still not updating, I had some probs with the changes I made in the config file this morning so server was offline as you can see, then Zabbix appeared to update stats for about 10 minutes then nothing after that,

**TrevorD** · 04-07-2018, 02:15

So I monitored the performance of Zabbix last night and this morning and it looks like that any performance related issues co-insides with our VMware backups. I ended up disabling VMware Event log checks in the standard VMware template. Looks like Zabbix server has held up and did not stop overnight.

Looks like CPU and memory stats are now updating as required. Might have been the simple fact of re-adding the VCenter server back to Zabbix after upgrade to 3.4.11 or the disabling of the VCenter log check

.

Preprocessing manager process was still maxing out for a period of time and not really sure how to resolve this,

At the end of the day we are not using Zabbix as our primary monitoring solution and we are only using the appliance to monitor a subset of infrastructure. We will complete a full Zabbix build on CentOS or the like when we look at migrating completely to Zabbix and try and tune it appropriately.

Trev

**Pada** · 08-04-2019, 20:23

I now had the same issue with Zabbix 4.2 the moment I started monitoring vCenter (via a Zabbix 4.2 Proxy running in Docker).

I tried to resolve the crashing & performance issues by:

gradually increasing the VMWARECACHESIZE on the Proxy (by restarting docker container) till it stopped crashing with the memory error when I had it set to 384M, however then I started getting Proxy history cache issues.
then gradually increased HISTORYCACHESIZE on the Proxy to like 256M, but then my Proxy MySQL server got stuck (high CPU usage) on a commit statement and my Proxy got completely unresponsive in terms of gathering/forwarding metrics
increasing my Proxy MySQL resources (RAM, CPU), however this ended up just overloading my Zabbix Server, where it now started having History cache issues and my Server MySQL had very high CPU utilization suddenly too.

The culprit ended up being the "vmware.eventlog[{$URL}]" inside the "Template Virt VMware" template!
After I disabled it, my Server MySQL (Aurora MySQL db.t2.medium) CPU usage dropped from 40% down to 10% and my Zabbix nvps (new values per second) dropped from a peak of 2.2k down to 80nvps.

When I manually ran a count query for the amount of log entries that vmware.eventlog generated, it was 750,000 entries within like 8 hours.

**dougbee** · 18-07-2019, 17:45

Originally posted by Pada

I now had the same issue with Zabbix 4.2 the moment I started monitoring vCenter (via a Zabbix 4.2 Proxy running in Docker).

I tried to resolve the crashing & performance issues by:

gradually increasing the VMWARECACHESIZE on the Proxy (by restarting docker container) till it stopped crashing with the memory error when I had it set to 384M, however then I started getting Proxy history cache issues.
then gradually increased HISTORYCACHESIZE on the Proxy to like 256M, but then my Proxy MySQL server got stuck (high CPU usage) on a commit statement and my Proxy got completely unresponsive in terms of gathering/forwarding metrics
increasing my Proxy MySQL resources (RAM, CPU), however this ended up just overloading my Zabbix Server, where it now started having History cache issues and my Server MySQL had very high CPU utilization suddenly too.

The culprit ended up being the "vmware.eventlog[{$URL}]" inside the "Template Virt VMware" template!
After I disabled it, my Server MySQL (Aurora MySQL db.t2.medium) CPU usage dropped from 40% down to 10% and my Zabbix nvps (new values per second) dropped from a peak of 2.2k down to 80nvps.

When I manually ran a count query for the amount of log entries that vmware.eventlog generated, it was 750,000 entries within like 8 hours.

Thanks for this! I was having the same issue - even with testing on a small VMware cluster, my History Cache usage would rocket up. Trying to stop zabbix-server would result in a slow history sync (normally it shuts down within a few seconds) and eventually failing.

Disabling the eventlog item fixed it for me as well!

Ad Widget

Adding VCenter server to Zabbix causes performance problems

Adding VCenter server to Zabbix causes performance problems

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment