Ad Widget

**badbob001** · 26-05-2020, 21:03

Tried looking through the source code and I have to admit that it has been a very long time since I've looked at C code.

In cpustat.c, I see avg1, 5, and 15 used here:

Code:

int get_cpustat(AGENT_RESULT *result, int cpu_num, int state, int mode)
{
...

switch (mode)
{
   case ZBX_AVG1:
[B]time[/B] = SEC_PER_MIN;
   break;
   case ZBX_AVG5:
[B]time[/B] = 5 * SEC_PER_MIN;
   break;
   case ZBX_AVG15:
[B]time[/B] = 15 * SEC_PER_MIN;
   break;
   default:
   return SYSINFO_RET_FAIL;
}

..

if (1 == cpu->h_count)
{
   for (i = 0; i < ZBX_CPU_STATE_COUNT; i++)
      total += cpu->h_counter[i][idx_curr];
   counter = cpu->h_counter[state][idx_curr];
}
else
{
   if (0 > (idx_base = idx_curr - MIN(cpu->h_count - 1, time)))
         idx_base += MAX_COLLECTOR_HISTORY;

   while (SYSINFO_RET_OK != cpu->h_status[idx_base])
      if (MAX_COLLECTOR_HISTORY == ++idx_base)
         idx_base -= MAX_COLLECTOR_HISTORY;

   for (i = 0; i < ZBX_CPU_STATE_COUNT; i++)
[B]total[/B] += cpu->h_counter[i][idx_curr] - cpu->h_counter[i][idx_base];
   counter = cpu->h_counter[state][idx_curr] - cpu->h_counter[state][idx_base];
}

...

SET_DBL_RESULT(result, 0 == [B]total[/B] ? 0 : 100. * (double)counter / (double)total);

https://git.zabbix.com/projects/ZBX/...ture/ZBX-15210

As best I can make out, avgX affects the size of integer time, which then affects the number for idx_base (starting point in metrics to look at?). And then variable total is cumulative sum of the current metric value minus base metric value... I'm guessing the "current minus base" aspect is related to how /proc/stat stores cpu time cumulatively from the start of the system. Unclear what counter is but I'm guessing the count of number of items between base and current.

And the last line I think says return 0 if total is 0 otherwise return "100 x counter / total", which I don't totally don't understand. Some sort of average formula that results in a percentage? I would expect an average formula to be like: (total / counter) x 100.

Still unclear if this means that sampling is one-per-second or that is just the minimum possible sampling. Or maybe that depends on the update interval set for the item to be monitored? I have my template item for cpu percent set at 1m update interval. Would that mean that avg1 of a 1-minute sample would just be the same value without any averaging?

Ad Widget

How is the average of proc.cpu.util calculated (avg1, avg5, avg15)?

How is the average of proc.cpu.util calculated (avg1, avg5, avg15)?

Comment