I've been seeing these messages since ramping up our monitoring load here, and I've seen them with 1.8, 1.8.3, and 1.8.4 as well. I'm not seeing anything in the logs that appears to be a triggering scenario for this behavior.
Example log entry:
10907:20110107:123208.317 [Z3005] Query failed: [1062] Duplicate entry '100100000047086-1294419600' for key 'PRIMARY' [insert into trends (itemid,clock,num,value_min,value_
avg,value_max) values (100100000047086,1294419600,1,96.292486,96.292486, 96.292486),(100100000046969,1294419600,1,96.294073 ,96.294073,96.294073),(100100000046970,1294419600, 1
,96.294073,96.294073,96.294073);
]
The extra ']' always appears on the line following the duplicate entry messages.
All history and trends tables were truncated back to empty earlier this week.
While I don't think its the only cause of the notification storms I've seen, I can't help but think it's related in some way. I had one proxy worth of systems start sending alerts shortly before the first of these messages showed up since upgrading to 1.8.4 yesterday.
The database is exclusive to zabbix, running on a quad-core AMD opteron with 6G ram. Disk is 2* 150G internal disks with software mirroring for the OS and DRBD on one partition from each disk to an identical partner system as part of an HA pair. Data is placed on one disk and InnoDB logs on the other disk.
5,632 active hosts
41,618 active items
39,468 active triggers
110.76 required values per second
Most hosts are behind proxies each serving 70 nodes, for a total of 78 proxies involved.
Is anyone else running anywhere close to this scale successfully?
Example log entry:
10907:20110107:123208.317 [Z3005] Query failed: [1062] Duplicate entry '100100000047086-1294419600' for key 'PRIMARY' [insert into trends (itemid,clock,num,value_min,value_
avg,value_max) values (100100000047086,1294419600,1,96.292486,96.292486, 96.292486),(100100000046969,1294419600,1,96.294073 ,96.294073,96.294073),(100100000046970,1294419600, 1
,96.294073,96.294073,96.294073);
]
The extra ']' always appears on the line following the duplicate entry messages.
All history and trends tables were truncated back to empty earlier this week.
While I don't think its the only cause of the notification storms I've seen, I can't help but think it's related in some way. I had one proxy worth of systems start sending alerts shortly before the first of these messages showed up since upgrading to 1.8.4 yesterday.
The database is exclusive to zabbix, running on a quad-core AMD opteron with 6G ram. Disk is 2* 150G internal disks with software mirroring for the OS and DRBD on one partition from each disk to an identical partner system as part of an HA pair. Data is placed on one disk and InnoDB logs on the other disk.
5,632 active hosts
41,618 active items
39,468 active triggers
110.76 required values per second
Most hosts are behind proxies each serving 70 nodes, for a total of 78 proxies involved.
Is anyone else running anywhere close to this scale successfully?
Comment