I've got some "interior node" it services that do not have any triggers set on them, which are set to "problem, if at least one child has a problem".
Occasionally, i see where the child it services have all gone green (status = 0) but the interior nodes are still status = 4. I can go into the database and do a select on the services_links database to find all the child nodes and none of them have a non-zero status. I have to manually twiddle the database to set the status = 0 in the services table to fix it.
I don't know how zabbix sets those status fields, but some race condition or bug is getting exercised every now and then and failing to clear parent it services correctly (most of the time this works, it must be a 1-in-100 or 1-in-1000 kind of bug).
Occasionally, i see where the child it services have all gone green (status = 0) but the interior nodes are still status = 4. I can go into the database and do a select on the services_links database to find all the child nodes and none of them have a non-zero status. I have to manually twiddle the database to set the status = 0 in the services table to fix it.
I don't know how zabbix sets those status fields, but some race condition or bug is getting exercised every now and then and failing to clear parent it services correctly (most of the time this works, it must be a 1-in-100 or 1-in-1000 kind of bug).
Comment