We're having some problems with the housekeeper process not being able to keep up and I'm wondering if there is an easy way to "manually" do the housekeeping.
Awhile back, the housekeeper was causing a tremendous amount of load on our DB server. At the time we were running zabbix 1.6.5 with postgres 8.3 on the backend, and it turned out that someone had done a mass update of several hosts, unlinking and re-linking some rather large templates. This caused the housekeeper to schedule a bunch of slow running deletes, purging a years worth of data for a 1,000 or more items.
We disabled the housekeeper and eventually purged the housekeeper table manually, but here is where we went wrong - no one ever re-enabled the housekeeper. I just noticed the other day as part of my work to upgrade to 1.8.3. Our postgres DB is now over 400GB and the housekeeper table has 1,700 rows in it.
I turned it back on, but after several days of a DB server load ranging from 20 to over 40, there is still 1,700 rows in the table so either the housekeeper isn't working or the process is just going to run forever. I tried starting the zabbix_server with no other workers (so no pollers, trappers, etc) to let just the housekeeper run, but it still doesn't seem to make any progress.
I can purge data easily enough, but what about the trend data? Is there a way to do that part manually, or is that most likely what zabbix is choking on? If I just purge all the data older than, say 90 days (and the housekeeper table) will zabbix have any problems with that? Other ideas?
Thanks,
-poul
Awhile back, the housekeeper was causing a tremendous amount of load on our DB server. At the time we were running zabbix 1.6.5 with postgres 8.3 on the backend, and it turned out that someone had done a mass update of several hosts, unlinking and re-linking some rather large templates. This caused the housekeeper to schedule a bunch of slow running deletes, purging a years worth of data for a 1,000 or more items.
We disabled the housekeeper and eventually purged the housekeeper table manually, but here is where we went wrong - no one ever re-enabled the housekeeper. I just noticed the other day as part of my work to upgrade to 1.8.3. Our postgres DB is now over 400GB and the housekeeper table has 1,700 rows in it.
I turned it back on, but after several days of a DB server load ranging from 20 to over 40, there is still 1,700 rows in the table so either the housekeeper isn't working or the process is just going to run forever. I tried starting the zabbix_server with no other workers (so no pollers, trappers, etc) to let just the housekeeper run, but it still doesn't seem to make any progress.
I can purge data easily enough, but what about the trend data? Is there a way to do that part manually, or is that most likely what zabbix is choking on? If I just purge all the data older than, say 90 days (and the housekeeper table) will zabbix have any problems with that? Other ideas?
Thanks,
-poul

Comment