Two minor annoyances: Upgrade and Monitoring Log files

  • cstackpole
    Senior Member
    Zabbix Certified Specialist
    • Oct 2006
    • 225

    #1

    Two minor annoyances: Upgrade and Monitoring Log files

    Hello,
    I have two issues that are not really a big deal, but I thought I would ask in case someone has some insight they would like to share. The first is a question regarding an upgrade, and the second is a performance question regarding log files. The server is now 1.4 and the agents are all 1.4 variants (some are the Debian Lenny 1.4 packages, others are compiled from a nightly developer download from last week). I did a quick search on the forums and didn't find anything. If I missed something glaringly obvious, I apologize.

    The first is:
    I updated my Debian server from Zabbix 1.1.7 to 1.4. It was a smooth upgrade and everything appears to work well overall. The "problem" is that I added one Linux system under 1.1.7, and I just added a new, nearly identical Linux system under 1.4. Both were assigned the standard unix_t template.

    The 1.4 host has really cool sub-menus in General->Latest Data, while the 1.1.7 host has a flat view of everything. When I create a new item on the 1.4 host I can pick which sub-menu I want; the 1.1.7 host does not have those listings. So I went to Configuration->Hosts, and sure enough the 1.4 host has Unix_t listed as a template, but the 1.1.7 host has nothing listed as a template. "Well, there's your problem," I thought, and added Unix_t to the 1.1.7 host. I got all the cool updates, but the items were all new (e.g. the existing data did not carry over to the new Unix_t items, so I had two disk-free-space graphs).

    The only way I found to get the new look/feel/features was to delete the host and re-add it to Zabbix. That loses all the data, which is not desirable for several systems. I can do without the updated features on those systems, but I thought I would ask if there is something I am missing or something I could do. Any tips, hints, or suggestions?


    Second:
    I have a log file I am monitoring. There are keywords I need to watch for, and different actions need to be taken depending on the keyword. This log file is updated several times a second.
    So I have items like:
    log[/my/log/file]
    log[/my/log/file,ALERT]
    log[/my/log/file,ERROR]
    log[/my/log/file,HeartBeat]

    Basically I am monitoring the same file four times and triggering off of those items. At first I couldn't really detect any performance hit; on the nodes themselves it does not appear to make one lick of difference. Then I added 10 systems to the server, and it makes a BIG difference. Monitoring 30 systems (those 10 plus an additional 20) without those 4 log items, my dual P4 3 GHz with a GB of RAM sits happily at about 15-20% per processor. When I enable those 4 items on those 10 systems, within about 15 minutes my server sits at about 80-95% load.
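    For reference, the triggers hanging off those items look roughly like this (a sketch only; "myhost" and the 300-second heartbeat window are placeholders for whatever your setup actually uses):

    {myhost:log[/my/log/file,ALERT].str(ALERT)}=1
    {myhost:log[/my/log/file,ERROR].str(ERROR)}=1
    {myhost:log[/my/log/file,HeartBeat].nodata(300)}=1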

    Can someone suggest a better method?

    Thanks for your input!
    cstackpole
  • Alexei
    Founder, CEO
    Zabbix Certified Trainer
    Zabbix Certified Specialist
    Zabbix Certified Professional
    • Sep 2004
    • 5654

    #2
    You may consider using a single log[] with a complex GNU-style regular expression as a second parameter:

    log[/my/log/file,"Complex regular expression which will match ALERT, ERROR, whatever"]
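    For instance, something along these lines (an illustrative pattern only):

    log[/my/log/file,"ALERT|ERROR|HeartBeat"]

    The triggers can then tell the keywords apart by matching against the values of that single item.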
    Alexei Vladishev
    Creator of Zabbix, Product manager
    New York | Tokyo | Riga
    My Twitter


    • cstackpole
      Senior Member
      Zabbix Certified Specialist
      • Oct 2006
      • 225

      #3
      I will give that a try and see what happens.
      Thanks Alexei!


      • cstackpole
        Senior Member
        Zabbix Certified Specialist
        • Oct 2006
        • 225

        #4
        Hello Alexei,
        Just thought I would post back some of my experiences dealing with this high load. The server is 1.4.1 running on Debian.

        I tried your solution of combining several of the statements into one. That didn't seem to make a noticeable difference. So I continued to work on the expressions.

        Then last night, one of these servers went down. Zabbix never sent an email, and the frontend still showed green the whole time it was down. When the server came back online, the "server {HOSTNAME} is unreachable" trigger went grey. The history shows "none" for the status for almost a week! I still got alerts when I would run a process to max out the memory, push the load, or fill the hard drive, but I could turn off the agent (and/or the server!) and never get an email. I couldn't get the unknown state to commit to ON or OFF.

        So off to the forums I went. I found this: http://www.zabbix.com/forum/showthre...5&page=1&pp=10

        and began working my way through it. I verified that only the 1.4.1 Linux agents were going into the unknown state and all the other systems were OK (the 1.1.7 and 1.1.4 agents responded as they should). Nothing seemed to work for the 1.4.1 agents, though. I stopped the agents on the systems, then stopped the Zabbix server, and brought it all back up. Still unknown.

        So I tried again. Brought down the agents. Brought down the server. I even turned off Apache this time. I don't know why I decided to check, but I ran ps to verify Zabbix was stopped; it was. That's when I noticed MANY, MANY MySQL entries; way more than I had seen before on this system. top was showing that the load had not died down significantly after Zabbix was stopped (it went from >5.5 to hovering around 5).

        I stopped MySQL, checked that it was gone (ps again) and that the load had dropped, then started MySQL, Apache, zabbix_agentd, and zabbix_server. I brought up all the other agents as well, and the UNKNOWN status is gone! Zabbix reports machines when they are offline/rebooted again, the status is updated, and, most importantly, with the original set of log items enabled the load is sitting below 1.5!
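        For the record, the sequence was roughly the following (the exact script and binary names depend on how things were installed here, so treat them as placeholders):

        /etc/init.d/mysql stop
        ps aux | grep -E 'mysqld|zabbix'    # confirm everything is really down
        /etc/init.d/mysql start
        /etc/init.d/apache2 start
        zabbix_agentd                       # the daemons built from source are started directly
        zabbix_server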

        I am really not certain what happened. Zabbix appeared to function just fine with 1.4 before I added the log items/triggers. I noticed the increase in load after enabling them, but it seems as if it was MySQL that was actually using all the resources. My concern is that MySQL just got backlogged trying to keep up, and if that is true I can expect this to happen again. I will keep a closer eye on this and let you know if it happens again. I can always reboot my development machine every couple of days and make sure that Zabbix spots it.

        Before this moment flees too far from my memory, is there anything that might be helpful to you guys? I know you have worked on this issue before so if there is something that might help out, please let me know. There wasn't anything in the logs that looked important to me or different from normal, but I could be wrong. I also grabbed a screenshot (though it looks similar to those already posted in the other posts).

        If I can help, please let me know.
        cstackpole


        • Alexei
          Founder, CEO
          Zabbix Certified Trainer
          Zabbix Certified Specialist
          Zabbix Certified Professional
          • Sep 2004
          • 5654

          #5
          As far as I am aware, Debian and Ubuntu perform integrity checks on MySQL databases when MySQL is restarted.

          Also, you may check MySQL processes and SQL statements (mysqladmin processlist) to see what exactly is going on when CPU load is too high.
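          For example (credentials are placeholders):

          mysqladmin -u root -p processlist

          or, from within the mysql client:

          SHOW FULL PROCESSLIST;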
          Alexei Vladishev
          Creator of Zabbix, Product manager
          New York | Tokyo | Riga
          My Twitter


          • bbrendon
            Senior Member
            • Sep 2005
            • 870

            #6
            Regarding your first issue: that is the behavior I experienced as well when I upgraded. For some trends I didn't want to lose, I fixed it manually at the database level.

            When I upgraded, I lost almost all my history.
            Unofficial Zabbix Expert
            Blog, Corporate Site

