I'm a Database Admin for a large media company. We currently support an environment of about 200 database servers (MySQL, Sybase, Oracle and MS SQL). Our systems group has implemented Nagios and we use RT for ticket tracking.
We find Nagios to be slow and tedious to use, and its paging escalation features appear to be basic at best. Here's what I'd like to do with paging:
We have a rotating primary on-call DBA who changes every week. Additionally, we have a secondary who takes tickets if the primary is unavailable. Currently, all alerts sent from outside Nagios (which is most of them) go to everybody, which means the secondary gets as little sleep as the primary. Nagios does of course have a primary/secondary escalation feature, but we haven't integrated it into most of our systems because configuring it is such a hassle.
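To make the rotation concrete, here is a rough sketch (Python, purely for illustration) of the routing I have in mind. The roster names, the `on_call` and `page_targets` helpers, and the 20-minute escalation window are all assumptions of mine, not anything Nagios or Zabbix provides.

```python
from datetime import date

# Hypothetical roster; the names are placeholders.
ROSTER = ["dba_a", "dba_b", "dba_c", "dba_d"]


def on_call(today, roster=ROSTER):
    """Return (primary, secondary) for the Monday-based week containing `today`."""
    # date.toordinal() counts days from 0001-01-01, which was a Monday,
    # so this week index changes every Monday and never resets at New Year.
    week = (today.toordinal() - 1) // 7
    primary = roster[week % len(roster)]
    secondary = roster[(week + 1) % len(roster)]
    return primary, secondary


def page_targets(today, minutes_unacked, escalate_after=20, roster=ROSTER):
    """Page only the primary until the alert has gone unacknowledged too long."""
    primary, secondary = on_call(today, roster)
    if minutes_unacked < escalate_after:
        return [primary]
    return [primary, secondary]
```

With this, the secondary is only paged once the primary has failed to respond within the window, and this week's secondary becomes next week's primary.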
Ideally, when we get an alert (via BlackBerry email), I'd like the primary to be able to issue a command directly to the paging system by replying to the email from the BlackBerry. Commands would be things such as:
Acknowledged (e.g. I'm checking this out, don't page me for another 20 minutes)
Pass to (another DBA)
Close (I've finished this)
Note (Here's what I did)
Ideally it would open a ticket in RT with the relevant info, then add notes to the ticket and close it once the issue is resolved.
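As a rough sketch of the reply handling described above (Python, purely for illustration): the first non-quoted line of the reply is treated as the command. The keyword spellings, the `Reply` type, and `parse_reply` are hypothetical names of mine; the actual RT calls (open ticket, add note, close) are deliberately left out.

```python
import re
from dataclasses import dataclass

# Assumed keyword spellings; each maps to one of the four actions above.
COMMANDS = {
    "ack": "acknowledge",
    "acknowledged": "acknowledge",
    "pass": "pass",
    "close": "close",
    "note": "note",
}


@dataclass
class Reply:
    action: str
    argument: str = ""


def parse_reply(body):
    """Parse the first non-blank, non-quoted line of an email reply.

    Returns a Reply, or None if that line is not a recognised command.
    """
    for line in body.splitlines():
        line = line.strip()
        if not line or line.startswith(">"):
            continue  # skip blank lines and the quoted original alert
        word, _, rest = line.partition(" ")
        action = COMMANDS.get(word.lower().rstrip(":"))
        if action is None:
            return None
        rest = rest.strip()
        if action == "pass":
            # Accept both "pass to alice" and "pass alice".
            rest = re.sub(r"^to\s+", "", rest, flags=re.IGNORECASE)
            if not rest:
                return None  # a hand-off needs a target
        return Reply(action, rest)
    return None
```

For example, a reply of "Pass to alice" would yield the action `pass` with argument `alice`, and "Note restarted replication" would yield a `note` whose text could be appended to the ticket.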
I was originally going to write something in Perl, but if Zabbix can do this already, then so much the better.
Anyone?
Thanks,
Steve