Hello all:
Sorry this is a bit long winded...so I have been a Nagios user for a long time and am (shiny) brand new to Zabbix. I will try not to bore you with Nagios implementation details but some things about Zabbix are not patently obvious to someone who is a pretty hard-core Nagios type. Zabbix looks pretty awesome from a frontend and visualization standpoint and seems to bring all the functionality of Nagios, perfParse, Cacti and probably a few more things together under one system. I am trying to map certain concepts from Nagios to Zabbix, here are my questions:
Nagios in of itself is reasonably hard to configure, I would expect any capable tool (including Zabbix) to have some learning curve. I really like Zabbix so far since it goes beyond point in time monitoring and handles metrics over time very well. Cacti can do this as a companion to Nagios but it seems to me that your monitoring and metric gathering should happen at the same time to reduce network load, and your web frontend looks great too! I would like to replace my Nagios/PerfParse/Cacti with Zabbix if the above functionality can be reasonably remapped. I am sure I will have a whole bunch more questions once I give this more thought
Thanks for your time.
/Chris
Sorry this is a bit long winded...so I have been a Nagios user for a long time and am (shiny) brand new to Zabbix. I will try not to bore you with Nagios implementation details but some things about Zabbix are not patently obvious to someone who is a pretty hard-core Nagios type. Zabbix looks pretty awesome from a frontend and visualization standpoint and seems to bring all the functionality of Nagios, perfParse, Cacti and probably a few more things together under one system. I am trying to map certain concepts from Nagios to Zabbix, here are my questions:
- Service Checks - Nagios is great for monitoring service availability, like dns, mail, ftp, you name it. The check is performed from a client perspective via a plugin so that even if the service is running on the host, if the nagios plugin can't interact with the service then it returns warning or critical. Zabbix seems geared toward getting metrics *from* individual hosts like disk, cpu, and if mysql or apache is running, but how do you approach Zabbix to say "check to see if this website is available on that remote host"?
- Host Checks - Nagios checks the availability of a host when a "service" goes unavailabe and if the host is down then it surpresses notifications for all the services associated with that host until it comes back. If I setup Zabbix triggers for a bunch of things on a host, will I get them all if the host itself is down? Nagios can check host level items like disk space, cpu, etc. but the *easy* performance metrics and graphing are nonexistent. Zabbix seems to do this fantastically.
- Parent/Child Hierarchies - when defining a well planned Nagios installation you define parent child relationships between devices, i.e. a server is behind a switch so if the switch goes down don't check or alert on the server until the switch comes back. Service checks can also have dependencies, i.e. a web server uses a mysql backend on another server, so if that mysql is down don't check or alert on the web service until the mysql comes back...
- Nagios Plugins - there is a great mindshare of plugins for Nagios that do everything under the sun. Rather than roll new solutions is there any way to reuse the nagios plugins to perform checks? For example I have a dns server, and I want to query it for a particular record and return warning if a response time is "x" threshold, critical if response time is "y" threshold, or critical if the record is not found or does not match a predefined value. The check_dns plugin can do all that, and that is actually a minor example.
- Notification and Escalation - Nagios has escalations based on conditions. For example, check_dns 5 times over a period of 5 minutes before alerting. If an alert condition is still present follow this chain, alert sysadmins group. If alert goes unacknowledged for 1 hour then alert sysadmins AND 2nd level support. Does Zabbix have this kind of escalation?
- Reports - How do you create reports in Zabbix? For example I have a weekly rollup report that shows is a list hosts and their associated services that I kick back to management, it just has columns for the various states that host or service was over dd/dd/dddd - dd/dd/dddd period. The columns are percentages of up, down, warning, or critical states. I cannot see how to make something analogous in Zabbix
- Zabbix Templates - so I create a host and assign it the Unix_t template which associates a bunch of items with that host. Some are applicable, some are not. how do I delete the ones that are not? for instance not every host runs imap server
Once I create a host with a set of items I like, can I clone that entry? This is pretty easy to do in Nagios using Monarch.
Nagios in of itself is reasonably hard to configure, I would expect any capable tool (including Zabbix) to have some learning curve. I really like Zabbix so far since it goes beyond point in time monitoring and handles metrics over time very well. Cacti can do this as a companion to Nagios but it seems to me that your monitoring and metric gathering should happen at the same time to reduce network load, and your web frontend looks great too! I would like to replace my Nagios/PerfParse/Cacti with Zabbix if the above functionality can be reasonably remapped. I am sure I will have a whole bunch more questions once I give this more thought

Thanks for your time.
/Chris
) , so please correct me if I say something incorrect.

Comment