No announcement yet.

Standing up Zabbix in AWS

  • Filter
  • Time
  • Show
Clear All
new posts

    Standing up Zabbix in AWS

    Hi all. I am entering a PoC to bring Zabbix to AWS utilizing an RDS backend. I have done a handful of reading around The Google to find that it has left me with more questions than answers.

    First, I am reading a lot of users are still using Zabbix 3.0 vs the latest (not in development) I see the latest production version 3.4 is not LTS, does it have any plans on supporting 3.4 for LTS, or are they skipping straight to 4.0 which is in development?

    Second, I am reading a lot of users that are having problems with performance while utilizing an AWS RDS backend. A lot of them seem to be in regards to its performance during its backup windows when all the Zabbix agents become unavailable, and the Admins are flying blind as to what is happening during this backup window. Can anyone enlighten me on some of the cons, or do they have an positive experience using Zabbix with a RDS backend? We will be looking to support ~600 servers with this solution.

    A few of my concerns are that we implement Zabbix 3.2 for the LTS support, and then by the time we migrate fully into AWS it will be time for an overhaul already. I would like to plan for four years out of possible. Lastly, performance on an RDS instance will be huge. My PoC for this will only be 20 to 30 systems at the most, it would be impossible for all 600. Any future insight into possible blockers or potential problems would be appreciated greatly.

    Thanks for your time all. Looks like you have a pretty strong community here.

    Zabbix 4.0 will be the next LTS. The Zabbix Life Cycle changed with Zabbix 3.0 and up so that the x.0 releases are the LTS versions.

    RDS should be fine for Zabbix. Just as with setting up your own DB server, be sure to size it properly. With any database solution, you will lose performance during backups. So be sure that you factor in your backups and how much of a performance impact taking a backup will make.

    For instance, I run a 3 node MySQL 5.6 Galera Cluster for my Zabbix DB server.
    They are m4.2xl instances (8CPU+32GB ). My DB is currently 100GB in size, and I monitor ~150 items per second over ~200 hosts. When my full backup fires off, my iops spike to 1200. However, my monitoring is not affected as the disks I've provisioned are more than able to handle it.

    I also have RDS instances in AWS for non-Zabbix usage, and same with them, I've provisioned a beefy instance with dedicated iops so that when it snapshots, I don't have an app performance issues.

    You're going to want to over provision your RDS instance, as well as the disk. It's better to have too many resources than not enough. There's no such thing as "just enough".


      Reposted below
      Last edited by w00tburger; 14-03-2018, 19:18. Reason: Reposted below


        Does anyone happen to have a hard date as to the release other than Q1? I am literally at the point where I am standing my PoC up in our AWS using Zabbix 3.0 since its the current LTS. I would rather wait on my PoC if I know that Zabbix 4 is going to be out within the next month or so (or at the very least, use the alpha as our PoC), rather than going through the headache of deploying 3, just to be upgrading once the AWS migration is complete.

        Also, I had been working with the auto-discovery with Zabbix and was curious, is there still no way to automatically de-register a host if it scales down in AWS, or will that require custom scripts on the servers and an interaction with the Zabbix API?


          There's no hard date. Considering 4.0.0 alpha2 was only released in January, and there have been no betas released yet, there's probably a few more months of waiting.


            when scaling is needed, you might want to take a look at cockroachDB for the backend database. Zabbix needs a few minor tweaks te be able to use it. Also, when a dbnode crashes, in a 3 node cluster, the rest keeps your application working without interuption. As a matter of fact, as long as more than half of the db nodes is working, the db remains available.


            Ask questions to Zabbix Dev Team in person at the Zabbix Summit 2018!