Monitoring of more than 40,000 items with a single Zabbix Server
Zabbix supports the stable operation of a system at a prominent large-scale educational facility in West Japan
Fukuoka University, which has its campus in the southwestern part of Fukuoka City, is a prominent university in West Japan that has 9 departments, offers 31 subjects, and has more than 20,000 students studying there, and this university has been dispatching talented individuals into the world while continuing to play a vital part in its local community. The system that provides a comfortable network environment at this campus in order to accomplish the role of being the infrastructure for educational and research activities is the educational/research system “FUTURE5” (Fukuoka University Telecommunication Utilities for Research and Education 5th generation).
West Japan’s largest education facility system that operates with the concept of “undisrupted operation” in mind
As can be understood in the “5” in the name of this system, the current system is operating in its fifth generation. From the year 1994, a year in which Internet connection environments were still rare, a system was quickly constructed at Fukuoka University, and it has been used as an education and research environment ever since. While incorporating elements that had become necessary at the time of their incorporation, such as improvements in the network speed, the adoption of a wireless LAN environment, the utilization of cloud services, and the strengthening of security countermeasures, large-scale updates have continued to be made approximately every five years until now.
In the newest version, FUTURE5, with the system that was used until now as a base, various new tests were incorporated. For example, Windows 8.1 environments were provided as client systems, co-operative education and training in which tablet devices are used was introduced, the wireless LAN environment was expanded, and security was improved. This network began operation in September of 2015, after approximately 200 wireless access points were installed in East Japan’s largest campus network in which approximately 10,000 PCs are connected.
Professor Masaru Okumura, the head of the Research and Development Office of the General Information Processing Center at Fukuoka University, the center in charge of design, prior inspection, and operation monitoring of FUTURE5 after it had been implemented, said that FUTURE5 is “designed with the concept of undisrupted operation in mind”.
This is because FUTURE5 is responsible for the entire campus network including administrative operations. When a problem occurs, classrooms and research activities are majorly affected. Therefore, working to improve system reliability by providing redundant devices and network pathways and by decentralizing machine rooms, and by using monitoring to quickly detect problems that occur, it is possible to provide services of stable quality to all persons on campus. In order to create such an “undisrupted network” at Fukuoka University, Zabbix has been adopted.
A licensing system capable of handling detailed expansion and is also backed by key and abundant introduction results and information
Professor Masaru Okumura,
the head of the Research and Development Office of the General Information Processing Center at Fukuoka University
Associate Professor Shou Fujimura,
the Research and Development Office of the General Information Processing Center at Fukuoka University
Even in the successive generations of the FUTURE system, naturally, system monitoring was performed in order to allow for stable operation. However, even during the time between the large-scale updates that were performed in five-year units, small expansions and additions were made to the system. One problem was that the commercial system monitoring tools that had been used previously required additional costs to be paid during each expansion.
Also, another problem was anxiety in regards to misdetection during daily operation. Associate Professor Shou Fujimura of the Research and Development Office of the General Information Processing Center at Fukuoka University reflected on the past generations of the system by saying: “There were many cases in which the monitoring tools would issue an alert and then we would perform a survey and end up finding nothing. Because of this, we ultimately had to make judgments based on what was sensed by our human operators, which was a great burden for us.” Professor Okumura also said, “If possible, I would not want to be manipulated by tools after the start of the operation of the network.”
At that time, the monitoring tool that was suggested by Internet Incentive Japan (IIJ) was Zabbix. It was stated that, because the system is based on open source software, the introduction cost is low and additional licensing fees do not need to be paid whenever the system is expanded. Also, the selection of this system was supported further because there is an abundance of information regarding Zabbix, there are many introduction results, particularly in large-scale systems such as data centers, and because the performance is reliable.
“Unlike the past, there are now more vendors that support open source software, so the sense of unease associated with support for this software did not exist to the same extent that it did in the past. Actually, we had been using Zabbix for monitoring in 20 to 30 servers in the research room that we operate, so I was already acquainted with the software.”
Accurate monitoring of a diversity of devices and service levels from multiple vendors
In addition to network devices from multiple vendors such as Cisco Systems and Alaxalia Networks, FUTURE5 also consists of virtual bases such as VMware, as well as multiple security devices. Additionally, a variety of services such as NTP servers, Active Directory, and VPN, are operated, and all of these services are monitored with Zabbix. Working to define the SNMP Trap at Fukuoka University made it possible to create a structure in which all of these different devices could be monitored.
Also, one of the characteristics of FUTURE5 is that it is a dual-stack network with both IPv4 and IPv6 being used, and both of these are monitored by Zabbix.
“Not only is alive monitoring by means of Ping performed, but the operational statuses of upper-layer services are monitored as well. In addition to monitoring the devices, I would like to rapidly detect phenomena related to ‘errors occurring even when the machine has sent a response’ by monitoring circumstances such as whether or not Web response times are restricted to an allowable range and whether or not authentication systems are appropriately returning responses from the viewpoint of end users.” (Professor Okumura)
Currently, over 200 switches, 1,200 devices, and 43,000 items are monitored with Zabbix Server 2.2 on a virtual base. The structure does not employ a redundant configuration of Zabbix servers, but VMware vMotion is used in order to absorb problems at the virtual base side when they occur.
Smooth introduction thanks to support from IIJ, and the monitoring of IoT devices in our field of vision
Mr. Kiyotaka Watanabe,
Engineer, Professional Service Section 1, Engineering Department, IIJ Kyushu Branch
The work of introducing Zabbix at Fukuoka University progressed with support provided by IIJ. “Even in such an extremely large-scale system such as FUTURE5, we were able to complete the construction of a monitoring portion in which Zabbix is used, by using a template to complete the project in short stages. In this way, the processes from the construction stage to the utilization of Zabbix played a role in the construction of FUTURE5.” (Kiyotaka Watanabe, Engineer, Professional Service Section 1, Engineering Department, IIJ Kyushu Branch)
After FUTURE5 went into full operation, there weren’t any major problems in particular. At first, there was the impression that the system would be easy to use once you become used to the visible interfaces that are difficult to deal with. Also, you could say that making fine adjustments during operation, such as by adding monitoring items and performing tuning, the occurrences of misdetection decreased.
Fukuoka University has also added its own plans. For example, over 200 switches are in operation in FUTURE5, but when an alert e-mail was received in the past, it was not possible to know the name of the building and the floor number of the switch in question or the rack holding the server in question, so the technicians had to create a register of the devices and specify them one by one. “One or two devices would not have been such a problem, but we could not handle having multiple alerts coming in at once. Therefore, we created a naming scheme during the initial design phase in order to enable operation of this system in fashion that would allow the names of devices to be intuitively understood simply by looking at the alert.” (Assistant Professor Fujimura)
In any case, the prerequisite of this system is stable quality that supports education and research without any disruptions occurring in the system. “We are currently operating FUTURE5 under the assurance that even parts of the system that are not visible to users promptly detect and deal with problems in order to help improve the quality of services.” (Professor Okumura)
1 Zabbix Server
Number of sites: 1
Number of monitored devices: 1,176
Number of triggers: 8,609
Number of item: approximately 43,333
Number of users: approximately 15
Hardware for Zabbix server
Want to share your story of using Zabbix Monitoring Solution?
|Fill out this Questionnaire|
Or contact our Marketing Team for further assistance.