Try our new research platform with insights from 80,000+ expert users
it_user137748 - PeerSpot reviewer
Software Test Engineer at a comms service provider with 501-1,000 employees
Vendor
The monitoring and notification when an issue arises are the most valuable features.

What is most valuable?

The monitoring and notification when an issue arises.

How has it helped my organization?

It was essential we had to have live monitoring as we were trial-ing a real-time communications application on a live site.

What needs improvement?

While the UI is functional, I have always thought there was room for improvement here. It is at times assumptive for the not so technical, rather than informative.

For how long have I used the solution?

I've been using it for 12 to 18 months. Presently I work as a software test engineer and my employer has been using Nagios for approximately ten years now. I first used Nagios in January 2014. I was in a small startup company based in the TSSG called Kodacall, it was based around WebRTC. I implemented in order that we could monitor our network as we had some clients who very kindly were letting us test our product on their live site. I really found Nagios a perfect fit for us and a good product to work with. As the saying goes “It does exactly as it says on the tin”. I hope what I have given might be of some small help.

Buyer's Guide
Nagios XI
January 2025
Learn what your peers think about Nagios XI. Get advice and tips from experienced pros sharing their opinions. Updated: January 2025.
831,265 professionals have used our research since 2012.

What was my experience with deployment of the solution?

It was deployed as a stand alone tool and is also integrated with Docker.

What do I think about the stability of the solution?

It was deployed as a stand alone tool and is also integrated with Docker.

What do I think about the scalability of the solution?

It was deployed as a stand alone tool and is also integrated with Docker.

How are customer service and support?

I didn’t require any service as the documentation is good and most questions are answered on any number of forums.

How was the initial setup?

Setup ran pretty smooth. It's not overly difficult but it does require patience and some experience would make it easier. Like everything it is easier the second time around.

What about the implementation team?

We implemented it in-house.

Which other solutions did I evaluate?

Although we did look at other tools, we felt Nagios was a good fit for us.

What other advice do I have?

I would have no hesitation in recommending it.

Disclosure: I am a real user, and this review is based on my own experience and opinions.
PeerSpot user
PeerSpot user
NOC Engineer at a financial services firm with 501-1,000 employees
Vendor
It provides basic and complex monitoring of our environment as well as application-level monitoring, but notes can only be managed through the database.

What is most valuable?

There are a lot of plugins available and you can also write your own.

How has it helped my organization?

The product allows comprehensive monitoring of your environment. From basic monitoring of your servers health, to more complex tasks like AD, database, and, to some extent, application-level monitoring. Another great feature is that the event handlers allow you to automate repetitive tasks in your environment.

What needs improvement?

There's no convenient way to manage notes as the only way is to manage them through the database.

For how long have I used the solution?

I've used it for three years.

What was my experience with deployment of the solution?

No issues encountered.

What do I think about the stability of the solution?

No issues encountered.

What do I think about the scalability of the solution?

No issues encountered.

How are customer service and technical support?

Customer Service:

I've worked on several occasions with Nagios customer service and technical support, always received very professional and prompt responses.

Technical Support:

I've worked on several occasions with Nagios customer service and technical support, always received very professional and prompt responses.

Which solution did I use previously and why did I switch?

No previous solution used.

How was the initial setup?

It's very straightforward and and it has good documentation.

What about the implementation team?

We implemented it in-house.

What's my experience with pricing, setup cost, and licensing?

Nagios Core is a free-ware which has almost the same functionality as NagiosXI. It's a bit less convenient to use, since it doesn't have UI for the configuration.

Which other solutions did I evaluate?

No other options were evaluated.

Disclosure: I am a real user, and this review is based on my own experience and opinions.
PeerSpot user
it_user423219 - PeerSpot reviewer
it_user423219Sales Team at a tech vendor with 11-50 employees
Real User

Hi Dima, You can add config notes in Nagios Core and comments. It's not very straight forward at the moment though. I've heard it's being revamped in coming versions though. One other thing you can do is link to a wiki if you have one internally. Thanks for sharing your experience :)

Buyer's Guide
Nagios XI
January 2025
Learn what your peers think about Nagios XI. Get advice and tips from experienced pros sharing their opinions. Updated: January 2025.
831,265 professionals have used our research since 2012.
PeerSpot user
Systems Engineer at a tech services company with 501-1,000 employees
Consultant
It has somewhat helped improved the workflow and technical processes with regards to response times. A better multi-tenant environment would be ideal.

What is most valuable?

It has a wide variety of plugins in existence, but when there's no plugin available it is quite simple to integrate one's own programs and scripts.

How has it helped my organization?

It has somewhat helped improved the workflow and technical processes with regards to response times, and identify the frequent point of failures so as to architect alternative solutions. Plus all the stuff about SLA is an area that for me that is hard to quantify, but there's a team of accountant types who are dedicated to pulling numbers out of their hats.

What needs improvement?

It would be nice to have a better, or alternative, dashboard, à la Thruk, to see business process groupings, rather than just host and services. Tis might give a better visual representation of how the company is performing, and the availability of mission critical services etc.

A better multi-tenant environment would be ideal where certain users have limited visibility instead of just limited functionality.

For how long have I used the solution?

I've been using it for about six or seven years. We currently have v3.4.1 in production and are currently developing and testing v4.1.1 prior to implementation. Currently, we use various plug-ins including NRPE, NSCA, NagiosQL 3.2.0, PNP4Nagios 0.6.24, NSclient++ 0.3.9, and nagios-plugins 2.1.1.

What was my experience with deployment of the solution?

We have had no issues with the deployment.

What do I think about the stability of the solution?

We're still experiencing stability issues with regards to collecting performance graphs.

What do I think about the scalability of the solution?

It's been able to scale for our needs.

Which solution did I use previously and why did I switch?

We migrated away from being Windows centric. Their prodcut, although quite powerful, it was Windows centric and as our fleet of Linux servers increased an alternative was needed to suport multiple operating systems.

How was the initial setup?

The installation and configuration was straightforward and not complex, but rather time consuming as there was no easy method to centrally deploy plugins or agents to hundreds of remote clients. The only complexity that we encountered was in the network configurations for our firewalls and routing.

What about the implementation team?

All work is done in-house, and over the years it seems easier to Google for a solution to a given problem, but often enough a problem will crop up where no-one has yet found a solution. You need to learn to take notes and document everything that you. This may seem time consuming, but it will certainly save a lot more time and work in the future.

What other advice do I have?

It gets the job done, but there's a lot of room for improvement. Make sure that you clearly identify which are the mission critical services and which aren't so as to avoid cluttering the dashboard and overwhelming IT staff with too much information.

Disclosure: I am a real user, and this review is based on my own experience and opinions.
PeerSpot user
reviewer2275686 - PeerSpot reviewer
Lead Solution Consultant at a tech services company with 51-200 employees
Real User
Top 5Leaderboard
Scalable product with an easy setup process
Pros and Cons
  • "It is an open-source platform with valuable features for performance and stability."
  • "The product's stability could be even better."

What is our primary use case?

We use Nagios XI for server monitoring.

What is most valuable?

It is an open-source platform with valuable features for performance and stability.

What needs improvement?

The product's stability could be even better.

For how long have I used the solution?

We have been using Nagios XI for five years.

What do I think about the stability of the solution?

I rate the product's stability a nine out of ten. There is room for improvement.

What do I think about the scalability of the solution?

I rate the product's scalability a nine out of ten.

How was the initial setup?

The initial setup is easy. It requires a team of ten engineers to execute the process.

What about the implementation team?

Our employees implement the product with the help of third-party vendors.

What other advice do I have?

I rate Nagios XI a nine out of ten.

Which deployment model are you using for this solution?

On-premises
Disclosure: I am a real user, and this review is based on my own experience and opinions.
PeerSpot user
PeerSpot user
Engineer at a tech vendor with 51-200 employees
Real User
Only Monitoring Tool You will Need

Why is OMD a Better Choice than Zabbix or Zenoss

I spend about 3 weeks vetting through 20+ open source monitoring solutions and at the end of the process, the choices had boiled down to few major ones - OMD (best combination of open source plug-ins put together for Nagios), Zabbix, and Zenoss.

The main components of OMD are Check_MK, PNP4Nagios, Nagvis, and of course Nagios. Among these projects, Check_MK is the core of OMD that makes Nagios easy to configure, easy to scale, and mashed together all the other popular Nagios plug-ins into one unified user interface. Thus the following comparisons are done using Check_MK as the keyword, but I will also cover how other plug-ins makes OMD project stand out from the competitions.

Trend

Check_MK vs Zabbix vs Zenoss Core Trend

enter image description here
A quick Google Trend search will tell you that Check_MK is up and rising. Together with the Nagios’s community size, you can certainly find custom monitoring plug-ins created by community members and save yourself time from reinventing the wheel.

Project Health

Before you pick any open source tool for enterprise projects, you want to make sure that their code is not stale and the community is vibrant for the years to come. Active community and frequent code updates ensure your questions get answered and fast bug fixes. Free service from Ohloh will give you an overview of those aspects on open source projects. The following comparison charts are created from Ohloh.

Number of Code Commits Made by Each Project

enter image description here
Check_MK is a clear winner in this chart. It tells you that Check_MK is constantly making more improvement than the other 2 projects.

Number of Contributor of Each Project

enter image description here
In this chart, Check_MK’s contributor is increasing and will soon surpass Zabbix. And don’t forget it is standing on giant’s shoulder, the largest monitoring community - Nagios.

User Reviews

Don’t just listen to me. Here is one of the blog post that talks about why moving away from Zabbix to Check_MK.

Moving From Zabbix to Check_MK

Architecture Design Advantage

OMD

What is OMD:
OMD is a combination of best practices on how Nagios should be setup and integrated. It has incorporated all of the most popular 3rd party Nagios plug-ins in single easy to maintain, easy to install, and easy to upgrade package. Once you have your Linux server running, installing and have your OMD monitoring suite running only takes about 10 minutes with one command.

Administrators can really save time on not having to compile Nagios, or other plug-ins, trying to integrate and mess with configurations between plug-ins and Nagios. It really is a no-brainer to setup and start with.

Why use OMD instead of other flavors of Nagios combos, e.g. ?
Founded July, 2010 by a group of well known Nagios community members and Nagios addon developers
e.g. NagVis, Check_MK, PNP4Nagios, and others

Check_MK

What is Check_MK

Check_MK is an extension to the Nagios monitoring system that allows creating rule-based configuration using Python and offloading work from the Nagios core to make it scale better, allowing more systems to be monitored from a single Nagios server.

enter image description here
There are 2 significant modules that Check_MK uses to improve Nagios performance. One is called Livestatus and the other is called Livecheck.

Livestatus

Before Livestatus ☹
  • Monitoring results are stores to a single file status.dat. It becomes a bottleneck on CPU and IO for larger installation.
  • status file status is not realtime, default is to update every 10 seconds.
  • NDOUtils utilize databases for monitoring results (MySQL or PostgreSQL), but still have some severe shortcomings.
  • NDOUtils has complex setup.
  • NDOUtils needs a databases to be administered, a rapidly growing one.
  • NDOUtils eats up significant portion of your CPU resources just to keep the database up to date.
  • Some similar projects that still uses NDOUtils:
  • Regular housekeeping of the database can hang your Nagios for minutes or even an hour once a day.
After Livestatus ☺
  • Livestatus also uses Nagios Event Broker API like NDO, but it does not actively write out data. Instead, it opens a socket by which data can be retrieved on demand.
  • Livestatus imposes no measurable burden on CPU at all.
  • Livestatus produces zero disk IO when querying status data.
  • No configuration is needed. No database is needed. No administration is necessary.
  • Livestatus scales well to large installation even beyond 50,000 services.
  • Livestatus give you access to Nagios-specific data that is not available to any other methods.

Livecheck

Before Nagios 4.0, Even a perfectly tuned system rarely manages to execute more then a few thousand checks per minute.
What make things worse: while your system is getting larger, the maximum check rate is even getting worse. The more hosts and services your system manages, the less checks per second it will be able to perform. Why?
Existing Problems of Nagios (before Nagios 4.0) ☹
  • Each new check creates a new fork
  • The new process prepare everything needed to execute the check plug-in, then fork the second time when ready
  • Forking is costly even for highly optimized Linux kernel
  • The forking of Nagios core (before v.4.0) does not scale on multiple CPUs (single thread process).
  • you can well run into a situation where your powerful 16-CPU server is limited to 100 Checks per second while most of its CPU cores are idle most of the time.
How does Livecheck solve those bottlenecks ☺
  • It uses a number of helper processes. The core communicate with each helper through a Unix socket (that does not appear in file system).
  • Only a small helper program is forked instead of the complete Nagios monitoring core.
  • The helper forks distribute over all available CPUs instead of single CPU.
  • The total process VM size of Livecheck is about 100KB only!
  • Inline implementation of check_icmp (PING tests). To give you an idea of how much improvement this has done, here is a benchmark example using dual core 2800 MHz CPU:
    • Before inline check_icmp: 300 ICMP checks per second.
    • After inline check_icmp: 2600 ICMP checks per second. The checks generated an ICMP traffic of 45Mb/s.

Nagios Monitoring Core working with the best plug-ins (Check_MK, NagVis, PNP4Nagios and etc)

enter image description here

Multisite - An Advance Web Interface for Nagios

Multisite is part of the Check_MK project as a better web UI alternative for Nagios.

A new and innovative GUI for viewing Nagios status information and controlling your monitoring system. It is based on MK Livestatus and aims at replacing the Nagios web GUI (also known as “the CGIs”). Multisite supports distributed monitoring in a very efficient way.

Zero Configuration Files with WATO

This is one of the most brilliant solutions from Check_MK project to tackle the notorious Nagios configuration disaster. Although Nagios is a flexible and powerful monitoring system, having to mess with its multi-level and confusing configuration files scares many people away. Now, there are many web interface plug-ins that try to take a stab at the issue, but WATO is by far the best that simplify the complexity of Nagios configuration while staying very flexible and more flexible by sitting on top of Check_MK.

WATO is a web based administration tool for Check_MK. It allows you to manage your hosts and services to be monitored and perfectly supports Check_MK’s mechanism of inventory to autodetect services to be checked on a host. WATO allows to move a substantial part of the daily workload from the monitoring administrator to his colleagues.

Monitoring Agent for both Linux and MS Windows

enter image description here

Responsive UI for Mobile Client

Powerful Search Function

Visual Meters with Perf-O-Meter

enter image description here

NOC with Dashboards (Thanks to PNP4Nagios & Nagvis)

PNP4Nagios

Nagvis

NagVis is a visualization addon for the well known network managment system Nagios. NagVis can be used to visualize Nagios Data, e.g. to display IT processes like a mail system or a network infrastructure.

Sample Navigation in Nagvis

Automation and Web Services for Automated Provisioning

Automation is build into Multisite. You can make web service request against Multisite to automate adding new host, enabling new service checks, or embed any of the host/service check web pages into any other websites.

This feature makes it very easy to integrate with Puppet or Chef for automatically adding new servers(hosts) and services to the monitoring system.

24/7 NOC with Flexible Notification

With Check_MK abstracting the original Nagio’s notification scheme, it has become possible to send notifications of any hosts or services to any number of people at any time.

You can even create custom script to send the notification in some creative ways like having the notification be ☎called via a VoIP server to your cell phone and read you the alert message or have the alert be sent to your ✐instant messenger.

Custom Icons

http://mathias-kettner.de/checkmk_devel_multisite_icons.html

Management and Maintenance

Distributed Monitoring

Distributed WATO allows you to manage several monitoring sites through a logically centralized WATO.
  • 1200 Check_MK installations
    • Centralized status of all 1200 stores per minute
    • Using NagVis to show all 1200 stores’ status on the map using the Geomap function
    • All stores’ overall status is aggregated through the use of the Business Intelligence function

enter image description here

Backup of Changes

  • Automatic Check_MK configuration backup on every change you make
  • Easy restoration with the Thunder icon
    enter image description here

Upgrade OMD

Business Intelligence

Available from version 1.2.3

Predictive Monitoring

  • Smart threshold that detects anomaly from daily operation
  • Set warning level based on prediction
    enter image description here

Available from version 1.2.3

Monitor Cronjobs

Before

<code>5 0 * * * root /usr/local/bin/backup >/dev/null</code>

After

<code>5 0 * * * root mk-job nightly-backup /usr/local/bin/backup >/dev/null</code>

Available from version 1.2.3

Dive in to the OMD World

I will be sharing how I install OMD, optimized web interface (Multisite), utilized passive checks, implemented 24/7 on call plan, and integrated with automated business processes. I will add link here once they become available.

  1. How to Setup OMD in 1 Hour
Disclosure: I am a real user, and this review is based on my own experience and opinions.
PeerSpot user
Chris Childerhose - PeerSpot reviewer
Chris ChilderhoseLead Infrastructure Architect at ThinkON
ExpertTop 5Real User

Wow very nice write up and detail. We are debating to switch off Nagios over to Solarwinds at the moment but I might have Management check this review.

See all 2 comments
it_user12228 - PeerSpot reviewer
Systems Administrator at a cloud solution provider with 501-1,000 employees
Vendor
I've used both Nagios and SolarWinds- different enviorments have different preferences

I've used both Nagios and SolarWinds, although I haven't messed with nagios as much as solarwinds. Solarwinds had some nice features for monitoring and I learned a lot about it in the short time that I actually worked with it. Nagios just seems to work and when a server is down, I investigate. Solarwinds seemed to have more issues but that could have been because it was running on a server 2003 box and possibly old hardware where as the company I'm working at now runs nagios on a linux box with some decent hardware. Again, I haven't delved deep into Nagios and it's possible that what I'm looking at is just a webpage front end to nagios that not everybody uses but it's still pretty nice regardless.

Not sure if different environments have different prefferences but the company I was working at that used solarwinds was an ISP. The company that uses nagios is a web hosting company. I've also seen a linux admin at a previous job use nagios so it may be that nagios is more popular among linux if not servers altogether.

Disclosure: I am a real user, and this review is based on my own experience and opinions.
PeerSpot user
it_user2652 - PeerSpot reviewer
it_user2652Project Manager at a non-tech company with 10,001+ employees
Top 20Vendor

Can we use Solarwinds to monitor linux as well as solaris operating systesm?

it_user1104738 - PeerSpot reviewer
R&amp;D Support &amp; Monitoring, IT Production at a tech vendor with 51-200 employees
Real User
Enables us to monitor servers but the visibility needs improvement
Pros and Cons
  • "The features I've found the most useful are the plug-ins, the fact that you can connect almost everything to it. That's very useful."
  • "There's room for improvement in the visibility, and in the ability to extract information. Stuff like this should be more simple."

What is our primary use case?

We primarily use this solution to monitor servers and services.

What is most valuable?

The features I've found the most useful are the plug-ins, the fact that you can connect almost everything to it. That's very useful.

What needs improvement?

There's room for improvement in the visibility, and in the ability to extract information. Stuff like this should be more simple. Going over the data and the information is complicated at the moment. The management of the commands is also a little complicated.

For how long have I used the solution?

I have been using this solution for about half a year.

What do I think about the stability of the solution?

It's stable.

How are customer service and technical support?

I haven't had any experience with technical support.

How was the initial setup?

The initial setup was straightforward.

What other advice do I have?

In regards to the interface, obviously it can always get even better but at the moment it's functional. It's a good tool. It's very easy to use, very straightforward, and I would recommend it. I would rate this solution a seven out of 10. I like it but of course, it could be better. The product detail should look better, and again, being able to find information on the search is not good enough. I think to change that would improve it very much.

Disclosure: I am a real user, and this review is based on my own experience and opinions.
PeerSpot user
PeerSpot user
Senior Manager of Engineering at a tech services company with 501-1,000 employees
Consultant
The Reactor helped improved our script automation and self-response.

Valuable Features:

It has a lot of flexibility for customization and a wide range of metrics. Also, it is opensource and has a big community.

Improvements to My Organization:

The Reactor helped improved our script automation and self-response. XI has provided us with top flexibility for heterogeneous systems monitoring.

Room for Improvement:

Nagios needs to improve their incident manager. Currently, it isn't good for ITIL in my opinion. The Reactor is good to go, but XI needs to improve its reporting functionality.

Use of Solution:

We use bothe Nagios XI and Nagios Reactor.

Deployment Issues:

We have had no issues with the deployment.

Stability Issues:

There have been no performance issues.

Scalability Issues:

It's been able to scale for our needs.

Other Advice:

I would advise that you create a lab with Nagios Core and test what you really need. Although it's exciting to use all the products, only a few are really important in your IT structure. When you are confident with scripting and MIBS integration, you can consider expanding it to your Enterprise systems with Nagios XI and some other modules. I would discourage you from using the ticketing system to start with and choose something more dedicated.

Disclosure: I am a real user, and this review is based on my own experience and opinions.
PeerSpot user
Buyer's Guide
Download our free Nagios XI Report and get advice and tips from experienced pros sharing their opinions.
Updated: January 2025
Buyer's Guide
Download our free Nagios XI Report and get advice and tips from experienced pros sharing their opinions.