What is our primary use case?
Log aggregation was a key component for us since we have a fairly old-school app running on VMs on bare metal. We previously didn't have much insight into our logs unless we manually tunneled into each server.
The solution reduces the manual labor of troubleshooting problems in our environments server by server.
We also needed to monitor our Java app and MySQL database to understand their problems so that we could take action and resolve them.
Our use cases have since expanded to encompass all aspects of monitoring.
How has it helped my organization?
Before Datadog, all we had to go on was the gut reaction of the old guard on our team. While useful, the reactions and inherent knowledge only benefited a few folks.
Datadog has allowed us to create comprehensive dashboards and proactively send out alerts. We used the knowledge of people well versed in our products to help set up the platform and have benefited from that ever since.
The operative word here is visibility, and we've seen a huge improvement in that.
What is most valuable?
Seeing log trends and patterns, along with aggregate search, was a huge first step for us. We then began using other features of the Datadog platform by enabling APM. After that, we added other integrations.
Having visibility into our app stack and the hardware we run on has been highly beneficial.
We leverage APM, log management, and at least ten other integrations. Our DB, web servers, network, storage, and other areas are now monitored and hooked up to dashboards.
Dashboarding has also proven useful when information needs to be viewable by anyone in the organization.
What needs improvement?
Our experience has been overwhelmingly positive so far. That said, there is one area that could benefit from some polish. I want to applaud the efforts in making the UI extremely usable and approachable. However, my suggestion would be to take another look at how the menu structure is put together. Even after using the platform almost every day for months, I still find myself hunting for a service or feature in the menus.
For how long have I used the solution?
I've used the solution for around six to eight months. We've had the Datadog agents deployed across our various environments.
What do I think about the stability of the solution?
So far, we have not had any issues with stability. It should be very stable and easy to update.
What do I think about the scalability of the solution?
The solution is currently deployed on a limited scale. That said, we see the potential and benefits of deploying this in a cloud scenario.
How are customer service and support?
Customer service and the support teams have been very responsive when we need them. They are very professional.
Which solution did I use previously and why did I switch?
This was our first solution in this space.
How was the initial setup?
The initial setup steps with the agent are confusing only when you use the config files for the first time. The main file includes a lot of settings that you can also specify elsewhere, and it's not readily apparent which file to use until you dig in more.
What about the implementation team?
We did an in-house implementation.
What was our ROI?
Our ROI with Datadog has been very high. It's given us the ability to see how we're performing, which we didn't have before.
What's my experience with pricing, setup cost, and licensing?
Ensure you have your ingestion pipelines dialed in, or you'll likely spend more than you were expecting.
Which other solutions did I evaluate?
We evaluated free and open-source options. Ultimately, however, we decided that, as a small company, we didn't have the manpower to maintain them.
What other advice do I have?
There is nothing that the documentation cannot help with; it's very good.
Which deployment model are you using for this solution?
Private Cloud
Disclosure: I am a real user, and this review is based on my own experience and opinions.