The technology itself is generally very useful and the interface it great.
Senior Performance and Architecture Analyst at a manufacturing company with 10,001+ employees
Great technology with a nice interface
Pros and Cons
- "The solution is stable."
- "The technology itself is generally very useful and the interface is great."
- "There should be a clearer view of the expenses."
- "I find the setup cost to be too expensive. The setup cost for Datadog is more than $100. I am evaluating the usage of this solution, however, it is too expensive."
What is most valuable?
What needs improvement?
There should be a clearer view of the expenses.
For how long have I used the solution?
I have used the solution for four years.
What do I think about the stability of the solution?
The solution is stable.
Buyer's Guide
Datadog
May 2026
Learn what your peers think about Datadog. Get advice and tips from experienced pros sharing their opinions. Updated: May 2026.
896,942 professionals have used our research since 2012.
How are customer service and support?
I have not personally interacted with customer service. I am satisfied with tech support.
Which solution did I use previously and why did I switch?
I am using ThousandEyes and Datadog. Datadog supports AI-driven data analysis, with some AI elements to analyze, like data processing tools and so on. AI helps in Datadog primarily for resolving application issues.
How was the initial setup?
It was not difficult to set up for me. There was no problem.
What was our ROI?
I can confirm there is a return on investment.
What's my experience with pricing, setup cost, and licensing?
I find the setup cost to be too expensive. The setup cost for Datadog is more than $100. I am evaluating the usage of this solution, however, it is too expensive.
What other advice do I have?
I would rate this solution eight out of ten.Â
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
Senior DevOps Engineer at MIM Software Inc.
Great documentation and learning platform with good built-in integrations
Pros and Cons
- "Datadog's learning platform is second to none."
- "Datadog's roadmap can be a bit unpredictable at times."
What is our primary use case?
We were looking for an all-in-one observability platform that could handle a number of different environments and products. At a basic level, we have a variety of on-premises servers (Windows/Mac/Linux) as well as a number of commercial, cloud-hosted products.
While it's often possible to let each team rely on its own means for monitoring, we wanted something that the entire company could rally around - a unified platform that is developed and supported by the very same people, not others just slapping their name on some open source products they have no control over.
How has it helped my organization?
Datadog has effortlessly dropped in to nearly every stage of observability for us. We appreciate how it has robust cross-platform support for our IT assets, and for integrating hosted products, enabling integrations often couldn't be easier, with many of them including native dashboards and even other types of content packs.
Over the last couple of years, we have onboarded a number of engineering teams, and each of them feels comfortable using Datadog. This gives us the ability to build organizational knowledge.
What is most valuable?
Datadog's learning platform is second to none. It's the gold standard of training resources in my mind; not only are these self-paced courses available at no charge, but you can spin up an actual Datadog environment to try out its various features.
I just hate when other vendors try to upsell you on training beyond their (often poorly-written) documentation. Apart from that, we appreciate the variety of content that comes from Datadog's built-in integrations - for common sources, we don't have to worry about parsing, creating dashboards, or otherwise reinventing the wheel.
What needs improvement?
Datadog's roadmap can be a bit unpredictable at times. For instance, a few years ago, our rep at the time stated that Datadog had dropped its plans to develop an incident on-call platform. However, this year, they released a platform that does exactly that.
They also decided to drop chat-based support just recently. While I understand that it's often easier to work with support tickets, I do miss the easy availability of live support.
It would be nice if Datadog continued to broaden its variety of available integrations to include even more commercial platforms because that is central to its appeal. If we're looking at a new product and there isn't a native integration, then that's more work on our part.
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
Buyer's Guide
Datadog
May 2026
Learn what your peers think about Datadog. Get advice and tips from experienced pros sharing their opinions. Updated: May 2026.
896,942 professionals have used our research since 2012.
Senior Engineer at a retailer with 1,001-5,000 employees
Good monitoring capabilities, centralizing of logs, and making data easily searchable
Pros and Cons
- "The intuitive user interface has been one of the most valuable features for us."
- "While the UI and search functionality are excellent, further improvement could be made in the querying of logs by offering more advanced templates or suggestions based on common use cases."
What is our primary use case?
Our primary use of Datadog involves monitoring over 50 microservices deployed across three distinct environments. These services vary widely in their functions and resource requirements.
We rely on Datadog to track usage metrics, gather logs, and provide insight into service performance and health. Its flexibility allows us to efficiently monitor both production and development environments, ensuring quick detection and response to any anomalies.
We also have better insight into metrics like latency and memory usage.
How has it helped my organization?
Datadog has significantly improved our organization’s monitoring capabilities by centralizing all of our logs and making them easily searchable. This has streamlined our troubleshooting process, allowing for quicker root cause analysis.
Additionally, its ease of implementation meant that we could cover all of our services comprehensively, ensuring that logs and metrics were thoroughly captured across our entire ecosystem. This has enhanced our ability to maintain system reliability and performance.
What is most valuable?
The intuitive user interface has been one of the most valuable features for us. Unlike other platforms like Grafana, as an example, where learning how to query either involves a lot of trial and error or memorization almost like learning a new language, Datadog’s UI makes finding logs, metrics, and performance data straightforward and efficient. This ease of use has saved us time and reduced the learning curve for new team members, allowing us to focus more on analysis and troubleshooting rather than on learning the tool itself.
What needs improvement?
While the UI and search functionality are excellent, further improvement could be made in the querying of logs by offering more advanced templates or suggestions based on common use cases. This would help users discover powerful queries they might not think to create themselves.
Additionally, enhancing alerting capabilities with more customizable thresholds or automated recommendations could provide better insights, especially when dealing with complex environments like ours with numerous microservices.
For how long have I used the solution?
I've used the solution for five years.
What do I think about the stability of the solution?
We have never experienced any downtime.
Which solution did I use previously and why did I switch?
We previously used Sumo Logic.
Which deployment model are you using for this solution?
Public Cloud
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
Director of DevSecOps at CIBT
Consistent, centralized service for varied cloud-based applications
Pros and Cons
- "Our primary alerts, based on metrics and synthetic transactions, are the most used and relied upon for decreased MTTA/MTTR across all of our platforms. This is followed by deep log analysis that enables us to quickly and easily get to a preliminary root cause that someone on the infrastructure, platform or development teams can take and focus their attention on the precise target that Datadog revealed as the issue to be remediated."
- "They could enhance the alerting functions by creating a new feature to add direct SMS notifications, on-call rotation scheduling, etc., that could replace the need to have this as an external third-party solution integration."
What is our primary use case?
The current use case for Datadog in our environment is observability. We use Datadog as the primary log ingestion and analysis point, along with consolidation of application/infrastructure metrics across cloud environments and realtime alerting to issues that arise in production.
Datadog integrates within all aspects of our infrastructure and applications to provide valuable insights into Containers, Serverless functions, Deep Logging Analysis, Virtualized Hardware and Cost Optimizations.
How has it helped my organization?
Datadog improved our observability layer by creating a consistent, centralized service for all of our varied cloud-based applications. All of our production and non-production environment applications and infrastructure send metrics directly to Datadog for analysis and determination of any issues that would need to be looked at by the Infrastructure, Platform and Development teams for quick remediation. Using Datadog as this centralized Observability platform has enabled us to become leaner without sacrificing project timelines when issues arise and require triage for efficient resolution.
What is most valuable?
All of Datadog's features have become valuable tools in our cloud environments.
Our primary alerts, based on metrics and synthetic transactions, are the most used and relied upon for decreased MTTA/MTTR across all of our platforms. This is followed by deep log analysis that enables us to quickly and easily get to a preliminary root cause that someone on the infrastructure, platform or development teams can take and focus their attention on the precise target that Datadog revealed as the issue to be remediated.
What needs improvement?
The two areas I could see needing improvement or a feature to add value are building a more robust SIM that would include container scanning to rival other such products on the market so we do not need to extend functionality to another third-party provider. The other expands the alerting functions by creating a new feature to add direct SMS notifications, on-call rotation scheduling, etc., that could replace the need to have this as an external third party solution integration.
For how long have I used the solution?
I've been a Datadog user for almost ten years.
What do I think about the stability of the solution?
Datadog is very stable, and we've only come across a few items that needed to be addressed quickly when there were issues.
What do I think about the scalability of the solution?
Scalability is very favorable, aside from cost/budget, which limits the scalability of this platform.
How are customer service and support?
Both customer service and support need a little work, as we have had a number of requests/issues that were not addressed as we needed them to be.
How would you rate customer service and support?
Neutral
Which solution did I use previously and why did I switch?
Being an Observability SME, I have used many native and third party solutions, including Dynatrace, New Relic, CloudWatch and Zabbix. As previously mentioned, Datadog provides a superior platform for centralizing and consolidating our Observability layer. Switching to Datadog was a no-brainer when most other solutions either didn't provide the maturity of functions, or have them available, at all.
How was the initial setup?
The initial setup was very straightforward, and the integrations were easily configured.
What about the implementation team?
We implemented Datadog in-house.
What was our ROI?
For the most part, Datadog's ROI is quite impressive when you consider all of the features and functions that are centralized on the platform. It doesn't require us to purchase additional third-party solutions to fill in the gaps.
What's my experience with pricing, setup cost, and licensing?
The setup was dead simple once the cloud integrations and agent components were identified and executed. Licensing falls into our normal third-party processes, so it was a familiar feeling when we started with Datadog. Cost is the only outlier when it comes to a perfect solution. Datadog is expensive, and each add-on drives that cost further into the realm of requiring justifications to finance expanding the core suite of features we would like to enable.
Which other solutions did I evaluate?
What other advice do I have?
They should provide more inclusive pricing, or an "all you can eat" tier that would include all relevant features, as opposed to individual cost increases to let Datadog to become more valuable and replace even more third-party solutions that have a lower cost of entry.
Which deployment model are you using for this solution?
Hybrid Cloud
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
Product Engineering Manager at FMG Suite
Good logging, easy to find issues, and saves time
Pros and Cons
- "The logging in general is one of my favorite features."
- "I love to have some DD guru come in and do a department training directly at our setup."
What is our primary use case?
We use the solution for APM, AWS, Lambda, logging, and infrastructure. We have many different things all over AWS, and having one place to look is great.
We have all sorts of different AWS things out there that are in C# and Node. Having a single place to log and APM into is very important to us.
Keeping track of the cloud infrastructure is also important. We have Lambda, containers, EC2, etc.
Having a super simple interface to filter the searching for APM and logging is great. It is super easy to show people how to use. This is super important to us.
How has it helped my organization?
Finding issues quickly is super important. Being able to create dashboards and alert on issues.
Having the ability to create dashboards has really taught us how to utilize the searching part of the system. We are able to share them, and build upon them so easily. Many iterations later people are putting some solid information out there.
Alerting is also important to us. We have set up many alerts that help us spot issues in the platform before they become bigger issues. This has enabled my teams to use incidents and address the issues so they are no longer problems.
What is most valuable?
Alerting on running systems is very helpful. Finding issues is quick. We have one place for logging, searching through. Being able to save these and reference them in the future and build upon them.
The logging in general is one of my favorite features. The search is so straight forward and easy to use. Just being able to click on a field and add it to search has taught me so much about the interface, It might not be as useful without a shortcut like that to teach me the system. We have Cloudflare logs in there, and I have no idea sometimes how to filter on such a buried piece of JSON. That is where the interface helps me by clicking on the add to search I get what I need.
What needs improvement?
The "Pager Duty" replacement is something we are very interested in. We only really use pager duty to call the team when things are down.
I love to have some DD guru come in and do a department training directly at our setup. We would love to have someone come in and show us the things we could do better within our current setup.
Also saving a bit of cash would also help if there are things we are doing that are costing us. It's a big enough tool that it is tough to have someone dedicated to manage.
For how long have I used the solution?
I've used the solution for a bit over a year at this point.
What do I think about the stability of the solution?
The stability seems good here too.
What do I think about the scalability of the solution?
Scalability seems good to me. I have no complaints
How are customer service and support?
I get answers from our contact, and one team member did reach out. It went well.
How would you rate customer service and support?
Positive
Which solution did I use previously and why did I switch?
We used Loggly.
We switched because we wanted an all-in-one tool
How was the initial setup?
Some parts of our setup were tough. Some Windows container setups cost us a lot of time.
The AWS infrastructure was tough to fully turn on due to the large cost of everything being run.
What about the implementation team?
We handled the setup ourselves in-house.
What was our ROI?
This cost us more overall. ROI is hard to sell. That said, I can find issues way faster and see what is going on in my entire platform. I pay back the cost every month with productivity.
What's my experience with pricing, setup cost, and licensing?
It is going to cost you more than you think to keep everything running. We saw value in the one-for-all solution, however, it came at a premium to what we were paying.
Which other solutions did I evaluate?
We did evaluate Dynatrace.
Which deployment model are you using for this solution?
Public Cloud
If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?
Amazon Web Services (AWS)
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
Unified platform with customizable dashboards and AI-driven insights
Pros and Cons
- "The infrastructure monitoring capabilities, especially for our Kubernetes clusters, have helped us optimize resource allocation and reduce costs."
- "We'd like to see more advanced incident management capabilities integrated directly into the platform."
What is our primary use case?
Our primary use case for this solution is comprehensive cloud monitoring across our entire infrastructure and application stack.
We operate in a multi-cloud environment, utilizing services from AWS, Azure, and Google Cloud Platform.
Our applications are predominantly containerized and run on Kubernetes clusters. We have a microservices architecture with dozens of services communicating via REST APIs and message queues.
The solution helps us monitor the performance, availability, and resource utilization of our cloud resources, databases, application servers, and front-end applications.
It's essential for maintaining high availability, optimizing costs, and ensuring a smooth user experience for our global customer base. We particularly rely on it for real-time monitoring, alerting, and troubleshooting of production issues.
How has it helped my organization?
Datadog has significantly improved our organization by providing us with great visibility across the entire application stack. This enhanced observability has allowed us to detect and resolve issues faster, often before they impact our end-users.
The unified platform has streamlined our monitoring processes, replacing several disparate tools we previously used. This consolidation has improved team collaboration and reduced context-switching for our DevOps engineers.
The customizable dashboards have made it easier to share relevant metrics with different stakeholders, from developers to C-level executives. We've seen a marked decrease in our mean time to resolution (MTTR) for incidents, and the historical data has been invaluable for capacity planning and performance optimization.
Additionally, the AI-driven insights have helped us proactively identify potential issues and optimize our infrastructure costs.
What is most valuable?
We've found the Application Performance Monitoring (APM) feature to be the most valuable, as it provides great visibility on trace-level data. This granular insight allows us to pinpoint performance bottlenecks and optimize our code more effectively.
The distributed tracing capability has been particularly useful in our microservices environment, helping us understand the flow of requests across different services and identify latency issues.
Additionally, the log management and analytics features have greatly improved our ability to troubleshoot issues by correlating logs with metrics and traces.
The infrastructure monitoring capabilities, especially for our Kubernetes clusters, have helped us optimize resource allocation and reduce costs.
What needs improvement?
While Datadog is an excellent monitoring solution, it could be improved by building more features to replace alerting apps like OpsGenie and PagerDuty. Specifically, we'd like to see more advanced incident management capabilities integrated directly into the platform. This could include features like sophisticated on-call scheduling, escalation policies, and incident response workflows.
Additionally, we'd appreciate more customizable machine learning-driven anomaly detection to help us identify unusual patterns more accurately. Improved support for serverless architectures, particularly for monitoring and tracing AWS Lambda functions, would be beneficial.
Enhanced security monitoring and threat detection capabilities would also be valuable, potentially reducing our reliance on separate security information and event management (SIEM) tools.
For how long have I used the solution?
I've used the solution for two years.
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
Senior Software Engineer at angel Studios
A great tool with an easy setup and helpful error logs
Pros and Cons
- "The setup cost was minimal."
- "We did have an issue where a synthetic test was set up before the holiday break, and we were quickly charged a great amount. Our team worked with Datadog, and they were able to help us out since it was inadvertent on our end and was a user error."
What is our primary use case?
We currently have an error monitor to monitor errors on our prod environment. Once we hit a certain threshold, we get an alert on Slack. This helps address issues the moment they happen before our users notice.
We also utilize synthetic tests on many pages on our site. They're easy to set up and are great for pinpointing when a bug is shipped, but they may take down a less visited page that we aren't immediately aware of. It's a great extra check to make sure the code we ship is free of bugs.
How has it helped my organization?
The synthetic tests have been invaluable. We use them to check various pages and ensure functionality across multiple areas. Furthermore, our error monitoring alerts have been crucial in letting us know of problems the moment they pop up.
Datadog has been a great tool, and all of our teams utilize many of its features. We have regular mob sessions where we look at our Datadog error logs and see what we can address as a team. It's been great at providing more insight into our users and logging errors that can be fixed.
What is most valuable?
The error logs have been super helpful in breaking down issues affecting our users. Our monitors let us know once we hit a certain threshold as well, which is good for momentary blips and issues with third-party providers or rollouts that we have in the works. Just last week, we had a roll-out where various features were broken due to a change in our backend API. Our Datadog logs instantly notified us of the issues, and we could troubleshoot everything much more easily than just testing blind. This was crucial to a successful rollout.
What needs improvement?
I honestly can't think of anything that can be improved. We've started using more and more features from our Datadog account and are really grateful for all of the different ways we can track and monitor our site.
We did have an issue where a synthetic test was set up before the holiday break, and we were quickly charged a great amount. Our team worked with Datadog, and they were able to help us out since it was inadvertent on our end and was a user error. That was greatly appreciated and something that helped start our relationship with the Datadog team.
For how long have I used the solution?
We've been using Datadog for several months. We started with the synthetic tests and now use It for error handling and in many other ways.
What do I think about the stability of the solution?
Stability has been great. We've had no issues so far.
What do I think about the scalability of the solution?
The solution is very easy to scale. We've used it on multiple clients.
How are customer service and support?
We had a dev who had set up a synthetic test that was running every five minutes in every single region over the holiday break last year. The Datadog team was great and very understanding and we were able to work this out with them.
How would you rate customer service and support?
Positive
Which solution did I use previously and why did I switch?
We didn't have any previous solution. At a previous company, I've used Sentry. However, I also find Datadog to be much easier, plus the inclusion of synthetic tests is awesome.
How was the initial setup?
The documentation was great and our setup was easy.
What about the implementation team?
We implemented the solution in-house.
What was our ROI?
This has had a great ROI as we've been able to address critical bugs that have been found via our Datadog tools.
What's my experience with pricing, setup cost, and licensing?
The setup cost was minimal. The documentation is great and the product is very easy to set up.
Which other solutions did I evaluate?
We also looked at other providers and settled on Datadog. It's been great to use across all our clients.
Which deployment model are you using for this solution?
Private Cloud
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
Application Engineer at Discover Financial Services
Consolidates all our logs into a single place, making it easier to find errors
Pros and Cons
- "The best way it has helped us is by consolidating all our logs into a single place and making it easier to find errors."
- "Another issue that I have is with the search syntax, it could be simpler and it feels like there are too many ways to do the same things."
What is our primary use case?
We have a tech stack including all backend services written in TS/Node (mostly) and as a full stack engineer, it is crucial to keep track of new and existing errors. Our logs have been consolidated in Datadog and are accessible for search and review, so the service has become a daily tool for my work.
More recently, session replay has been adopted at my company, but I do not like it so much because the UI elements are not in their place, so it is very hard to see what the users on the web app are actually clicking on.
How has it helped my organization?
The best way it has helped us is by consolidating all our logs into a single place and making it easier to find errors. Previously using AWS Cloudwatch was cumbersome and time-consuming. One issue I do have with logs is the length of time they are on the platform. Some issues happen sporadically, so it would be good to have logs for longer than one month by default or make it a configuration.
Another issue that I have is with the search syntax, it could be simpler and it feels like there are too many ways to do the same things.
What is most valuable?
Logs search is the most valuable feature because it has consolidated all of our backend services logs into one place. Now we can see the relationship between them as requests are going from one service to other dependencies.
What needs improvement?
One issue I do have with logs is the length of time they are on the platform. Some issues happen sporadically, so it would be good to have logs for longer than one month by default or make it a configuration. I have yet to try rehydrating logs, so this might be an option I need to try. Another issue I have is with the search syntax, it could be simpler. The syntax is a bit cumbersome and there is not an intuitive to save them to look for similar searches in the future.
Finally, while my company replaced a different tool for session replay with DataDog's version, I find it clunky and in need of further improvements. For example, when troubleshooting a web portal issue, it is super important to know what the user clicked, but the elements are not where they should be in the replay.
It is also hard to find details about the sessions, and metadata such as user email, account, etc. that exist on other services with replay features.
For how long have I used the solution?
I have been using Datadof for approximately five years.
What do I think about the stability of the solution?
So far we haven't had any issues with uptime and Datadog has been available when needed.
What do I think about the scalability of the solution?
It seems to scale well as we continue to add services that need monitoring.
How are customer service and support?
I haven't had to contact support.
Which solution did I use previously and why did I switch?
Cloudwatch was not a great tool for what we need to do to troubleshoot issues.
What about the implementation team?
We deployed it in-house with intermediate expertise.
What was our ROI?
I am not sure how much we are paying, but I use the app often enough to feel like we are getting a good ROI.
Which other solutions did I evaluate?
I was not involved in the choosing process as a software engineer
Which deployment model are you using for this solution?
Public Cloud
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
Buyer's Guide
Download our free Datadog Report and get advice and tips from experienced pros
sharing their opinions.
Updated: May 2026
Product Categories
Cloud Monitoring Software Application Performance Monitoring (APM) and Observability Network Monitoring Software IT Infrastructure Monitoring Log Management Container Monitoring AIOps Cloud Security Posture Management (CSPM) AI ObservabilityPopular Comparisons
Cloudflare
SentinelOne Singularity Cloud Security
Wazuh
Splunk Enterprise Security
Zabbix
Snyk
Dynatrace
Microsoft Defender for Cloud
Darktrace
Prisma Cloud by Palo Alto Networks
IBM Security QRadar
New Relic
Splunk AppDynamics
SolarWinds NPM
Buyer's Guide
Download our free Datadog Report and get advice and tips from experienced pros
sharing their opinions.
Quick Links
Learn More: Questions:
- Datadog vs ELK: which one is good in terms of performance, cost and efficiency?
- Any advice about APM solutions?
- Which would you choose - Datadog or Dynatrace?
- What is the biggest difference between Datadog and New Relic APM?
- Which monitoring solution is better - New Relic or Datadog?
- Do you recommend Datadog? Why or why not?
- How is Datadog's pricing? Is it worth the price?
- Anyone switching from SolarWinds NPM? What is a good alternative and why?
- Datadog vs ELK: which one is good in terms of performance, cost and efficiency?
- What cloud monitoring software did you choose and why?



















