Our use case is to provide cloud organization application monitoring. I use it for insight into what host in what region has activity or what market is using Datadog to its fullest potential and utilizing that for cost. This may also help determine who is using monitoring and setting alerts or just setting up monitoring and not doing anything about it. The use case can also be to check when the host or applications are down, or if the usage of CPU, memory, etc, is too high.
Infrastructure engineer at a insurance company with 10,001+ employees
Good infrastructure, helpful logs, and useful alerts
Pros and Cons
- "It has a high-level insight into the infrastructure model of the application and provides important detailed data on the host and metrics, which is the main concern of our customers."
- "I sometimes log in and see items changed, either in the UI or a feature enabled. To see it for the first time without proper communication can sometimes come as a shock."
What is our primary use case?
How has it helped my organization?
The solution has improved our organization from a market perspective. We have multiple departments and need some time to gather that data from a grouping point of view. Grouping that data via tag or seeing the separation is easy. In addition, it provides metrics and insights for senior leadership to have a high level of usage and cost. Application teams have better insight into their application, outages, when to plan for patches, updates, etc. Also, they have a better understanding of where the data gaps may be.
What is most valuable?
The infrastructure is the most valuable. It has a high-level insight into the infrastructure model of the application and provides important detailed data on the host and metrics, which is the main concern of our customers. It provides confirmation that the layer where the application is running is monitored and will be alerted when it is down and not functional. The customers can have ease of mind knowing their metrics are accurately being measured. The value of data provided, including service name, logs, and all other pertinent details tied to the host, makes it a valuable source of data
What needs improvement?
The solution can be improved via open communication to the broader audience on what has changed and what has not changed. I sometimes log in and see items changed, either in the UI or a feature enabled. To see it for the first time without proper communication can sometimes come as a shock.
Buyer's Guide
Datadog
February 2026
Learn what your peers think about Datadog. Get advice and tips from experienced pros sharing their opinions. Updated: February 2026.
883,546 professionals have used our research since 2012.
For how long have I used the solution?
I have been using the solution for three years.
What do I think about the stability of the solution?
The stability is great.
How are customer service and support?
Technical support is great. Datadog has the resources and knowledge to tackle questions.
How would you rate customer service and support?
Positive
Which solution did I use previously and why did I switch?
I did not previously use a different solution.
How was the initial setup?
The initial setup is straightforward.
What about the implementation team?
The initial setup was handled in-house.
Which other solutions did I evaluate?
I did not evaluate any other solutions.
Which deployment model are you using for this solution?
Hybrid Cloud
If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?
Amazon Web Services (AWS)
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
Cloud Engineer at a retailer with 51-200 employees
Good logs, analytics and dashboards
Pros and Cons
- "We can handle debugging and find out why things are breaking in our applications."
- "The documentation leaves a lot to be desired for new users."
What is our primary use case?
I am using the solution for monitoring metrics, logs, traces, etc. It's mainly for making dashboards as well as monitoring our services.
We also use Datadog to help centralize our incident management to show the logs, where issues spiked, and some metrics.
We use Datadog to do troubleshooting in Kubernetes, specifically in our Azure Kubernetes service. Beyond that, we are looking to use open telemetry in tandem with Datadog to further our log-tracing efforts. In the future, this may be expanded.
How has it helped my organization?
This solution improves our organization as now we have higher visibility into our application that we otherwise would not have.
Since the Datadog agent comes in three forms, agentless, scraping, and through the API, it is very flexible. It is this flexibility in how to report our logs that keeps our logs centralized and organized.
One major drawback of Datadog is the cost. Sometimes we set up flows in place to monitor resources that end up logging more than we thought, and the bill is too high.
What is most valuable?
Dashboards have been marrying the most valuable parts of Datadog. Dashboards use metrics that are very helpful for monitoring services. I recently used metrics to monitor the number of pods in Kubernetes, the spikes in requests in Kubernetes, and overall CPU and memory usage in our Kubernetes clusters.
We can also use log analytics to further our understanding. We can handle debugging and find out why things are breaking in our applications.
The log portion of Datadog has robust features to debug the applications we are running. I really appreciate the ability to use facets to par down the logs.
What needs improvement?
The documentation leaves a lot to be desired for new users. The documentation is way too much text and has no real information just to help get people started. Sometimes it doesn't help to read an entire essay just to get a grasp on how the logs or metrics work.
For how long have I used the solution?
I've used the solution for two years.
Which deployment model are you using for this solution?
Private Cloud
If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?
Microsoft Azure
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
Buyer's Guide
Datadog
February 2026
Learn what your peers think about Datadog. Get advice and tips from experienced pros sharing their opinions. Updated: February 2026.
883,546 professionals have used our research since 2012.
Senior IT Manager at a financial services firm with 1,001-5,000 employees
Good tags, easy integration, and increases visibility
Pros and Cons
- "The full stack of integrations made it easier to monitor the different technologies and platform providers, including Software as a Service providers, that otherwise would need a lot of work and customization to be able to see what is happening."
- "The product could be improved by providing remote control to agents, enabling them to execute automation and collections without requiring another automation tool or integration."
What is our primary use case?
The main use cases are to provide visibility to costs for each product in the company as well as to consolidate all the observability in one tool. We are moving the team from being an operational team that needs to keep the tool up and running (applying patches and resolving problems) to a team that is focused on providing meaningful visibility of the systems, applications, and services of the company. We want to add value where the developers and the systems administrators are not able to focus.
How has it helped my organization?
The organization changed from having a team to operate different tools and providers to being a team worried about enabling and creating different dashboards, alerts, and automations in order to reduce downtime and increase the visibility of all the products, systems, and applications used.
We moved from a full operation team to a team that adds value to IT, finance, product, back office, and any other team that requires correct information about the services provided while providing the possibility for them to create their own views and dashboards.
What is most valuable?
The tags are quite useful. They are providing the capability to give meaning to on-premises hardware (since it was not possible outside of cloud solutions and containers) as well to tag traces and logs.
The full stack of integrations made it easier to monitor the different technologies and platform providers, including Software as a Service providers, that otherwise would need a lot of work and customization to be able to see what is happening. We'd also need to use several other separate tools that would require an increase in the required staff to operate them. Datadog gave us the opportunity to have a single platform for observability.
What needs improvement?
The product could be improved by providing remote control to agents, enabling them to execute automation and collections without requiring another automation tool or integration.
Also, there is a lot of space for the FinOps discipline. For example, it could potentially provide better and richer information for the teams to check the costs and optimize the product.
For how long have I used the solution?
I've used the solution for one year.
What do I think about the stability of the solution?
The stability is very good even though we have had some minor problems recently.
What do I think about the scalability of the solution?
The scalability is very good. We've had no problems until now.
How are customer service and support?
Technical support is good. That said, we had some cases that needed to be escalated to get to a faster resolution.
How would you rate customer service and support?
Positive
Which solution did I use previously and why did I switch?
We previously used AppDynamics. The tool was not providing good system visibility as it was limited and had a very high cost.
How was the initial setup?
The initial setup is somewhat complex. There is a need to create a new automation to install and deploy agents that needs to consider the required security for a financial company.
What about the implementation team?
We handled the implementation in-house.
What was our ROI?
The ROI is still being calculated.
What's my experience with pricing, setup cost, and licensing?
Users need to be aware of licensing control. With autodiscovery, the product can begin to come at a high cost.
Which other solutions did I evaluate?
We also looked into Splunk, ELK, and Dynatrace.
Which deployment model are you using for this solution?
On-premises
If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?
Amazon Web Services (AWS)
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
Devops Engineer II at a comms service provider with 11-50 employees
Great CPU profiler and lots of features but can be overwhelming
Pros and Cons
- "Even if we don't end up using Datadog, it revealed problems and optimizations to us that weren't obvious before."
- "The sheer amount of products that are included can be overwhelming."
What is our primary use case?
We use the solution for monitoring our logs across distributed clusters. Right now, we have an Elasticsearch solution that is tied to each platform (our product is a PaaS solution).
We are looking at moving to a single pane of glass solution, which Datadog would be good for (plus, we could wrap up other tools like Prometheus, Grafana, Pagerduty, Pingdom, and more). We want to be able to have Datadog running on one single cluster and ingesting and processing logs from all our distributed clusters.
How has it helped my organization?
So far, we are just in the evaluation stages so it's hard to say how it's improved out organization. However, one positive impact it had is it's been just showing us an example of how to build in observability, metrics, tracing, etc., in a better way.
Even if we don't end up using Datadog, it revealed problems and optimizations to us that weren't obvious before. One potential reason why it may not help us is that we have strict rules around log parsing and may not be able to send it to an external organizaton for ingestion/processing.
What is most valuable?
The CPU profiler has been interesting even though it isn't our core use case.
We are finding that Datadog has way more offerings than originally expected, so we are constantly finding new parts of it that would be convincing to use.
The log and ingestion are very similar to our current Elasticsearch setup. We find the tracing and overall integration/ecosystem to be the most valuable part. Basically, the CPU profiler is a good example of a value add for a problem we knew we had yet was low priority and had hacky workarounds. The value proposition is in the ecosystem as a whole.
What needs improvement?
The sheer amount of products that are included can be overwhelming.
The solution requires better overarching UI, which would make things clearer. Even though I generally dislike the AWS UI, it makes the different services very clear, and it also makes where you are at any given point clear.
The sidebar for all the different services is a bit much.
I also found the tagging of logging pipelines to be a bit tedious. It would be great if, once marked up, it would automatically be a first-class citizen in Datadog.
For how long have I used the solution?
We are still in the evaluation stage and have used it less than one month.
What do I think about the stability of the solution?
The stability looks good so far.
What do I think about the scalability of the solution?
It seems easier to scale and build app functionality across multiple teams rather than other solutions.
Which solution did I use previously and why did I switch?
We have used Elasticsearch, Grafana, and Prometheus. We are still evaluating Datadog.
What was our ROI?
The product has provided good ROI by saving development time as well as time managing setting up ES.
What's my experience with pricing, setup cost, and licensing?
It is somewhat expensive compared to open-source options.
Which other solutions did I evaluate?
We evaluated Elasticsearch, Grafana, and Prometheus. We are still evaluating Datadog.
Which deployment model are you using for this solution?
Public Cloud
If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?
Amazon Web Services (AWS)
Disclosure: My company has a business relationship with this vendor other than being a customer. evaluator
Software Engineer at a comms service provider with 11-50 employees
Industry-standard with good profiling and helpful alerts
Pros and Cons
- "The biggest thing I liked was the combination of all the things - monitoring, log aggregation, and profiling."
- "It can be overwhelming for new people as it has a lot of features."
What is our primary use case?
We use different tools for log collection and monitoring. Using Datadog will combine different use cases into one product that will be easier to manage.
The tools we use are open-source, so there is no commercial support. Having customer support would be ideal since we're a small team.
Profiling would be another great feature to have. Currently, it's manual. Having Datadog would give us a standard, and we don't have to do much manual work.
How has it helped my organization?
It will solve a lot of our problems. We have different tools for each of them in our organization; they are open-source and therefore not very well maintained with there is no customer support.
Having an industry-standard product such as Datadog would be ideal for us as we are short on manpower. Since this is a managed all-in-one product with readily available support, we will be able to focus on application logic rather than figuring out why a tool isn't working.
What is most valuable?
The biggest thing I liked was the combination of all the things - monitoring, log aggregation, and profiling. We have different tools for each of them in our organization and all of them are open-source. These are not very well maintained and there is no customer support.
Having an industry-standard product is ideal for us as we are short on manpower. Profiling is another amazing feature. Currently, we rely on some open-source solutions, and it's all done locally. Having it done on Kubernetes would give us more insights and help with performance. Alerting is again a nightmare for us. Datadog solves all of these issues.
What needs improvement?
It can be overwhelming for new people as it has a lot of features. The UI could certainly be improved. Having less information with better organization could help newcomers. I haven't seen the documentation, however, a well-organized documentation would invite many varied users.
For how long have I used the solution?
I've been using the solution for three years.
Which deployment model are you using for this solution?
Public Cloud
If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?
Amazon Web Services (AWS)
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
Sr Platform Engineer at a pharma/biotech company with 11-50 employees
Good logging with lots of great integrations and an interesting dashboard
Pros and Cons
- "Datadog has a lot of features to be able to drill down deep into the swath of logs that our platforms generate."
- "Some of the interface is still confusing to use."
What is our primary use case?
We use it mostly for logging log messages from our Kubernetes and EC2 instances, for example, system messages and errors. Also, we want log messages from our firewalls and other network infrastructure in case of network issues. We intend to use it for application logging, et cetera, to get insight into internal problems in the applications in Kubernetes pods. We want to use it for monitoring in case of system problems and hardware failures so that it can notify us.
How has it helped my organization?
It's good to have a single location for all the logs. If you have logs coming from a whole lot of sources, it makes it hard to find where the problem lies.
We had to spend a lot of time logging into various systems and pursuing a billion different log files looking for something that stands out as a possible cause of the issue. That can take a lot of time and doesn't give much visibility into the possible interactions between systems.
What is most valuable?
Datadog has a lot of features to be able to drill down deep into the swath of logs that our platforms generate.
It has a lot of ability to make fancy and deep searches using regular expressions and to graph them into useful and interesting dashboard graphs.
The plethora of built-in/downloadable integrations make it much easier to set up for our platforms. Otherwise, we'd have to parse the log files ourselves, which would take a great deal of effort. Had to do it before when had to use an ELK stack for logging, which was painful.
What needs improvement?
Some of the interface is still confusing to use. It has many features, and it takes a lot of effort to figure out what they all mean. Maybe having tooltips or something would be helpful. Also, some of the integrations are better than others.
For how long have I used the solution?
I've used the solution for a month.
What do I think about the stability of the solution?
The solution seems very stable.
Which solution did I use previously and why did I switch?
Have used an ELK stack before. However, it took a lot of effort to maintain, and parsing the logs was difficult.
How was the initial setup?
We implemented the solution in-house.
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
SRE at a computer software company with 51-200 employees
Great for log aggregation, searching, and system monitoring
Pros and Cons
- "The ability to easily drill down into log queries quickly and efficiently has helped us to resolve several critical incidents."
- "Datadog could always lower the price!"
What is our primary use case?
We are using Datadog for server metrics, log aggregation and searching, system monitoring, alerting the team about errors, and dashboards for our developers. It's used by the Site Reliability Engineering team and Management of all levels.
It's assisting us in proving SOC II compliance.
We're looking to improve our usage of Datadog's RUM and APM components to get better and more performance insights on our production environments.
We're also looking to leverage more synthetic monitors and runbooks for anyone responding to incidents.
How has it helped my organization?
The ability to easily drill down into log queries quickly and efficiently has helped us to resolve several critical incidents so far this year, and we heavily rely on a series of dashboards showing us various queues and load on CPU and memory for servers.
We also have a view of the information required when we begin the patch and/or upgrade processes.
I've also set up several monitors to alert the Site Reliability Engineering team when various metrics show a server might be reaching capacity. We use it to send an email suggesting we increase the size of the cloud instance.
What is most valuable?
The ability to easily drill down into log queries quickly and efficiently has helped us to resolve several critical incidents. We heavily rely on dashboards that are showing us various queues and load on CPU and memory for servers.
We also have a view of the information required when we begin the patch and/or upgrade processes.
I've arranged several monitors to alert the Site Reliability Engineering team when various metrics show a server that might be reaching capacity. We use it to send an email suggesting we increase the size of the cloud instance.
What needs improvement?
Datadog could always lower the price! In general, more demos online and maybe more free hands-on tutorials for basic functionality would be good for less technical users.
I would also prefer more chances to amend the contract more than twice a year. As a smaller but growing company, it can be difficult to adequately predict demand.
For how long have I used the solution?
I've used the solution for more than three years.
Which deployment model are you using for this solution?
Public Cloud
If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?
Amazon Web Services (AWS)
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
Senior Site Reliability Architect at a tech vendor with 1,001-5,000 employees
Reduces debugging time, with good distributed tracing and useful RUM
Pros and Cons
- "We have hundreds of microservices, and knowing how top-level requests weave throughout all of them is invaluable."
- "There is occasional UI slowness and bugs."
What is our primary use case?
We use Datadog for general observability into our infrastructure, as well as running analytics queries for our SLI/SLO platform. This helps all of our teams be informed of how well their products are actually performing in production, and aim their efforts at the thing that will provide the highest ROI.
We also use it for general monitoring and alerting during load tests and service releases to detect any issues related to the deployments. This helps us maintain our high contractual uptime promises to our clients.
How has it helped my organization?
It has drastically reduced the amount of time we spend on debugging issues and tracking down the root causes of incidents. What might have taken days or hours with separate vendors in the past (or even single vendors with terrible UI) is now quick and easy.
We've often gone from detecting an incident to identifying the needed fix within ten minutes or less and covered multiple domains like APM, Logs, Database performance monitoring, etc., in just a few clicks. This is extremely powerful.
What is most valuable?
Distributed tracing is the most valuable feature. We have hundreds of microservices, and knowing how top-level requests weave throughout all of them is invaluable.
At one glance, we can clearly see which service is slow and then switch over to the infrastructure view or container view to debug why the slowness is happening. This is true of all their other integrated products as well; the more you add, the more insights you get when looking at traces.
We also use RUM extensively. This helps us cover the last mile of application performance. Without it, we wouldn't know if our browser applications were functioning slowly for our users.
What needs improvement?
There is occasional UI slowness and bugs. While the Datadog UI is generally miles above its competitors, there are a few cases where it falls short or has started to slow down over time. They also occasionally make poor UI redesign choices. They should continue focusing on this area to maintain the high standard they started out with.
For how long have I used the solution?
I've used the solution for five years.
What do I think about the stability of the solution?
We've never had major stability issues.
What do I think about the scalability of the solution?
Scalability has never been an issue, although there is occasionally UI slowness.
How are customer service and support?
Support via tickets is absolutely terrible. It's the one obvious bad spot for Datadog. If we didn't have direct relationships with many of their product managers, our experience would be much worse.
How would you rate customer service and support?
Negative
Which solution did I use previously and why did I switch?
We previously used New Relic. It had a terrible UI and the integration between products was not great. Datadog is miles ahead of them and is continuing to increase that distance.
How was the initial setup?
The initial setup is straightforward, and the docs are done well.
What about the implementation team?
We managed the implementation in-house.
What was our ROI?
Our ROI is high.
What's my experience with pricing, setup cost, and licensing?
I'd advise users to negotiate rates. Datadog's off-the-shelf rates are pretty high.
Which other solutions did I evaluate?
We have only used and looked into New Relic.
Which deployment model are you using for this solution?
Public Cloud
If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?
Amazon Web Services (AWS)
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
Buyer's Guide
Download our free Datadog Report and get advice and tips from experienced pros
sharing their opinions.
Updated: February 2026
Product Categories
Cloud Monitoring Software Application Performance Monitoring (APM) and Observability Network Monitoring Software IT Infrastructure Monitoring Log Management Container Monitoring AIOps Cloud Security Posture Management (CSPM) AI ObservabilityPopular Comparisons
Wazuh
SentinelOne Singularity Cloud Security
Zabbix
Dynatrace
Splunk Enterprise Security
Microsoft Defender for Cloud
Snyk
Prisma Cloud by Palo Alto Networks
Darktrace
IBM Security QRadar
New Relic
Splunk AppDynamics
Elastic Security
PRTG Network Monitor
Buyer's Guide
Download our free Datadog Report and get advice and tips from experienced pros
sharing their opinions.
Quick Links
Learn More: Questions:
- Datadog vs ELK: which one is good in terms of performance, cost and efficiency?
- Any advice about APM solutions?
- Which would you choose - Datadog or Dynatrace?
- What is the biggest difference between Datadog and New Relic APM?
- Which monitoring solution is better - New Relic or Datadog?
- Do you recommend Datadog? Why or why not?
- How is Datadog's pricing? Is it worth the price?
- Anyone switching from SolarWinds NPM? What is a good alternative and why?
- Datadog vs ELK: which one is good in terms of performance, cost and efficiency?
- What cloud monitoring software did you choose and why?
















