Try our new research platform with insights from 80,000+ expert users

Amazon MSK vs Cloudera DataFlow comparison

 

Comparison Buyer's Guide

Executive SummaryUpdated on Dec 17, 2024

Review summaries and opinions

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Categories and Ranking

Amazon MSK
Ranking in Streaming Analytics
5th
Average Rating
7.4
Reviews Sentiment
7.1
Number of Reviews
11
Ranking in other categories
No ranking in other categories
Cloudera DataFlow
Ranking in Streaming Analytics
14th
Average Rating
7.4
Reviews Sentiment
6.5
Number of Reviews
5
Ranking in other categories
No ranking in other categories
 

Mindshare comparison

As of April 2025, in the Streaming Analytics category, the mindshare of Amazon MSK is 7.7%, down from 9.7% compared to the previous year. The mindshare of Cloudera DataFlow is 0.9%, down from 1.5% compared to the previous year. It is calculated based on PeerSpot user engagement data.
Streaming Analytics
 

Featured Reviews

FNU AKSHANSH - PeerSpot reviewer
Streamlines our processes, and we don't need to configure any VPCs; it's automatic
We don't have many use cases involving ingesting large amounts of data and scaling up and down. We have a clear understanding of our data volume, which remains relatively constant throughout the week. While we're aware of other features Amazon MSK offers, we feel confident in our current setup. If our requirements change significantly in the future, we'll reassess our needs and consider adopting Amazon MSK.
Mohamed-Saied - PeerSpot reviewer
Efficient data integration and workflow scheduling elevate project performance
Cloudera DataFlow is used as an ETL or ELT solution within Cloudera's data pipeline. Our organization heavily relies on it for data ingestion, transformation, and warehousing. It is also used daily for operational tasks, and it integrates well within Cloudera's ecosystem for high performance and…

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"Amazon MSK's scalability is very good."
"It is a stable product."
"It offers good stability."
"The most valuable feature of Amazon MSK is the integration."
"Amazon MSK has significantly improved our organization by building seamless integration between systems."
"The scalability and usability are quite remarkable."
"The solution's technical support was helpful."
"It provides installations, scaling, and other functionalities straight out of the box."
"The initial setup was not so difficult"
"DataFlow's performance is okay."
"This solution is very scalable and robust."
"Cloudera DataFlow is fully compatible with Cloudera's ecosystem and offers high efficiency through native connectors for various ecosystems."
"The most effective features are data management and analytics."
 

Cons

"It does not autoscale. Because if you do keep it manually when you add a note to the cluster and then you register it, then it is scalable, but the fact that you have to go and do it, I think, makes it, again, a bit of some operational overhead when managing the cluster."
"Amazon MSK could improve on the features they offer. They are still lagging behind Confluence."
"The configuration seems a little complex and the documentation on the product is not available."
"It should be more flexible, integration-wise."
"In my opinion, there are areas in Amazon MSK that could be improved, particularly in terms of configuration. Initially setting it up and getting it connected was quite challenging. The naming conventions for policies were updated by AWS, and some were undocumented, leading to confusion with outdated materials. It took us weeks of trial and error before discovering new methods through hidden tutorials and official documentation."
"It would be really helpful if Amazon MSK could provide a single installation that covers all the servers."
"The cost of using Amazon MSK is high, which is a significant disadvantage, as the increase in cloud costs by 50% to 60% does not justify the savings."
"One of the reasons why we prefer Kafka is because the support is a little bit difficult to manage with Amazon MSK."
"It's an outdated legacy product that doesn't meet the needs of modern data analysts and scientists."
"Cloudera DataFlow's UI interface could be enhanced significantly. Memory handling can also be improved to be better than it is today."
"It is not easy to use the R language. Though I don't know if it's possible, I believe it is possible, but it is not the best language for machine learning."
"Although their workflow is pretty neat, it still requires a lot of transformation coding; especially when it comes to Python and other demanding programming languages."
 

Pricing and Cost Advice

"The price of Amazon MSK is less than some competitor solutions, such as Confluence."
"The platform has better pricing than one of its competitors."
"When you create a complete enterprise-driven architecture that is deployable on an enterprise scale, I would say that the prices of Amazon MSK and Confluent Platform become comparable."
"DataFlow isn't expensive, but its value for money isn't great."
report
Use our free recommendation engine to learn which Streaming Analytics solutions are best for your needs.
847,862 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
Financial Services Firm
22%
Computer Software Company
17%
Manufacturing Company
6%
Retailer
6%
University
17%
Computer Software Company
16%
Financial Services Firm
14%
Media Company
7%
 

Company Size

By reviewers
Large Enterprise
Midsize Enterprise
Small Business
No data available
 

Questions from the Community

What do you like most about Amazon MSK?
Amazon MSK has significantly improved our organization by building seamless integration between systems.
What needs improvement with Amazon MSK?
The cost of using Amazon MSK is high, which is a significant disadvantage, as the increase in cloud costs by 50% to 60% does not justify the savings. There were no other notable issues.
What is your primary use case for Amazon MSK?
We used Amazon MSK to manage high-volume third-party data entering our system. It served as a buffer when our system was unable to consume data at high speeds in real-time. The data initially went ...
What do you like most about Cloudera DataFlow?
The most effective features are data management and analytics.
What needs improvement with Cloudera DataFlow?
Cloudera DataFlow's UI interface could be enhanced significantly. Memory handling can also be improved to be better than it is today.
What is your primary use case for Cloudera DataFlow?
Cloudera DataFlow is used as an ETL or ELT solution within Cloudera's data pipeline. Our organization heavily relies on it for data ingestion, transformation, and warehousing. It is also used daily...
 

Also Known As

Amazon Managed Streaming for Apache Kafka
CDF, Hortonworks DataFlow, HDF
 

Overview

 

Sample Customers

Expedia, Intuit, Royal Dutch Shell, Brooks Brothers
Clearsense
Find out what your peers are saying about Amazon MSK vs. Cloudera DataFlow and other solutions. Updated: March 2025.
847,862 professionals have used our research since 2012.