- Distributed
- Persistence
- Offset management by consumer
Hadoop Technical Lead (Assistant Consultant) at a tech services company with 10,001+ employees
This is the base streaming component of our IoT platform. It needs a separate cluster and a separate administrator.
What is most valuable?
How has it helped my organization?
This is the base streaming component of our IoT platform.
In case of disaster recovery, we mirror the data in the cluster by maintaining the offsets and store the data within Hadoop 2.8 HDFS.
What needs improvement?
- It needs a separate cluster and a separate administrator to manage the Kafka cluster, adding an extra cost.
- It is challenging when data is moved to a mirror cluster, in the case of disaster recovery. It doesn't keep the offset.
For how long have I used the solution?
I have used this solution for one year.
Buyer's Guide
Apache Kafka
February 2025

Learn what your peers think about Apache Kafka. Get advice and tips from experienced pros sharing their opinions. Updated: February 2025.
838,713 professionals have used our research since 2012.
How are customer service and support?
The open source community is very strong. Also, distributors like Cloudera and Hortonworks provide paid support.
Which solution did I use previously and why did I switch?
For big data, we did not have a previous solution. I have used Microsoft MQ for building traditional systems.
How was the initial setup?
The setup was straightforward.
What's my experience with pricing, setup cost, and licensing?
This is open source with the cost of a cluster administrator.
Which other solutions did I evaluate?
We did not look at anything else. At that time, this was already accepted by the industry for streaming data processing.
What other advice do I have?
If the Hadoop distribution is MapR, then consider MapR Streaming. MapR Streaming has overcome these fundamental issues. It stores data within the MapR-FS itself. So there is extra overhead, but with a licensing cost.
Disclosure: I am a real user, and this review is based on my own experience and opinions.

Founder, CEO at a tech vendor with 1-10 employees
The ability to partition data is valuable. There are far superior and cheaper alternatives in cloud-based solutions
Pros and Cons
- "The ability to partition data on Kafka is valuable."
- "The product is good, but it needs implementation and on-going support. The whole cloud engagement model has made the adoption of Kafka better due to PaaS (Amazon Kinesis, a fully managed service by AWS)."
How has it helped my organization?
We have used Kafka for streaming customer web clicks from live sessions to understand customer behavioral patterns.
What is most valuable?
The ability to partition data on Kafka is valuable. But Kafka needs support and management. It is better to have it fully managed on the cloud.
The only reason I give Kafka as product a low rating is because there are far superior and cheaper alternatives in cloud-based solutions, where we save money on manpower, electricity, servers, datacenters, networking, etc.
In fact, this is the view I have for pretty much all open source software compared to cloud based services. They just make things cheaper, faster, scalable and manageable. Kafka is good, but Kafka as a cloud service is awesome!!
This is a relative rating (compared to cloud services), not that something is wrong with Kafka. I hope that is clear.
What needs improvement?
The product is good, but it needs implementation and on-going support. The whole cloud engagement model has made the adoption of Kafka better due to PaaS (Amazon Kinesis, a fully managed service by AWS).
What do I think about the stability of the solution?
No issues here with stability.
What do I think about the scalability of the solution?
Ah, scalability!!! We need to set up multiple servers again for handling the load, which makes Kafka not scalable, unless you subscribe to cloud services.
How are customer service and technical support?
It’s an Apache-community based support, so it is not really prioritized if you have a business issue. This is why most enterprise customers pay for cloud services.
Which solution did I use previously and why did I switch?
We didn’t have a previous solution. We started with Kafka and then switched to Amazon Kinesis (PaaS for Kafka). I think Microsoft Azure also released a competing service.
How was the initial setup?
The setup was straightforward.
What's my experience with pricing, setup cost, and licensing?
Licensing issues are not applicable. Apache licensing makes it simple with almost zero cost for the software itself.
Which other solutions did I evaluate?
We unsuccessfully, and kind of foolishly, tried Apache Camel. They were not similar in services, so we moved to Kafka rightfully, and then to AWS cloud ultimately.
What other advice do I have?
If you have a dedicated Kafka resource to implement and manage the services, then go for Apache Kafka. Otherwise, do consider cloud-based services from AWS or Azure.
Disclosure: I am a real user, and this review is based on my own experience and opinions.
Buyer's Guide
Apache Kafka
February 2025

Learn what your peers think about Apache Kafka. Get advice and tips from experienced pros sharing their opinions. Updated: February 2025.
838,713 professionals have used our research since 2012.
Enterprice Architect
A reliable message delivery system, but more connectors to different systems are needed
Pros and Cons
- "The most important feature for me is the guaranteed delivery of messages from producers to consumers."
- "More adapters for connecting to different systems need to be available."
What is our primary use case?
I am an enterprise architect involved in Big Data and integration projects using Apache Kafa. We use it for integrating our different management systems.
What is most valuable?
The most important feature for me is the guaranteed delivery of messages from producers to consumers.
What needs improvement?
More adapters for connecting to different systems need to be available.
For how long have I used the solution?
I have been using Kafka for about six months.
What do I think about the stability of the solution?
This is a stable solution and we haven't had any complexities.
What do I think about the scalability of the solution?
This solution is scalable.
Which solution did I use previously and why did I switch?
I have used IBM MQ and it is better in terms of the adapters that are available. However, the price of IBM MQ is very high.
How was the initial setup?
The initial setup is easy.
What's my experience with pricing, setup cost, and licensing?
Kafka is more reasonably priced than IBM MQ.
What other advice do I have?
Although we are deployed on-premises at the moment, we are looking to have a cloud-based deployment in a year or two.
This is a solution that I can recommend but it will take a lot of time to develop the adapters.
I would rate this solution a seven out of ten.
Which deployment model are you using for this solution?
On-premises
Disclosure: I am a real user, and this review is based on my own experience and opinions.
Java Architect at a tech vendor with 51-200 employees
The speed at which it publishes messages is valuable.
Pros and Cons
- "Excellent speeds for publishing messages faster."
- "Too much dependency on the zookeeper and leader selection is still the bottleneck for Kafka implementation."
What is most valuable?
Excellent speeds for publishing messages faster.
What needs improvement?
Too much dependency on the zookeeper and leader selection is still the bottleneck for Kafka implementation.
What do I think about the scalability of the solution?
RESTful API implementation actually uses the Kafka Broker to publish the messages but I am not able to find it becoming scalable. Partially, the reason might be there is no load balancer for the RESTful API web server.
How was the initial setup?
Setup is very much straightforward for development, and cluster setup is also easy. I am not aware of the production setup yet.
What about the implementation team?
I implemented it on my own.
Disclosure: I am a real user, and this review is based on my own experience and opinions.
Solutions Architect at a tech services company with 201-500 employees
Good support, stable, and it supports a high volume of data
Pros and Cons
- "The most valuable feature is the support for a high volume of data."
- "The initial setup and deployment could be less complex."
What is our primary use case?
We are a solution provider and Apache Kafka is being used in our client's company.
What is most valuable?
The most valuable feature is the support for a high volume of data.
What needs improvement?
The initial setup and deployment could be less complex.
Integration is one of the main concerns that we have.
For how long have I used the solution?
We have been using Apache Kafka for two years.
What do I think about the stability of the solution?
Kafka is a stable product.
What do I think about the scalability of the solution?
This is a scalable solution.
How are customer service and technical support?
The technical support is quite good, and we have no problem with it.
Which solution did I use previously and why did I switch?
We also use IBM MQ. It is also a stable product.
IBM MQ is probably easier to deploy than Kafka.
In addition to these, I have also worked with RabbitMQ.
How was the initial setup?
Deploying Kafka is more complex than IBM MQ.
Which other solutions did I evaluate?
My customer has asked me to choose between IBM MQ and Apache Kafka. I will be comparing these two solutions in the near future. My impression is that Kafka is going to better suit my customer, but I have to consider their specific needs before I can be sure.
What other advice do I have?
This is a solution that I may recommend, but its suitability depends on the needs and requirements.
I would rate this solution an eight out of ten.
Which deployment model are you using for this solution?
On-premises
Disclosure: I am a real user, and this review is based on my own experience and opinions.
SDET II at a tech services company with 5,001-10,000 employees
Replication and partitioning are valuable features.
What is most valuable?
- Replication, partitioning, and reliability are the most valuable features.
- Even if one of my clusters fails, the replication factor of a topic makes sure that I have the data available for processing, so I won't lose any of it.
- Partitioning enables me to process the parallel requests. It helps in reaching the throughput.
What needs improvement?
One improvement is in regards to the OS memory management. In case there are too many partitions, it runs into memory issues. Although this is a very rare scenario, it can happen.
For how long have I used the solution?
I have been using this product for a year now.
What do I think about the stability of the solution?
There were no stability issues.
What do I think about the scalability of the solution?
Kafka is a highly scalable product. We have not faced any scalability issues so far.
How is customer service and technical support?
Since it's an open source product, no technical support is available. However, the open source community is very active.
How was the initial setup?
The initial setup was straightforward. Just go through the Kafka documentation and it will be up and running in no time.
What's my experience with pricing, setup cost, and licensing?
Since it's an open source product, there is no pricing for it.
Disclosure: I am a real user, and this review is based on my own experience and opinions.

Buyer's Guide
Download our free Apache Kafka Report and get advice and tips from experienced pros
sharing their opinions.
Updated: February 2025
Product Categories
Streaming AnalyticsPopular Comparisons
Confluent
PubSub+ Platform
Buyer's Guide
Download our free Apache Kafka Report and get advice and tips from experienced pros
sharing their opinions.
Quick Links
Learn More: Questions:
- Which ETL tool would you recommend to populate data from OLTP to OLAP?
- What are the differences between Apache Kafka and IBM MQ?
- How do you select the right cloud ETL tool?
- What is the best streaming analytics tool?
- What are the benefits of streaming analytics tools?
- What features do you look for in a streaming analytics tool?
- When evaluating Streaming Analytics, what aspect do you think is the most important to look for?
- Why is Streaming Analytics important for companies?