Architect at Agence Française de Développement
Real User
Top 5
With phenomenal scalability, the setup phase needs to be made easier
Pros and Cons
  • "It is a stable solution...A lot of my experience indicates that Apache Kafka is scalable."
  • "The solution's initial setup process was complex."

What is our primary use case?

We use Kafka for Elastic Stack and Kafka SCRAM login.
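
For readers unfamiliar with SCRAM login, the following is a minimal sketch of the client settings it typically involves. The key names follow the librdkafka convention. The host, user name, and password are placeholders, and the helper `scram_client_config` is hypothetical; the review does not describe the reviewer's actual configuration.

```python
def scram_client_config(bootstrap_servers, username, password,
                        mechanism="SCRAM-SHA-512"):
    """Build a client config dict for SASL/SCRAM over TLS.

    Key names follow the librdkafka convention; all values are placeholders.
    """
    if mechanism not in ("SCRAM-SHA-256", "SCRAM-SHA-512"):
        raise ValueError(f"unsupported SCRAM mechanism: {mechanism}")
    return {
        "bootstrap.servers": bootstrap_servers,
        "security.protocol": "SASL_SSL",  # SCRAM credentials sent over TLS
        "sasl.mechanism": mechanism,
        "sasl.username": username,
        "sasl.password": password,
    }


# Placeholder values, for illustration only.
config = scram_client_config("broker.example.internal:9093", "svc-user", "change-me")
```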

I have many users of Apache Kafka. It's like a subject to study in enterprises. However, we have not decided if the systems should generalize Apache Kafka for every application and every IT system.

What is most valuable?

We use Kafka for mapping and ThoughtSpot data from one IT system source to the destination. We also prefer it to exchange data from our internal IT systems.


What needs improvement?

Kafka is a new method we opted to apply to our need for data exchange. Also, we use the solution's integration capabilities.

Improvement-wise, I would like the solution to have more integration capabilities. Also, the solution's setup, which is currently complex, should be made easier.


For how long have I used the solution?

I have experience with Apache Kafka.

Buyer's Guide
Apache Kafka
February 2025
Learn what your peers think about Apache Kafka. Get advice and tips from experienced pros sharing their opinions. Updated: February 2025.
838,713 professionals have used our research since 2012.

What do I think about the stability of the solution?

It is a stable solution.

What do I think about the scalability of the solution?

A lot of my experience indicates that Apache Kafka is scalable. We can have ten or even fifty hundred users on the solution. So, it's possible because we are a big enterprise.

How are customer service and support?

I have experience with Apache Kafka's technical support.


How was the initial setup?

The solution's initial setup process was complex. The deployment process took three or four years.

Right now, I can't deliver the planning process required for deployment.

For deployment and maintenance, we have a manager and an operational person. However, I can't give an exact count of the people required for deployment and maintenance.

What other advice do I have?

To be able to recommend Kafka to others, especially considering every context, we will have to set a benchmark and compare Kafka with other tools.

I rate the overall solution a seven out of ten.


Which deployment model are you using for this solution?

On-premises
Disclosure: I am a real user, and this review is based on my own experience and opinions.
PeerSpot user
Silvio Lucas Pereira Filho - PeerSpot reviewer
Senior Tech Lead at RecargaPay
Real User
Useful customization flexibility, processes multiple requests simultaneously, and reliable
Pros and Cons
  • "We appreciate the ability to persistently and quickly write data, as well as the flexibility to customize it for multiple customers. Additionally, we like the ability to retain data within Apache Kafka and use features such as time travel to access past customer data. We also value the connection between Apache Kafka and other systems, such as IBM DB2."
  • "Apache Kafka can improve by adding a feature out of the box which allows it to deliver only one message."

What is our primary use case?

We are using Apache Kafka to extract data from a Portuguese data source, utilizing an open-source project for data capture. The connector for this project is linked to both the Kafka and Confluent platforms. We then transform the extracted data and store it in Elasticsearch.

What is most valuable?

We appreciate the ability to persistently and quickly write data, as well as the flexibility to customize it for multiple customers. Additionally, we like the ability to retain data within Apache Kafka and use features such as time travel to access past customer data. We also value the connection between Apache Kafka and other systems, such as IBM DB2.

What needs improvement?

Apache Kafka can improve by adding a feature out of the box which allows it to deliver only one message.
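
One common application-side workaround for duplicate deliveries is to make the consumer idempotent. The sketch below is illustrative only and is not the reviewer's code: it assumes each message carries a unique ID, and the class `DeduplicatingHandler` is a hypothetical name. A production version would keep the seen IDs in durable storage rather than in memory.

```python
class DeduplicatingHandler:
    """Wrap a handler so each message ID is processed at most once."""

    def __init__(self, handler):
        self._handler = handler
        self._seen = set()  # in-memory only; assumed durable in real use

    def handle(self, message_id, payload):
        """Return True if the payload was processed, False if it was a duplicate."""
        if message_id in self._seen:
            return False
        self._handler(payload)
        self._seen.add(message_id)  # mark only after successful processing
        return True


processed = []
dedup = DeduplicatingHandler(processed.append)
# The third message is a redelivery of the first and is skipped.
for msg_id, payload in [("a1", "debit 10"), ("a2", "credit 5"), ("a1", "debit 10")]:
    dedup.handle(msg_id, payload)
```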

For how long have I used the solution?

I have used Apache Kafka within the last 12 months.

What do I think about the stability of the solution?

Apache Kafka is a stable solution.

What do I think about the scalability of the solution?

The scalability of Apache Kafka is good. It can process many requests simultaneously.

We have approximately 600 people using this solution in my organization.

How are customer service and support?

I have not contacted the support from Apache Kafka.

How was the initial setup?

The initial setup is relatively easy as I am using Docker and the files provided by Confluent. However, setting up Apache Kafka in a production environment is not as straightforward. I prefer to use solutions such as Confluent that already have everything preconfigured. As a developer, creating an environment for it is not a problem for me, but I think it can be challenging for those responsible for the production environment. There have been issues with data loss and other problems in the past. Configuring it for production is not easy.

My deployment was very quick because I am using it locally. We have someone else that does the cloud deployment.

What about the implementation team?

I did our local implementation and we have someone else that does the cloud deployment.

What's my experience with pricing, setup cost, and licensing?

The price of Apache Kafka is good.

I rate the price of Apache Kafka an eight out of ten.

What other advice do I have?

I don't see any major issues with using Apache Kafka. Many companies use it and it's a good solution. My advice would be to use it as a software-as-a-service rather than setting up your own cluster. This way, you can benefit from a preconfigured and maintained platform. It's better to opt for a software-as-a-service solution.

I rate Apache Kafka an eight out of ten.

Which deployment model are you using for this solution?

Public Cloud

If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?

Amazon Web Services (AWS)
Disclosure: I am a real user, and this review is based on my own experience and opinions.
PeerSpot user
Nor EL MALKI - PeerSpot reviewer
Project Manager at Leyton & Associés, SAS
Real User
Simple to scale, high performance, and low maintenance
Pros and Cons
  • "The most valuable feature of Apache Kafka is the clustering which is very easy to scale and we have multiple servers all over our platforms. It has been useful for stability and performance."
  • "Apache Kafka can improve by providing a UI for monitoring. There are third-party tools that can do it, but it would be nice if it was already embedded within Apache Kafka."

What is our primary use case?

We have a scalable architecture where we need multiple workers to handle some processing. To make it possible, the backend catches the request and puts it in a common medium, which is the queue of Apache Kafka. The workers then can share and process it.
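
The work-sharing pattern described above can be sketched as a toy model of how a topic's partitions are spread across a group of workers. This is a simplified illustration under stated assumptions: `crc32` stands in for Kafka's real key-hashing function, the round-robin assignment is only one possible strategy, and the function names are hypothetical.

```python
from zlib import crc32


def partition_for(key, num_partitions):
    """Map a message key to a partition (crc32 is a stand-in hash)."""
    return crc32(key.encode("utf-8")) % num_partitions


def assign_partitions(num_partitions, workers):
    """Spread partitions across workers round-robin, one owner per partition."""
    assignment = {worker: [] for worker in workers}
    for partition in range(num_partitions):
        owner = workers[partition % len(workers)]
        assignment[owner].append(partition)
    return assignment


assignment = assign_partitions(6, ["worker-1", "worker-2", "worker-3"])
```

Because each partition has exactly one owner in the group, two workers never process the same request, and adding workers (up to the number of partitions) spreads the load further.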

What is most valuable?

The most valuable feature of Apache Kafka is the clustering which is very easy to scale and we have multiple servers all over our platforms. It has been useful for stability and performance.

What needs improvement?

Apache Kafka can improve by providing a UI for monitoring. There are third-party tools that can do it, but it would be nice if it was already embedded within Apache Kafka.

For how long have I used the solution?

I have been using Apache Kafka for approximately two years.

What do I think about the stability of the solution?

Apache Kafka is stable. We have not had any issues.

What do I think about the scalability of the solution?

The scalability of Apache Kafka is good. We have parts of the information we use in different geographical sites and it doesn't pose any problem.

How are customer service and support?

I have not used technical support.

Which solution did I use previously and why did I switch?

I previously used RabbitMQ. We switched because Apache Kafka was more stable and had better performance.

How was the initial setup?

The initial setup of Apache Kafka was easy because it is Dockerized. However, if you were to install it yourself it would be difficult. Having it Dockerized makes it worth it. 

The first deployment took approximately two hours. The updates of the solution can be done in a matter of minutes.

What about the implementation team?

Our DevOps team in our IT department did the deployment of the solution. It was mostly virtual work. The maintenance of the solution does not take a lot of time.

What's my experience with pricing, setup cost, and licensing?

We are using the free version of Apache Kafka.

What other advice do I have?

We had a good experience with the solution; the maintainability and scalability are good. I would recommend the solution to others.

I rate Apache Kafka a nine out of ten.

Disclosure: I am a real user, and this review is based on my own experience and opinions.
PeerSpot user
Lucas Dreyer - PeerSpot reviewer
Data Engineer at BBD
Real User
Top 5 Leaderboard
A distributed event store and stream-processing platform to build real-time streaming data pipelines and applications

What is our primary use case?


We use Apache Kafka to process messages, specifically payment type messages, and incorporate the data from those messages into our analytics and reporting. It utilizes data from additional sources in real-time for our analytics and reporting purposes.

What is most valuable?

The real-time nature and the ability to use multiple offsets are the most beneficial features of Apache Kafka for our data streaming needs. This allows us to replay the same messages using different offsets. Although I haven't set up Kafka's scalability and fault tolerance myself, I know it can be configured with redundancy and fallback options. We primarily consume the messages using different clients, so the setup for fault tolerance and redundancy is transparent to us.
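
The offset-based replay described above can be illustrated with a toy in-memory log. This is a sketch of the idea only, not Kafka's implementation: `TopicLog` is a hypothetical class, and the payment records are made-up sample data.

```python
class TopicLog:
    """Append-only log; each reader chooses the offset it starts from."""

    def __init__(self):
        self._records = []

    def append(self, record):
        """Store a record and return the offset it was written at."""
        self._records.append(record)
        return len(self._records) - 1

    def read_from(self, offset):
        """Return all records at or after the given offset, without removing them."""
        return self._records[offset:]


log = TopicLog()
for payment in ["pay-1", "pay-2", "pay-3"]:
    log.append(payment)

reporting_view = log.read_from(0)  # replays everything from the start
alerting_view = log.read_from(2)   # only sees the newest record
```

Reading does not delete anything, which is why two clients can consume the same messages independently from different positions.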

It's quite flexible and comparable to other solutions like ActiveMQ in terms of features and guarantees, especially with offsets for message handling. While ActiveMQ may be preferred in some use cases requiring guaranteed message delivery, Kafka's offset management provides similar functionality. Overall, I would recommend Kafka for real-time data streaming without hesitation.

What needs improvement?

The main challenge we faced while integrating Apache Kafka with other tools was setting up SSL and securing connections. Managing certificate changes and ensuring all clients connect smoothly, especially outside Kubernetes environments, posed ongoing challenges. Once initially set up, maintaining and sharing these security configurations became more manageable, but ensuring compatibility across different environments remained a continuous effort.
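
One small piece of the certificate management described above is noticing an upcoming expiry before clients start failing. The sketch below uses Python's standard `ssl.cert_time_to_seconds` to parse a certificate's `notAfter` string; the helper names and the 30-day threshold are assumptions made for illustration, not part of the reviewer's setup.

```python
import ssl

SECONDS_PER_DAY = 86400


def days_until_expiry(not_after, now):
    """Whole days between `now` (epoch seconds) and a certificate's notAfter date."""
    expires_at = ssl.cert_time_to_seconds(not_after)
    return int((expires_at - now) // SECONDS_PER_DAY)


def needs_rotation(not_after, now, warn_days=30):
    """True when the certificate expires within `warn_days` (assumed threshold)."""
    return days_until_expiry(not_after, now) <= warn_days
```

The `not_after` argument uses the format found in a parsed certificate, for example `"Jan  5 09:34:43 2018 GMT"`, and `now` would normally come from `time.time()`.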

For how long have I used the solution?

I have been using Apache Kafka for the last five years.

What do I think about the stability of the solution?


I would rate the stability nine out of ten.

What do I think about the scalability of the solution?

I would rate the scalability nine out of ten.

How are customer service and support?

The technical support, typical for open-source solutions, is also responsive and helpful.

How would you rate customer service and support?

Positive

Which solution did I use previously and why did I switch?

We switched to Kafka from paid solutions like IBM's MQ due to cost considerations, finding Kafka's multiple offsets and popularity advantageous.

How was the initial setup?


Installation is straightforward, taking less than an hour on Linux, though more complex setups like failover can require more effort.

What was our ROI?


Apache Kafka isn't a major part of our processing yet. Much of our processing is batch processing with data from APIs and other sources. So, it hasn't contributed significantly to return on investment. However, in other areas where we use Kafka extensively for data processing before persisting the data, it has provided quite a bit of return on investment.

What's my experience with pricing, setup cost, and licensing?


As for pricing, Kafka is open-source, so it's free to install and use.

What other advice do I have?


I rate Apache Kafka a nine out of ten for its performance, features, and community support.

Which deployment model are you using for this solution?

Hybrid Cloud
Disclosure: I am a real user, and this review is based on my own experience and opinions.
PeerSpot user
reviewer2150616 - PeerSpot reviewer
Lead Data Scientist at a transportation company with 51-200 employees
Real User
Top 5
Offers a free version but needs to improve the support offered to users
Pros and Cons
  • "The most valuable features of the solution revolve around areas like the latency part, where the tool offers very little latency and the sequencing part."
  • "One complexity that I faced with the tool stems from the fact that since it is not kind of a stand-alone application, it won't integrate with native cloud, like AWS or Azure."

What is our primary use case?

I was planning to use the tool for real-time analysis in terms of data processing and real-time analytics workflows. The real-time IoT data comes through with a few challenges, and that is for one time, so it is more like a Kafka topic. I want to actually use multiple Kafka topics where one of them can be directly fed into the data pipeline, another one can be fed into the real-time alert system, and the next one can be fed into machine learning.

How has it helped my organization?

The most valuable features of the solution revolve around areas like the latency part, where the tool offers very little latency and the sequencing part. The sequencing part actually helps to aggregate things in a way that I don't need to write another function or kind of sequence it, and I write an aggregate function to figure out the maximum value in the last ten samples.
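
The "maximum value in the last ten samples" aggregate mentioned above can be sketched in plain Python as follows. This illustration relies on samples arriving in order, as the reviewer describes; the class name `LastNMax` is hypothetical and the code is not tied to any Kafka API.

```python
from collections import deque


class LastNMax:
    """Track the maximum over the most recent n samples."""

    def __init__(self, n=10):
        # A bounded deque drops the oldest sample automatically.
        self._window = deque(maxlen=n)

    def add(self, value):
        """Record a new sample and return the current windowed maximum."""
        self._window.append(value)
        return max(self._window)


# A window of three keeps the example short; the review uses ten.
tracker = LastNMax(n=3)
results = [tracker.add(v) for v in [4, 9, 2, 1, 1, 1]]
```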

What needs improvement?

One complexity that I faced with the tool stems from the fact that since it is not kind of a stand-alone application, it won't integrate with native cloud, like AWS or Azure. Apache Kafka has another mask on it, so if users can have a direct service, like Grafana, that can actually be used as a stand-alone tool with Grafana cloud, or you can use a mix of AWS and Grafana, so there is not much difference with it. I expect Apache Kafka to have Grafana's same nature.

The product's support and the cloud integration capabilities are areas of concern where improvements are required.

For how long have I used the solution?

I have been using Apache Kafka for a year.

What do I think about the stability of the solution?

Stability-wise, I rate the solution an eight out of ten.

What do I think about the scalability of the solution?

Scalability-wise, I rate the solution an eight out of ten.

Around four people in my company use the product.

How are customer service and support?

I did not interact much with the product technical support team. I did not have dedicated support that responded to all my queries since I was using the product's free version. I rate the support a seven out of ten.

How would you rate customer service and support?

Neutral

Which solution did I use previously and why did I switch?

I have worked with Databricks. I use Databricks and Apache Kafka simultaneously.

How was the initial setup?

The product's deployment phase is neither complex nor straightforward. As the software has evolved a lot, users can actually keep it even simpler by opting for a plug-and-play model.

The solution is deployed on an on-premises model.

The solution can be deployed in two or three days.

What about the implementation team?

I was involved with the tool's installation process.

What was our ROI?

I cannot comment on the tool's ROI since I did not use it for production purposes.

What's my experience with pricing, setup cost, and licensing?

I was using the product's free version.

What other advice do I have?

I did not come across any scenarios involving fault tolerance, because data consistency issues, like missing or incorrect values, are actually part of the system from which the data is fed. As for missing values, I never tried the option that allows one to impute a missing value with another parameter.

Speaking about whether I incorporated any emerging data streaming trends in Apache Kafka workflows, for example, the utilization of AI, I would say that I use it as a local system: I have an EC2 server where I read the sample and then populate the regression and reintegration model on top of it, but that is done locally and not on the cloud.

I recommend the product to those who plan to use it. I like Kafka and Flink, and I want to actually create a system in AWS mainly for real-time streaming so that I don't need to worry about multiple data copies.

Considering the improvements needed in the product's support, and the cloud integration capabilities, while looking at the simplicity during the installation phase, I rate the tool a seven out of ten.

Which deployment model are you using for this solution?

On-premises
Disclosure: I am a real user, and this review is based on my own experience and opinions.
PeerSpot user
Mukulit Bhati - PeerSpot reviewer
CTO at InsightGeeks Solutions Pvt.
Real User
Impeccable and impressive throughput with brilliant availability
Pros and Cons
  • "Its availability is brilliant."
  • "The support on Apache Kafka could be improved."

What is our primary use case?

We use Apache Kafka for patching real-time data that we receive over a data transport layer and for putting the data into Apache Kafka. From Apache Kafka, we use several applications to subscribe to topics from different applications that we serve directly to browsers. Additionally, we use these applications inside our solution and have Apache Kafka Stream, which is connected to MongoDB.

Since we receive data in real-time consisting of IoT devices, running vehicles, their locations, their states, and their VNs, the solution is helpful.

What needs improvement?

The product could be improved with proper documentation. Proper documentation should be the SSE. We have a challenge with configuration: it isn't easy to configure a standalone Apache Kafka on-premises. It needs to be set up on-premises and surveys being provided in the market want to be excluded. Hence, for a developer, configuring Apache Kafka is very hard. It is user-friendly, but initially, we found it challenging. The documentation would be much better if documents covering different topics were provided on GitHub. As the market is growing, Spring solution is working hard to get products in the market so when Python, React JS, and Node.js came, they were lacking. But today, Spring Boot has a solid framework. So the support on Apache Kafka could be improved, but finding some configurations with Spring Boot isn't easy.

For how long have I used the solution?

We have been using this solution for over three years and are currently using the latest version.

What do I think about the stability of the solution?

The solution is stable, and the most fantastic thing about it is its throughput. For example, I have tried MQs, which also have Apache Kafka Streams. So the throughput of Apache Kafka Stream is impeccable and impressive.

What do I think about the scalability of the solution?

The solution is very scalable, and its availability is brilliant. We have approximately 32,000 people on our customer base.

How are customer service and support?

We do not have any experience with customer service and support.

Which solution did I use previously and why did I switch?

We have tried different MQs, but the subscription and charting available on this solution are better. We have used Queues previously, but this solution is more stable, so we chose it.

How was the initial setup?

The initial setup is dependent on the individual. For example, it would be straightforward if a person practices these things a lot and understands the documentation correctly. However, since most people prefer examples instead of reviewing documentation, it would be easy to set up if they find steps on the internet but difficult if they do not have examples.

What's my experience with pricing, setup cost, and licensing?

I rate the pricing for this solution an eight out of ten. It could be a bit cheaper.

What other advice do I have?

I rate this solution an eight out of ten. It is good, but the documentation could be improved.

Disclosure: I am a real user, and this review is based on my own experience and opinions.
PeerSpot user
Barista Brewing Espresso at Linkedln
Real User
Great horizontal scaling, design with library simplicity
Pros and Cons
  • "Good horizontal scaling and design."
  • "Lacks elasticity and the ability to scale down."

What is our primary use case?

Our primary use case of this solution is for data integration and for real-time data consumption. I'm a senior staff engineer for data and infrastructure and we are customers of Apache. 

What is most valuable?

I love the simplicity of the library and the design as well as the architectural concept which is like horizontal scaling.

What needs improvement?

When compared to other commercial competitors, Kafka doesn't have the ability to scale down, the elasticity is lacking in the product. The other issue for us is the delayed queue, which was available to us in the commercial software but not in Kafka. It's something we use in most of our applications for deferred processing and I know it's available in other solutions. I'd like to see some tooling support and language support in the open source version. 
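
The delayed-queue behavior the reviewer misses is often approximated in application code. The following is a minimal in-memory sketch of deferred processing, assuming messages carry a due time; `DelayQueue` is a hypothetical class and is not a Kafka feature or API.

```python
import heapq
import itertools


class DelayQueue:
    """Hold messages until their due time, releasing them in due-time order."""

    def __init__(self):
        self._heap = []
        self._counter = itertools.count()  # tie-breaker for equal due times

    def put(self, message, due_at):
        """Schedule a message to become available at `due_at` (any comparable clock)."""
        heapq.heappush(self._heap, (due_at, next(self._counter), message))

    def pop_due(self, now):
        """Remove and return every message whose due time is at or before `now`."""
        due = []
        while self._heap and self._heap[0][0] <= now:
            due.append(heapq.heappop(self._heap)[2])
        return due


queue = DelayQueue()
queue.put("retry-b", due_at=20)
queue.put("retry-a", due_at=10)
```

Calling `queue.pop_due(now)` on a timer releases only the messages that are ready, leaving later ones queued.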

For how long have I used the solution?

I've been using this solution for four years.

What do I think about the stability of the solution?

The stability is good. 

What do I think about the scalability of the solution?

The solution scales horizontally and scales better than its competitors. We have around 400 to 500 microservices consuming this cluster and the company has around 600 employees. We have four different verticals, each with around 100 engineers with 100 to 150 microservices. 90% of the microservices have a touchpoint with Kafka.

How are customer service and support?

I think the community is very good and will respond if you raise a ticket. We also use external third-party libraries that were built in GitHub. It would be good to have some direct support from Apache.

Which solution did I use previously and why did I switch?

Four years ago we were using RabbitMQ but we switched to Kafka because RabbitMQ was designed for a very narrow use case. It became difficult for us to run and maintain that server and our client libraries. We had a huge outage, so we shifted to Kafka because of the simplicity in the architecture.

How was the initial setup?

The initial setup was simple although we had a couple of hiccups. It took around a week but that was several years ago and we haven't had any problems since. Our team carried out the deployment and we currently have a few engineers who deal with maintenance. 

What's my experience with pricing, setup cost, and licensing?

We are currently using the open-source version. 

What other advice do I have?

There is room for improvement with this solution so I rate it eight out of 10. 

Which deployment model are you using for this solution?

On-premises
Disclosure: I am a real user, and this review is based on my own experience and opinions.
PeerSpot user
Teodor Muraru - PeerSpot reviewer
Developer at Emag
Real User
Top 5
Reliable solution for processing broker messages from many clients
Pros and Cons
  • "The most valuable feature is the messaging function and reliability."
  • "Something that could be improved is having an interface to monitor the consuming rate."

What is our primary use case?

I have a lot of messages, and we need to process those messages from many clients. Each client takes those messages and processes them.

I'm using the brokerage partner. I'm not storing or maintaining the application on servers. I'm just a client for the Apache Kafka server.

The solution is deployed on-prem.

How has it helped my organization?

Apache Kafka has improved our organization because it's more reliable than Rabbit. That's the whole point for us.

What is most valuable?

The most valuable feature is the messaging function and reliability.

What needs improvement?

Something that could be improved is having an interface to monitor the consuming rate. We use something, but I'm not sure if it's from Apache Kafka, or if it's a borrowed third-party solution. So, the interface for monitoring the processes is an additional feature that could be added.
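
What such a monitoring interface would display can be sketched with two small calculations: consumer lag (how far a consumer is behind the end of each partition) and consuming rate (messages committed per second). The offset numbers below are made-up sample data and the function names are hypothetical; in practice the offsets would come from the cluster or a monitoring tool.

```python
def consumer_lag(end_offsets, committed_offsets):
    """Per-partition lag: latest offset minus the consumer's committed offset."""
    return {
        partition: end_offsets[partition] - committed_offsets.get(partition, 0)
        for partition in end_offsets
    }


def consume_rate(committed_before, committed_after, interval_seconds):
    """Messages consumed per second between two committed-offset snapshots."""
    consumed = sum(
        committed_after[partition] - committed_before.get(partition, 0)
        for partition in committed_after
    )
    return consumed / interval_seconds


lag = consumer_lag({0: 120, 1: 80}, {0: 100, 1: 80})
rate = consume_rate({0: 40, 1: 50}, {0: 100, 1: 80}, interval_seconds=30)
```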

For how long have I used the solution?

I have been using this solution for two years.

What do I think about the stability of the solution?

The solution is pretty stable compared to Rabbit or other brokers. 

What do I think about the scalability of the solution?

The solution is scalable. We have about 10 departments that use Kafka in various forms. Each department might have 5 or 10 people.

We use the solution all the time. We have consumers that consume messages that come every day because we have clients and customers for the main website. All of those messages go to Kafka clients. Our backend departments consume messages from the actions of the final customers.

Which solution did I use previously and why did I switch?

We used Rabbit and we switched to Kafka because it seemed like an upgrade in ability, reliability, and in the consuming process of broker messages.

How was the initial setup?

Implementation took half a year, as everyone had to learn the solution. It was quite lengthy.

What other advice do I have?

I would rate this solution 9 out of 10.

My advice is to take some time in investigating how to implement the solution.

We required about half a year to implement it in our organization. Someone who needs to implement Kafka has to be prepared for quite a lengthy process. Don't expect implementation to be completed in a week. It takes longer because it's complex.

Which deployment model are you using for this solution?

On-premises
Disclosure: I am a real user, and this review is based on my own experience and opinions.
PeerSpot user