Try our new research platform with insights from 80,000+ expert users
it_user653562 - PeerSpot reviewer
Solutions Architect at a consultancy with 1,001-5,000 employees
Consultant
May 10, 2017
Has the ability to write data at one velocity and have subscribing consumers read at different velocities.
Pros and Cons
  • "Apache Kafka is actually a distributed commit log. That is different than most messaging and queuing systems before it."
  • "When starting out with minimal hardware, Kafka has a guaranteed delivery mechanism that is very easy to set up and can handle very large data volumes, helping to speed up the timeline from prototype all the way to production volumes."
  • "The GUI tools for monitoring and support are still very basic and not very rich. There is no help in determining a shard key for performance."
  • "The GUI tools for monitoring and support are still very basic and not very rich."

How has it helped my organization?

Kafka has a guaranteed delivery mechanism that is very easy to set up. When starting out with minimal hardware, it can handle very large data volumes. When prototyping and creating a proof of concept, Kafka has helped to speed up the timeline from the prototype all the way to production volumes.

What is most valuable?

Apache Kafka is actually a distributed commit log. That is different than most messaging and queuing systems before it. I find the ability to write data at one velocity and have subscribing consumers read at different velocities to be the best feature.

What needs improvement?

The GUI tools for monitoring and support are still very basic and not very rich. There is no help in determining a shard key for performance.

What do I think about the stability of the solution?

We did not have any issues with stability.

Buyer's Guide
Apache Kafka
March 2026
Learn what your peers think about Apache Kafka. Get advice and tips from experienced pros sharing their opinions. Updated: March 2026.
884,976 professionals have used our research since 2012.

What do I think about the scalability of the solution?

We did not have any issues with scalability.

How are customer service and support?

  • Kafka is open source from LinkedIn and support comes from the community of users.
  • You can go with Confluent, the company that was founded by the original engineers from LinkedIn.
  • You can go with a cloud hosting service, like AWS EMR or Azure HDInsight.


    Which solution did I use previously and why did I switch?

    We used traditional message queues and file semaphores. There was a lot of overhead with asynchronous messages being put into an order and making sure nothing got dropped. It required a lot of code and maintenance.

    How was the initial setup?

    Since it is open source, you are on your own for setup. However, the tutorials from the Apache foundation and online sources have been an immense help.

    Getting started is very easy. The complexity of very large volumes of data and appropriate sharding, however, is difficult. There are fewer resources for tuning and best practices.

    What's my experience with pricing, setup cost, and licensing?

    When starting to look at a distributed message system, look for a cloud solution first. It is an easier entry point than an on-premises hardware solution. A lot of the complexity has already been taken care of. Both AWS and Azure have supported Kafka clusters that can be provisioned very easily.

    Which other solutions did I evaluate?

    We looked at RabbitMQ and Spark Streaming.

    What other advice do I have?

    Be sure to define the use cases as best as possible at first.

    Kafka is very good, but it is complex to support. It can handle any message size, whereas native cloud options have size limitations.

    Be sure to understand what messages will be sent and how many discrete topics will be needed.

    Be aware that you must code both producers and consumers.

    The bulk of the work is with the consumer.

    The Apache stack for Kafka is very open source. There are essentially no tools other than command line options to monitor brokers and topic health. So there are 3rd party tools that will help with that, some free, some paid – but it requires that you install agents on the servers hosting Kafka and open up ports for netbeans on the scripts that start up the Kafka services. Additionally, you also have to monitor zookeeper – which is very memory intensive. Cloud offerings that provide the whole modern data architecture stack – like AWS EMR and Azure HDInsight as well as Hortonworks and Cloudera provide a console GUI as part of each of their offerings. Also Confluent, a company founded by the Linked-In engineers that designed Kafka, also have a paid enterprise offering that has much better tools for maintain the kafka cluster. But apache Kafka with the community – you are on your own.

    Disclosure: My company does not have a business relationship with this vendor other than being a customer.
    PeerSpot user
    it_user660630 - PeerSpot reviewer
    SDET II at a tech services company with 5,001-10,000 employees
    Consultant
    May 10, 2017
    Replication and partitioning are valuable features.
    Pros and Cons
    • "Kafka is a highly scalable product."
    • "One improvement is in regards to the OS memory management."

    What is most valuable?

    • Replication, partitioning, and reliability are the most valuable features.
    • Even if one of my clusters fails, the replication factor of a topic makes sure that I have the data available for processing, so I won't lose any of it.
    • Partitioning enables me to process the parallel requests. It helps in reaching the throughput.

    What needs improvement?

    One improvement is in regards to the OS memory management. In case there are too many partitions, it runs into memory issues. Although this is a very rare scenario, it can happen.

    For how long have I used the solution?

    I have been using this product for a year now.

    What do I think about the stability of the solution?

    There were no stability issues.

    What do I think about the scalability of the solution?

    Kafka is a highly scalable product. We have not faced any scalability issues so far.

    How is customer service and technical support?

    Since it's an open source product, no technical support is available. However, the open source community is very active.

    How was the initial setup?

    The initial setup was straightforward. Just go through the Kafka documentation and it will be up and running in no time.

    What's my experience with pricing, setup cost, and licensing?

    Since it's an open source product, there is no pricing for it.

    Disclosure: My company does not have a business relationship with this vendor other than being a customer.
    PeerSpot user
    Buyer's Guide
    Apache Kafka
    March 2026
    Learn what your peers think about Apache Kafka. Get advice and tips from experienced pros sharing their opinions. Updated: March 2026.
    884,976 professionals have used our research since 2012.
    it_user647457 - PeerSpot reviewer
    Head of Engineering
    Vendor
    May 10, 2017
    Interactions among micro-services are used as input to our analytics infrastructure.
    Pros and Cons
    • "Ease of use."
    • "Kafka was at the base of our system architecture."
    • "Stability of the API and the technical support could be improved."
    • "Stability of the API and the technical support could be improved."

    How has it helped my organization?

    Kafka was at the base of our system architecture. The system was designed as an event based architecture. Almost all the interactions among micro-services and the same data are used as input to our analytics infrastructure.

    What is most valuable?

    • Scalability
    • Reliability
    • Ease of use

    What needs improvement?

    Stability of the API and the technical support could be improved.

    The Kafka API is changing quite radically with the different releases. There are many new improvements and that's good. But the inherent cost of adapting to a new version of the platform was worrying me at the time.

    The documentation was sometimes misleading, since it was describing some feature in the new version of the API rather than the one we were using.

    What do I think about the stability of the solution?

    We did not encounter any issues with stability.

    What do I think about the scalability of the solution?

    We did not encounter any issues with scalability.

    How are customer service and technical support?

    We were not completely satisfied with the technical support. We subscribed to the Confluent professional platform to receive guidance and support on development and deployment. Whilst the development side is quite well covered by their consultants, the deployment and administration is not at the same level.

    Which solution did I use previously and why did I switch?

    The previous solution was not really an equivalent one. I have been using several messaging systems, but Kafka fits us better for a more scalable system.

    How was the initial setup?

    The initial setup was straightforward.

    What's my experience with pricing, setup cost, and licensing?

    I would not subscribe to the Confluent platform, but rather stay on the free open source version. The extra cost wasn't justified.

    Which other solutions did I evaluate?

    We didn't evaluate other options, as we already had a positive experience across the team with Kafka. Everybody agreed to work with it.

    We were considering Kinesis too, since we were running on AWS. We preferred to opt for a tool with which people were more familiar.

    What other advice do I have?

    The product is easy to use. However, to leverage its power, there is a need for good knowledge of event based processing. I suggest using the massive amount of material shared by the Confluent team, or what is available online.

    Disclosure: My company does not have a business relationship with this vendor other than being a customer.
    PeerSpot user
    PeerSpot user
    Deputy General Manager, DevOps Manager at a comms service provider with 10,001+ employees
    Real User
    Apr 13, 2017
    One of the best features which I have worked with is replay.
    Pros and Cons
    • "One of the best features which I have worked with is replay."
    • "GUI for Kafka infrastructure monitoring and deployment"

    What is most valuable?

    One of the best features which I have worked with is replay.

    How has it helped my organization?

    Real-time log aggregation which was earlier done with rsync has been moved to Kafka infrastructure along with other real-time streams.

    What needs improvement?

    • GUI for Kafka infrastructure monitoring and deployment

    For how long have I used the solution?

    I have used it for two years.

    What was my experience with deployment of the solution?

    Documentation is quite comprehensive.

    What do I think about the stability of the solution?

    I found it very stable.

    What do I think about the scalability of the solution?

    No issues with scalability.

    How are customer service and technical support?

    Customer Service:

    We used the open-source version.

    Technical Support:

    We used the open-source version.

    Which solution did I use previously and why did I switch?

    We previously used rsync, which was not real-time.

    How was the initial setup?

    Initial setup was mostly intuitive (based on rsync).

    What about the implementation team?

    Implementation was in-house based on the open-source version.

    What was our ROI?

    Target was to achieve real-time service.

    Which other solutions did I evaluate?

    Before choosing this product, we did not evaluate other options.

    Disclosure: My company does not have a business relationship with this vendor other than being a customer.
    PeerSpot user
    PeerSpot user
    Java Architect at a tech vendor with 51-200 employees
    Vendor
    Apr 13, 2017
    The speed at which it publishes messages is valuable.
    Pros and Cons
    • "Excellent speeds for publishing messages faster."
    • "Excellent speeds for publishing messages faster."
    • "Too much dependency on the zookeeper and leader selection is still the bottleneck for Kafka implementation."
    • "Too much dependency on the zookeeper and leader selection is still the bottleneck for Kafka implementation."

    What is most valuable?

    Excellent speeds for publishing messages faster.

    What needs improvement?

    Too much dependency on the zookeeper and leader selection is still the bottleneck for Kafka implementation.

    What do I think about the scalability of the solution?

    RESTful API implementation actually uses the Kafka Broker to publish the messages but I am not able to find it becoming scalable. Partially, the reason might be there is no load balancer for the RESTful API web server.

    How was the initial setup?

    Setup is very much straightforward for development, and cluster setup is also easy. I am not aware of the production setup yet.

    What about the implementation team?

    I implemented it on my own.

    Disclosure: My company does not have a business relationship with this vendor other than being a customer.
    PeerSpot user
    it_user592338 - PeerSpot reviewer
    Enterprise Architect at a logistics company with 1,001-5,000 employees
    Real User
    Jan 25, 2017
    We use it for reactive architecture, track and trace, mail and parcel.
    Pros and Cons
    • "Supports more than 10,000 events/second, scalability, and replication, and it is a good product for event-driven architecture."

      What is most valuable?

      • Supports more than 10,000 events/second.
      • Scalability
      • Replication

      It is a good product for event-driven architecture.

      How has it helped my organization?

      We use Kafka for reactive architecture, track and trace, mail and parcel.

      What needs improvement?

      A good free monitor tool would be great for Apache Kafka (from Apache foundation).

      For how long have I used the solution?

      We used Kafka 0.8 for 2 years and Kafka 0.10 for 3 months.

      What do I think about the stability of the solution?

      We have not encountered any stability issues.

      What do I think about the scalability of the solution?

      We have not encountered any scalability issues.

      How are customer service and technical support?

      We haven’t used technical support.

      Which solution did I use previously and why did I switch?

      Apache MQ is different. It is a message bus (log rotate) than can manage more than 10,000 events/sec.

      How was the initial setup?

      The basic configuration is quite good. We have built a Hadoop cluster and the Kafka service was included.

      What's my experience with pricing, setup cost, and licensing?

      We use a community version.

      What other advice do I have?

      Kafka processes asynchronous exchanges, so there are no transactional interactions.

      Disclosure: My company does not have a business relationship with this vendor other than being a customer.
      PeerSpot user
      PeerSpot user
      Technical Lead/Project Manager(Consulting Apple Inc) at a tech services company with 1,001-5,000 employees
      Consultant
      Jan 25, 2017
      Topic-based eventing, scalability, and retention periods are valuable.
      Pros and Cons
      • "This is the best tool I have ever used for asynchronous, event-based solutions."
      • "If you are using the same group ID for multiple topics, it may shut down the application."

      What is most valuable?

      The most valuable features are topic-based eventing, scalability, and retention periods.

      How has it helped my organization?

      My organization is transforming by using the new SOA/eventing-based architecture. The application depends on the employees’ information events. Kafka is very helpful in implementing this. It increases the performance and gives the details to multiple external/internal teams using Kafka topics in an asynchronous manner.

      For example, if someone is moving from one office to another one, we have to update the software. While updating it, the system puts that event in a topic so that all other consumers can update that person’s new location. This can include the payroll team, the insurance team, and the hospital network.

      The retention period helps us retain the data in the topic for the configured number of days. In this example, if any of the consumers fail to consume the message from the topic, then that message will be there until the retention period ends.

      What needs improvement?

      I would like to see a more user-friendly GUI.

      For how long have I used the solution?

      We have used this solution since December, 2015.

      What do I think about the stability of the solution?

      If you are using the same group ID for multiple topics, it may shut down the application. We have faced this issue before.

      What do I think about the scalability of the solution?

      We have not had any scalability issues.

      How are customer service and technical support?

      I would give technical support a rating of 6 out of 10.

      Which solution did I use previously and why did I switch?

      We were using ActiveMQ, which is just a messaging system. We are changing because of Kafka’s added value of scalability, retention, and high payload support.

      How was the initial setup?

      The installation was somewhat straightforward.

      What's my experience with pricing, setup cost, and licensing?

      The solution is worth the money.

      What other advice do I have?

      This is the best tool I have ever used for asynchronous, event-based solutions.

      Disclosure: My company does not have a business relationship with this vendor other than being a customer.
      PeerSpot user
      it_user590451 - PeerSpot reviewer
      Lead Engineer at a retailer with 10,001+ employees
      Real User
      Jan 24, 2017
      We use the product for high-scale distributed messaging. Multiple consumers can sync with it and fetch messages.
      Pros and Cons
      • "Some of the clusters churn millions of records per seconds with ease."
      • "This product guarantees at-least-once delivery."

      What is most valuable?

      We use the product for high-scale distributed messaging. The processing capability of the product is enormous. Being a distributed platform, multiple consumers can sync with it and fetch messages.

      Another great feature is the consumer offset log which tells you where the consumer left and where he needs to start again. Consumers aren’t required to code and put extra effort to maintain the offset.

      How has it helped my organization?

      We were using another commercial messaging engine, which was not scalable unless you paid more. Each hub that we provisioned was expensive. This solution is open source, which is much easier to use and doesn’t cost us anything.

      What needs improvement?

      This product guarantees at-least-once delivery. We have asked JIRA to provide features such as at-most-once delivery to remove duplicate message consumption.

      What do I think about the stability of the solution?

      We haven’t faced any issues so far. Some of the clusters churn millions of records per seconds with ease.

      What do I think about the scalability of the solution?

      We have clustered environments and we haven’t seen any scalability issues. We can provision a new node in as little as 45 minutes.

      How are customer service and technical support?

      It is open source, so support is in our own hands. The only option is to make a new feature request through JIRA. When multiple people in the community make a request for similar feature, it gets priority.

      Which solution did I use previously and why did I switch?

      We switched from a previous solution mainly to reduce costs and to have a more scalable solution.

      How was the initial setup?

      The initial setup was a bit complex in terms of how to manage it across data centers. But once it was setup, we never faced issues.

      Which other solutions did I evaluate?

      We evaluated multiple options, such as ActiveMQ and RabbitMQ. We leaned towards this solution.

      What other advice do I have?

      I would advise others to start with non-SSL implementations and try to do PoCs. Afterwards, they should move towards more secure features.

      Disclosure: My company does not have a business relationship with this vendor other than being a customer.
      PeerSpot user