Try our new research platform with insights from 80,000+ expert users

Amazon Redshift vs Apache Hadoop comparison

 

Comparison Buyer's Guide

Executive Summary

Review summaries and opinions

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

ROI

Sentiment score
6.2
Amazon Redshift offers mixed returns; beneficial for high data volumes, but concerns arise over rising costs and limited data effectiveness.
Sentiment score
6.5
Apache Hadoop provides cost-effective data storage and processing, though ROI varies based on analytics use and sophistication.
 

Customer Service

Sentiment score
6.8
Amazon Redshift's support is responsive but costly, with quick responses for routine issues, though advanced support can be inconsistent.
Sentiment score
6.4
Customer service varies by Hadoop distributor, with Hortonworks rated highly; support depends on vendor, community resources, or external vendors.
It's costly when you enable support.
It's not structured support, which is why we don't use purely open-source projects without additional structured support.
 

Scalability Issues

Sentiment score
7.3
Amazon Redshift is generally praised for scalability, though experience varies with larger clusters and specific configurations.
Sentiment score
7.6
Apache Hadoop excels in scalability, allowing easy cluster expansion and efficient data handling, ideal for varied organizational needs.
The scalability part needs improvement as the sizing requires trial and error.
It is a distributed file system and scales reasonably well as long as it is given sufficient resources.
 

Stability Issues

Sentiment score
7.4
Amazon Redshift is praised for stability, high availability, and performance, despite occasional challenges with complex queries and environment changes.
Sentiment score
7.3
Apache Hadoop's stability, rated 8/10, improves with newer versions, though minor issues exist with memory and data processing.
Amazon Redshift is a stable product, and I would rate it nine or ten out of ten for stability.
Continuous management in the way of upgrades and technical management is necessary to ensure that it remains effective.
 

Room For Improvement

Amazon Redshift faces challenges in performance, integration, cost, and compatibility, needing improvements in speed, security, and serverless options.
Apache Hadoop needs improved usability, integration, security, support, and performance for efficient high-volume data processing and better community resources.
They should bring the entire ETL data management process into Amazon Redshift.
The problem with Apache Hadoop arose when the guys that originally set it up left the firm, and the group that later owned it didn't have enough technical resources to properly maintain it.
 

Setup Cost

Amazon Redshift offers competitive pricing for large enterprises, but smaller organizations might find it more costly than alternatives.
Enterprise Hadoop offers cost benefits but varies with deployment type and distribution, impacting smaller organizations more heavily.
The cost of technical support is high.
It's a pretty good price and reasonable for the product quality.
The pricing of Amazon Redshift is expensive.
 

Valuable Features

Amazon Redshift provides scalable, efficient data processing with AWS integration, offering robust analytics, security, and user-friendly features.
Apache Hadoop excels with a scalable, cost-effective system handling diverse data types, integrating with tools, and supporting big data analytics.
Amazon Redshift's performance optimization and scalability are quite helpful, providing functionalities such as scaling up and down.
Scalability is also a strong point; I can scale it however I want without any limitations.
Security configurations are implemented across all processes, such as AWS Config and GuardDuty.
Hadoop is a distributed file system, and it scales reasonably well provided you give it sufficient resources.
 

Categories and Ranking

Amazon Redshift
Average Rating
7.8
Reviews Sentiment
6.9
Number of Reviews
70
Ranking in other categories
Cloud Data Warehouse (5th)
Apache Hadoop
Average Rating
7.8
Reviews Sentiment
6.7
Number of Reviews
40
Ranking in other categories
Data Warehouse (7th)
 

Featured Reviews

Ved Prakash Yadav - PeerSpot reviewer
Works as a data warehouse system and collects data from different sources
In terms of improvement, I believe Amazon Redshift could work on reducing its costs, as they tend to increase significantly. Additionally, there are occasional issues with nodes going down, which can be problematic. We often encounter issues like someone dropping a column or changing the order of columns, which can cause synchronization problems when pushing data through our pipeline. It's a minor issue, but it can be annoying.
Sushil Arya - PeerSpot reviewer
Provides ease of integration with the IT workflow of a business
When working with Kafka, I saw that the data came in an incremental order. The incremental data processing part is still not very effective in Apache Hadoop. If the data is already there, it can be processed very effectively, especially if the data is coming in every second. If you want to know the location of some data every second, then such data is not processed effectively in Apache Hadoop. I can say that one of the features where improvements are required revolves around the licensing cost of the tool. If the tool can build some licensing structures in a pay-per-use manner, organizations can get the look and feel of Apache Hadoop. Apache Hadoop can offer a licensing structure of the product that can be seen as similar to how AWS operates. Apache Hadoop can look into the capability of processing incremental data. The tool's setup process can be a scope of improvement. Also, it is not very simple because while doing the setup, we need to do all the server settings, including port listing and firewall configurations. If we look at other products on the market, then they can be made simpler. There are certain shortcomings when it comes to the product's technical support part, making it an area where improvements are required. The time frame for the resolution is an area that needs to be improved. The overall communication part of the technical support team also needs improvement.
report
Use our free recommendation engine to learn which Data Warehouse solutions are best for your needs.
846,617 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
Educational Organization
64%
Financial Services Firm
6%
Computer Software Company
5%
Manufacturing Company
3%
Financial Services Firm
34%
Computer Software Company
11%
University
7%
Energy/Utilities Company
6%
 

Company Size

By reviewers
Large Enterprise
Midsize Enterprise
Small Business
 

Questions from the Community

How does Amazon Redshift compare with Microsoft Azure Synapse Analytics?
Amazon Redshift is very fast, has a very good response time, and is very user-friendly. The initial setup is very straightforward. This solution can merge and integrate well with many different dat...
What do you like most about Amazon Redshift?
The tool's most valuable feature is its parallel processing capability. It can handle massive amounts of data, even when pushing hundreds of terabytes, and its scaling capabilities are good.
What do you like most about Apache Hadoop?
It's primarily open source. You can handle huge data volumes and create your own views, workflows, and tables. I can also use it for real-time data streaming.
What is your experience regarding pricing and costs for Apache Hadoop?
The product is open-source, but some associated licensing fees depend on the subscription level. While it might be free for students, organizations typically need to pay for their subscriptions. Th...
What needs improvement with Apache Hadoop?
The problem with Apache Hadoop arose when the guys that originally set it up left the firm, and the group that later owned it didn't have enough technical resources to properly maintain it. This wa...
 

Overview

 

Sample Customers

Liberty Mutual Insurance, 4Cite Marketing, BrandVerity, DNA Plc, Sirocco Systems, Gainsight, Blue 449
Amazon, Adobe, eBay, Facebook, Google, Hulu, IBM, LinkedIn, Microsoft, Spotify, AOL, Twitter, University of Maryland, Yahoo!, Cornell University Web Lab
Find out what your peers are saying about Amazon Redshift vs. Apache Hadoop and other solutions. Updated: April 2025.
846,617 professionals have used our research since 2012.