
Apache Hadoop vs BigQuery comparison

 

Comparison Buyer's Guide

Executive Summary

Review summaries and opinions

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

ROI

Apache Hadoop (sentiment score: 6.5)
Apache Hadoop offers cost-effective storage and processing, with varying returns based on analytics sophistication and workload optimization.
BigQuery (sentiment score: 8.6)
Organizations saved costs and improved performance with BigQuery, achieving significant returns despite an initial learning period.
 

Customer Service

Apache Hadoop (sentiment score: 6.5)
Apache Hadoop's support varies, with high satisfaction from vendor packages, responsive teams, and helpful documentation and community.
BigQuery (sentiment score: 7.1)
Google BigQuery support is generally reliable and agile but lacks direct engagement compared to competitors like Teradata.
One reviewer rated the customer support ten out of ten.
 

Scalability Issues

Apache Hadoop (sentiment score: 7.6)
Apache Hadoop offers scalable data management for large-scale deployments, efficiently supports diverse users, and adapts across industries.
BigQuery (sentiment score: 7.9)
BigQuery excels in scalability and performance for large operations but may be costly for smaller businesses.
The scalability is definitely good: we are migrating to the cloud because our on-premises machines can no longer support the large database we need.
 

Stability Issues

Apache Hadoop (sentiment score: 7.4)
Apache Hadoop is stable, especially newer versions, with occasional issues in setup, memory, and online data ingestion.
BigQuery (sentiment score: 8.5)
BigQuery is praised for stability, reliability, and performance but has minor glitches with room for improvement in some areas.
 

Room For Improvement

Apache Hadoop requires enhanced compatibility, improved usability, real-time processing, better security, modern interfaces, and cost-effective solutions to boost adoption.
BigQuery's drawbacks include special character restrictions, high pricing, integration issues, and needed improvements in user interface and support.
Troubleshooting requires opening each pipeline individually, which is time-consuming.
In general, if I know SQL and start playing around, it will start making sense.
 

Setup Cost

Apache Hadoop is cost-effective for large-scale deployments, but smaller enterprises face higher expenses despite potential cloud cost savings.
BigQuery's pricing is flexible, based on usage, with low storage costs, and customizable to enterprise needs within Google Cloud.
The price is perceived as expensive, rated at eight out of ten in terms of costliness.
 

Valuable Features

Apache Hadoop offers cost-efficient, scalable data processing with HDFS, supporting large datasets and seamless integration with tools like Spark.
BigQuery provides scalable, fast, cost-effective data analytics with seamless GCP integration and supports complex queries and various data types.
It is really fast because it can process millions of rows in just a matter of one or two seconds.
BigQuery processes a substantial amount of data, whether in gigabytes or terabytes, swiftly producing desired data within one or two minutes.
 

Categories and Ranking

Apache Hadoop
Average Rating: 7.8
Reviews Sentiment: 6.8
Number of Reviews: 39
Ranking in other categories: Data Warehouse (8th)

BigQuery
Average Rating: 8.2
Reviews Sentiment: 7.3
Number of Reviews: 40
Ranking in other categories: Cloud Data Warehouse (4th)
 

Featured Reviews

Sushil Arya - PeerSpot reviewer
Provides ease of integration with the IT workflow of a business
When working with Kafka, I saw that the data arrives incrementally. Incremental data processing is still not very effective in Apache Hadoop. Data that is already in place can be processed very effectively, but data that arrives every second cannot. If you want to know the location of something every second, for example, that kind of data is not processed effectively in Apache Hadoop, so the capability to process incremental data is something the tool should look into. Licensing cost is another area where improvements are required. If the tool offered a pay-per-use licensing structure, similar to how AWS operates, organizations could get the look and feel of Apache Hadoop. The setup process also has scope for improvement. It is not very simple because during setup we need to do all the server settings, including port listing and firewall configurations. Compared with other products on the market, it could be made simpler. The product's technical support has certain shortcomings as well: the time frame for resolution needs to improve, and so does the support team's overall communication.
VikashKumar1 - PeerSpot reviewer
Easy to maintain and provides high availability
Since I used BigQuery within the GCP cloud environment, I'm not sure whether we can go through IDEs like IntelliJ or DBeaver that we use to connect with databases. Instead of connecting directly to BigQuery, we connect to GCP, then Cloud Run, and then BigQuery, which is a long process. Sometimes we face issues, bugs, and defects. When working from home, we must first connect to a VPN to check data issues, which then allows us to connect to the cloud. After logging in to the cloud, we search for the service we need and then go to BigQuery. After that, we analyze the issues in a table. It would be very helpful to have a tool that we could install on a MacBook or Windows system. Once we open this tool, we could connect directly to the BigQuery server and easily perform tasks.
839,319 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews

Apache Hadoop
Financial Services Firm: 35%
Computer Software Company: 10%
University: 7%
Energy/Utilities Company: 5%

BigQuery
Computer Software Company: 17%
Financial Services Firm: 15%
Manufacturing Company: 12%
Retailer: 7%
 

Company Size

By reviewers: Large Enterprise, Midsize Enterprise, Small Business
 

Questions from the Community

What do you like most about Apache Hadoop?
It's primarily open source. You can handle huge data volumes and create your own views, workflows, and tables. I can also use it for real-time data streaming.
What is your experience regarding pricing and costs for Apache Hadoop?
The product is open-source, but some associated licensing fees depend on the subscription level. While it might be free for students, organizations typically need to pay for their subscriptions. Th...
What needs improvement with Apache Hadoop?
Hadoop lacks OLAP capabilities. I recommend adding a Delta Lake feature to make the data compatible with ACID properties. Also, video and audio streaming import issues could be improved to ensure p...
What do you like most about BigQuery?
The initial setup process is easy.
What is your experience regarding pricing and costs for BigQuery?
The price is perceived as expensive, rated at eight out of ten in terms of costliness. Still, it offers significant cost savings.
What needs improvement with BigQuery?
When I open many of the Google Cloud products, I am in an environment that I do not feel familiar with; it is a little overwhelming. In general, if I know SQL and start playing around, it will star...
 

Sample Customers

Apache Hadoop: Amazon, Adobe, eBay, Facebook, Google, Hulu, IBM, LinkedIn, Microsoft, Spotify, AOL, Twitter, University of Maryland, Yahoo!, Cornell University Web Lab
BigQuery: Information Not Available
Find out what your peers are saying about Apache Hadoop vs. BigQuery and other solutions. Updated: January 2025.