Try our new research platform with insights from 80,000+ expert users

Amazon EMR vs Snowflake comparison

 

Comparison Buyer's Guide

Executive SummaryUpdated on Dec 18, 2024
 

Categories and Ranking

Amazon EMR
Ranking in Cloud Data Warehouse
11th
Average Rating
7.8
Number of Reviews
22
Ranking in other categories
Hadoop (3rd)
Snowflake
Ranking in Cloud Data Warehouse
1st
Average Rating
8.4
Reviews Sentiment
7.5
Number of Reviews
98
Ranking in other categories
Data Warehouse (1st)
 

Mindshare comparison

As of December 2024, in the Cloud Data Warehouse category, the mindshare of Amazon EMR is 4.4%, down from 4.6% compared to the previous year. The mindshare of Snowflake is 28.9%, up from 24.0% compared to the previous year. It is calculated based on PeerSpot user engagement data.
Cloud Data Warehouse
 

Featured Reviews

Prashant  Singh - PeerSpot reviewer
Seamless data integration enhances reporting efficiency and an easy setup
Amazon EMR has multiple connectors that can connect to various data sources. The service charges are based on processing only, depending on the resources used, which can help save money. It is easy to integrate with other services for storage, allowing data to be shifted to cheaper storage based on usage.
VivekSingh 1 - PeerSpot reviewer
Provides good data ingestion capability, but should include more AI capabilities
The solution's integration aspect is good, and all the connectors are in place. I found Snowflake similar to RDS. We use it for both data in motion and data in transit. It looks like the tool handles the data quite securely. We create ETL patterns. We ingest data from different source systems, and we have to create data pipelines. It would be useful if we could have AI features added to identify what I'm going to do with this data. It would be good if it could look at the data and help me create an automated pipeline instead of me creating a pipeline by myself. I'm from a retail background. I completed my Oracle DBA training a long time ago, about 18 years ago. I was quite familiar with the Snowflake and relational database concepts since I had already completed the Oracle ops, DBA ops, OCP, and OPA courses. For me, it was a journey similar to when I shifted from Oracle RDS to Snowflake. Although I was quite familiar with most of the concepts, there were some learnings. Whosoever is in the data field should at least try Snowflake once. They will then realize the best features in the solution and can continue using it. Overall, I rate the solution a seven out of ten.

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"We are using applications, such as Splunk, Livy, Hadoop, and Spark. We are using all of these applications in Amazon EMR and they're helping us a lot."
"The solution helps us manage huge volumes of data."
"Amazon EMR is a good solution that can be used to manage big data."
"In Amazon EMR it is easy to rebuild anything, easy to upgrade and has good fault tolerance."
"This is the best tool for hosts and it's really flexible and scalable."
"It has a variety of options and support systems."
"Amazon EMR's most valuable features are processing speed and data storage capacity."
"One of the valuable features about this solution is that it's managed services, so it's pretty stable, and scalable as much as you wish. It has all the necessary distributions. With some additional work, it's also possible to change to a Spark version with the latest version of EMR. It also has Hudi, so we are leveraging Apache Hudi on EMR for change data capture, so then it comes out-of-the-box in EMR."
"The speed of data loading and being able to quickly create the environment are most valuable."
"The initial setup is very simple."
"Working with Parquet files is support out of the box and it makes large dataset processing much easier."
"The solution speeds up the process of onboarding."
"It is a highly scalable solution. There is no limit on storage or computing."
"The most efficient way for real-time dashboards or analytical business intelligence reports to be sent to the customer."
"The way it is built and designed is valuable. The way the shared model is built and the way it exploits the power of the cloud is very good. Certain features related to administration and management, akin to Oracle Flashback and all that, are very important for modern-day administration and management. It is also good in terms of managing and improving performance, indexing, and partitioning. It is sort of completely automated. Everything is essentially under the hood, and the engine takes care of it all. As a data warehouse on the cloud, Snowflake stands strong on its ground even though each of the cloud providers has its own data warehouse, such as Redshift for AWS or Synapse for Azure."
"The platform's most valuable features include its ability to effectively summarize and manage large datasets, allowing multiple teams to analyze and generate insights."
 

Cons

"The initial setup was time-consuming."
"The most complicated thing is configuring to the cluster and ensure it's running correctly."
"Amazon EMR can improve by adding some features, such as megastore services and HiveServer2. Additionally, the user interface could be better, similar to what Apache service provides, cross-platform services."
"The solution can become expensive if you are not careful."
"There is room for improvement in pricing."
"There is no need to pay extra for third-party software."
"The dashboard management could be better. Right now, it's lacking a bit."
"Spark jobs take longer on Amazon EMR compared to previous experiences."
"The pricing of the solution should be much easier to calculate or find by yourself."
"The scheduling system can definitely be better because we had to use external airflow for that. There should be orchestration for the scheduling system. Snowflake currently does not support machine learning, so it is just storage. They also need some alternatives for SQL Query. There should also be support for Spark in different languages such as Python."
"I would like to see a client version of the GUI."
"Snowflake needs to improve its programming part. Though the tool has Snowpath, it doesn’t support all features like its competitor, Databricks. Snowflake doesn’t support external data ingestion capabilities. You need to have third-party tools for that. Also, the tool needs to incorporate data integration features in its future releases."
"It doesn't enforce typical relational database constraints. Quite expensive."
"Sometimes it can be tricky to manage multiple environments if you're purely using Snowflake as your scripting and pipeline environment."
"The design of the product is easily misunderstood."
"There are some stored procedures that we've had trouble with. The solution also needs to fine-tune the connectors to be able to connect into the system source."
 

Pricing and Cost Advice

"The cost of Amazon EMR is very high."
"There is no need to pay extra for third-party software."
"The price of the solution is expensive."
"You don't need to pay for licensing on a yearly or monthly basis, you only pay for what you use, in terms of underlying instances."
"Amazon EMR's price is reasonable."
"The product is not cheap, but it is not expensive."
"There is a small fee for the EMR system, but major cost components are the underlying infrastructure resources which we actually use."
"I rate the tool's pricing a five out of ten. It can be expensive since it's a managed service, and if you are not careful, you can run into unexpected charges. You can make a mistake that costs you tens of thousands of dollars. That's happened to us twice, so I'm sensitive to it. We're still trying to work on that. Our smallest client probably spends a hundred thousand dollars yearly on licensing, while our largest is well over a million."
"Users have to pay a licensing fee for the solution, which is expensive."
"Snowflake is a cost-effective solution."
"There is a separation of storage and compute, so you only pay for what you use."
"Oracle is less expensive than Snowflake."
"It is pay-as-you-go. Its cost is in the medium range."
"It is per credit. It has a use-it-as-you-go model. We bought a chunk of 20,000 credits, and they were lasting us for at least a year. We didn't have the scale of data like a much larger company to consume more credits. For us, it was very inexpensive. Their strategy is just to leverage what you've got and put Snowflake in the middle. It doesn't make it expensive because most of the organizations already have reporting tools. Now, if you were starting from scratch, it might be cheaper to go a different way."
"It's expensive."
"Pricing is based on usage. It is the most expensive of our data tools."
report
Use our free recommendation engine to learn which Cloud Data Warehouse solutions are best for your needs.
823,875 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
Financial Services Firm
24%
Computer Software Company
13%
Manufacturing Company
9%
Educational Organization
7%
Educational Organization
36%
Financial Services Firm
12%
Computer Software Company
8%
Manufacturing Company
6%
 

Company Size

By reviewers
Large Enterprise
Midsize Enterprise
Small Business
 

Questions from the Community

What do you like most about Amazon EMR?
Amazon EMR is a good solution that can be used to manage big data.
What is your experience regarding pricing and costs for Amazon EMR?
The cost of Amazon EMR is a little bit expensive, especially considering the support package, which includes a gold package.
What needs improvement with Amazon EMR?
Spark jobs take longer on Amazon EMR compared to previous experiences. This aspect could be improved to make them more efficient.
What do you like most about Snowflake?
The best thing about Snowflake is its flexibility in changing warehouse sizes or computational power.
What is your experience regarding pricing and costs for Snowflake?
The pricing part is based on the computing and storage. The costs are different and then there are services costs as well. I have heard that Snowflake is costlier than Redshift or GCP BigQuery. A s...
What needs improvement with Snowflake?
I think people do not want to create pipelines for many customers now. Normally, we have this layer architecture, like layer one, layer two, layer three, or layer four, where we have raw data, inte...
 

Also Known As

Amazon Elastic MapReduce
Snowflake Computing
 

Overview

 

Sample Customers

Yelp
Accordant Media, Adobe, Kixeye Inc., Revana, SOASTA, White Ops
Find out what your peers are saying about Amazon EMR vs. Snowflake and other solutions. Updated: December 2024.
823,875 professionals have used our research since 2012.