Try our new research platform with insights from 80,000+ expert users

Amazon EMR vs Snowflake comparison

 

Comparison Buyer's Guide

Executive Summary
 

Categories and Ranking

Amazon EMR
Ranking in Cloud Data Warehouse
11th
Average Rating
7.8
Number of Reviews
21
Ranking in other categories
Hadoop (3rd)
Snowflake
Ranking in Cloud Data Warehouse
1st
Average Rating
8.4
Number of Reviews
98
Ranking in other categories
Data Warehouse (1st)
 

Mindshare comparison

As of November 2024, in the Cloud Data Warehouse category, the mindshare of Amazon EMR is 4.5%, up from 4.5% compared to the previous year. The mindshare of Snowflake is 28.7%, up from 23.7% compared to the previous year. It is calculated based on PeerSpot user engagement data.
Cloud Data Warehouse
 

Featured Reviews

Quan Vu - PeerSpot reviewer
Aug 22, 2023
Provides efficient data processing features and has good scalability
We need to have a data pipeline tool to ensure consistent data processing for the initial setup. We create a framework, read the code, and execute it in a data catalog. The size of the maintenance team depends on the project and the use cases. Usually, one backup team of four or five DevOps executives takes care of the backend and database. We need to separate our environments into production and development. We use GitHub for source control, Jenkins for the deployment pipeline, and a standard CI/CD tool to deploy code changes into production. We need to develop a deployment framework so developers only need to provide the code for their projects. The underlying engine then deploys the code, reads it, addresses the EMR filter, executes it, and completes the data processing.
VivekSingh 1 - PeerSpot reviewer
Sep 11, 2024
Provides good data ingestion capability, but should include more AI capabilities
The solution's integration aspect is good, and all the connectors are in place. I found Snowflake similar to RDS. We use it for both data in motion and data in transit. It looks like the tool handles the data quite securely. We create ETL patterns. We ingest data from different source systems, and we have to create data pipelines. It would be useful if we could have AI features added to identify what I'm going to do with this data. It would be good if it could look at the data and help me create an automated pipeline instead of me creating a pipeline by myself. I'm from a retail background. I completed my Oracle DBA training a long time ago, about 18 years ago. I was quite familiar with the Snowflake and relational database concepts since I had already completed the Oracle ops, DBA ops, OCP, and OPA courses. For me, it was a journey similar to when I shifted from Oracle RDS to Snowflake. Although I was quite familiar with most of the concepts, there were some learnings. Whosoever is in the data field should at least try Snowflake once. They will then realize the best features in the solution and can continue using it. Overall, I rate the solution a seven out of ten.

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"In Amazon EMR it is easy to rebuild anything, easy to upgrade and has good fault tolerance."
"When we grade big jobs from on-prem to the cloud, we do it in EMR with Spark."
"Amazon EMR's most valuable features are processing speed and data storage capacity."
"The ability to resize the cluster is what really makes it stand out over other Hadoop and big data solutions."
"We are using applications, such as Splunk, Livy, Hadoop, and Spark. We are using all of these applications in Amazon EMR and they're helping us a lot."
"The initial setup is pretty straightforward."
"One of the valuable features about this solution is that it's managed services, so it's pretty stable, and scalable as much as you wish. It has all the necessary distributions. With some additional work, it's also possible to change to a Spark version with the latest version of EMR. It also has Hudi, so we are leveraging Apache Hudi on EMR for change data capture, so then it comes out-of-the-box in EMR."
"The solution helps us manage huge volumes of data."
"It is very fast and the performance is great."
"The most valuable features are sharing data, Time Travel, Zero Copy Cloning, performance, and speed."
"Snowflake is a database, and it is very good and useful. The most interesting part is that memory management is very good in Snowflake. For a business intelligence project, SQL Server is taking a lot of time for reporting services. There are a lot of calculations, and the reporting time is shown as two minutes, whereas Snowflake is taking just two seconds for the same reporting services."
"It is a very easy-to-use solution. It is user-friendly, and its setup time is very less."
"The overall ecosystem was easy to manage. Given that we weren't a very highly technical group, it was preferable to other things we looked at because it could do all of the cloud tunings. It can tune your data warehouse to an appropriate size for controlled billing, resume and sleep functions, and all such things. It was much more simple than doing native Azure or AWS development. It was stable, and their support was also perfect. It was also very easy to deploy. It was one of those rare times where they did exactly what they said they could do."
"Time travel is one feature that really helps us out."
"Snowflake's most valuable features are data enrichment and flattening."
"Its performance is most valuable. As compared to SQL Server, we are able to see a significant improvement in performance with Snowflake."
 

Cons

"There were times where they would release new versions and it seemed to end up breaking old versions, which is very strange."
"The initial setup was time-consuming."
"We don't have much control. If we have multiple users, if they want to scale up, the cost will go and increase and we don't know how we can restrict that price part."
"Amazon EMR can improve by adding some features, such as megastore services and HiveServer2. Additionally, the user interface could be better, similar to what Apache service provides, cross-platform services."
"As people are shifting from legacy solutions to other technologies, Amazon EMR needs to add more features that give more flexibility in managing user data."
"There is room for improvement in pricing."
"Amazon EMR is continuously improving, but maybe something like CI/CD out-of-the-box or integration with Prometheus Grafana."
"There is no need to pay extra for third-party software."
"There could be better ELT tools that are appropriate for Snowflake. We decided on Matillion and it seemed to be the only one. There need to be better choices, it would be great if Snowflake provided an ELT solution that people could use. Additionally, if there was a pure cloud-based ELT tool it would be useful."
"I don't think that the AI tools in Snowflake are good."
"It would benefit from an administration that allows you to be aware of your credit consumption once you have the service so that you may be sure how many credits you are consuming when you use the platform and to make sure that you are making the most efficient use of these resources. In other words, to improve their interface so that you may monitor the consumption of your credits on Cloud."
"Portability is a big hurdle right now for our clients. Porting all of your existing SQL ecosystem, such as stored procedures, to Snowflake is a major pain point. Currently, Snowflake stored procedures use JavaScript, but they should support SQL-based stored procedures. It would be a huge advantage if you can write your stored procedures using SQL. It seems that they are working on this feature, and they are yet to release it. I remember seeing some notes saying that they were going to do that in the future, but the sooner this feature comes out, it would be better for Snowflake because there are a lot of clients with whom I'm interacting, and their main hurdle is to take their existing Oracle or SQL Server stored procedures and move them into Snowflake. For this, you need to learn JavaScript and how it works, which is not easy and becomes a little tricky. If it supports SQL-based procedures, then you can just cut-paste the SQL code, run it, and easily fix small issues."
"The addition of more AI capabilities in Snowflake would help us more."
"If you go with one cloud provider, you can't switch."
"Snowflake can improve its machine learning and AI capabilities."
"The scheduling system can definitely be better because we had to use external airflow for that. There should be orchestration for the scheduling system. Snowflake currently does not support machine learning, so it is just storage. They also need some alternatives for SQL Query. There should also be support for Spark in different languages such as Python."
 

Pricing and Cost Advice

"I rate the tool's pricing a five out of ten. It can be expensive since it's a managed service, and if you are not careful, you can run into unexpected charges. You can make a mistake that costs you tens of thousands of dollars. That's happened to us twice, so I'm sensitive to it. We're still trying to work on that. Our smallest client probably spends a hundred thousand dollars yearly on licensing, while our largest is well over a million."
"Amazon EMR's price is reasonable."
"There is a small fee for the EMR system, but major cost components are the underlying infrastructure resources which we actually use."
"The cost of Amazon EMR is very high."
"The price of the solution is expensive."
"You don't need to pay for licensing on a yearly or monthly basis, you only pay for what you use, in terms of underlying instances."
"Amazon EMR is not very expensive."
"There is no need to pay extra for third-party software."
"Snowflake is cost-effective."
"The whole licensing system is based on credit points. You can also make a license agreement with the company so that you buy credit points and then you use them. What you do not use in one year can be carried over to the next year."
"Snowflake is expensive, but when I consider what we get for that price, it's fair. I rate the solution three out of five for affordability, right in the middle."
"The product's price range falls between average to a bit expensive range. I think the tool is worth the money if you use it properly."
"The price of the solution is reasonable."
"We used Snowflake to see if it is cheaper than using BigQuery. It was just to maintain the cost or the KPI regarding the cost of connectivity by users. Snowflake wasn't cheaper than BigQuery, and its affordability was the main issue."
"Snowflake has consumption-based costs; users only pay for storage and computing."
"Snowflake is a cost-effective solution."
report
Use our free recommendation engine to learn which Cloud Data Warehouse solutions are best for your needs.
815,854 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
Financial Services Firm
25%
Computer Software Company
13%
Manufacturing Company
9%
Educational Organization
7%
Educational Organization
35%
Financial Services Firm
12%
Computer Software Company
9%
Manufacturing Company
6%
 

Company Size

By reviewers
Large Enterprise
Midsize Enterprise
Small Business
 

Questions from the Community

What do you like most about Amazon EMR?
Amazon EMR is a good solution that can be used to manage big data.
What is your experience regarding pricing and costs for Amazon EMR?
I rate the tool's pricing a five out of ten. It can be expensive since it's a managed service, and if you are not careful, you can run into unexpected charges. You can make a mistake that costs you...
What needs improvement with Amazon EMR?
The solution can become expensive if you are not careful.
What do you like most about Snowflake?
The best thing about Snowflake is its flexibility in changing warehouse sizes or computational power.
What is your experience regarding pricing and costs for Snowflake?
The pricing part is based on the computing and storage. The costs are different and then there are services costs as well. I have heard that Snowflake is costlier than Redshift or GCP BigQuery. A s...
What needs improvement with Snowflake?
I think people do not want to create pipelines for many customers now. Normally, we have this layer architecture, like layer one, layer two, layer three, or layer four, where we have raw data, inte...
 

Comparisons

 

Also Known As

Amazon Elastic MapReduce
Snowflake Computing
 

Overview

 

Sample Customers

Yelp
Accordant Media, Adobe, Kixeye Inc., Revana, SOASTA, White Ops
Find out what your peers are saying about Amazon EMR vs. Snowflake and other solutions. Updated: October 2024.
815,854 professionals have used our research since 2012.