We use the solution to build a data warehouse schema for a target database for analytics. We are uploading data from different transactional databases into Amazon Redshift. We use it for reporting purposes. We use the tool mainly for querying and retrieving the data for analytics.
Composition Data Architect at Intellias
A powerful database system that works quickly with huge volumes of data
Pros and Cons
- "Amazon Redshift is a really powerful database system for reporting and data warehousing."
- "The product must provide new indexes that support special data structures or data types like TEXT."
What is our primary use case?
How has it helped my organization?
The fast querying of a huge amount of data greatly impacts our data workflows. All the queries work pretty fast.
What is most valuable?
Amazon Redshift is a really powerful database system for reporting and data warehousing. I like the product. It works really fast with significant volumes of data. The product covers all the main functionalities required for our data security and compliance needs. It has almost everything we need. It is the main data source for our analytics functionality. We can run our models using the data stored in the database. The ease of use is fine. It is pretty easy to integrate the solution with other products and third-party solutions.
What needs improvement?
The product must provide new indexes that support special data structures or data types like TEXT.
Buyer's Guide
Amazon Redshift
February 2025
Learn what your peers think about Amazon Redshift. Get advice and tips from experienced pros sharing their opinions. Updated: February 2025.
832,138 professionals have used our research since 2012.
What do I think about the stability of the solution?
I have no complaints about the product’s stability.
What do I think about the scalability of the solution?
The tool is scalable. About 30 to 50 analysts use the solution in our organization. We need one or two people to administer the solution.
How are customer service and support?
I haven't heard any complaints about the support team from our DevOps engineers.
Which other solutions did I evaluate?
My project involves analytics and data warehousing. I use Amazon Redshift. I also use AWS Glue as an ETL tool.
What other advice do I have?
I will recommend the product to others for data warehousing and data analytics. However, I do not recommend the solution for small companies that do not have enough volume of data to analyze. Overall, I rate the product an eight out of ten.
Disclosure: My company has a business relationship with this vendor other than being a customer: Partner
Last updated: May 15, 2024
Flag as inappropriateDWH, BI & Big Data consultant / developer /modeler - independent contractor at Freelancer
Helps us create SQL ETL procedures in a business system
Pros and Cons
- "I like it because the usage is very similar to Microsoft SQL server. The structure of the query and the temporary tables are very similar."
- "The explain panel in the Redshift database could be better."
What is our primary use case?
I use Amazon Redshift for the creation of SQL ETL procedures in a business system. Business people check this in a front-end application, and it helps them plan sales for the next year.
Redshift is being deployed on a Microsoft Azure server.
There are about six people working on this project and using the solution, but there are many similar projects running on Oracle and Redshift.
What is most valuable?
I like it because the usage is very similar to Microsoft SQL server. The structure of the query and the temporary tables are very similar. Until recently, I thought it was the superior database, but now I think that Redshift is better.
What needs improvement?
The explain panel in the Redshift database could be better.
For how long have I used the solution?
I have used this solution for 10 months.
What do I think about the stability of the solution?
The solution is stable. I haven't had any problems or downfall with the database in the 10 months that I have used the solution.
Which solution did I use previously and why did I switch?
I have also used Microsoft SQL server and Oracle.
How was the initial setup?
Setup was difficult because I had to set up 25 connections with different users and passwords. The connections have been predefined, but there were still problems when trying to connect for the first time. I had some problems with some certifications that were malfunctioning. This might have had something to do with the functionality of my keyboard because if I pushed a random combination on the keyboard, it would delete the certificate from the folder and the connection wouldn't work. I think this is a problem with the remote desktop rather than with Redshift.
What other advice do I have?
I would rate this solution as eight out of ten. I can't give it a higher score because there are some issues with variable character columns in the table. Otherwise, it's a great database.
Some of my former colleagues from a previous job have joined my organization, and they have had some issues with the SQRs because some things work differently in Redshift, like the partition bar. If someone has issues with Redshift, my advice is to check with support.
If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?
Microsoft Azure
Disclosure: I am a real user, and this review is based on my own experience and opinions.
Buyer's Guide
Amazon Redshift
February 2025
Learn what your peers think about Amazon Redshift. Get advice and tips from experienced pros sharing their opinions. Updated: February 2025.
832,138 professionals have used our research since 2012.
Business Intelligence Software Engineer at Suncorp Group Holdings (NZ) Limited
Capable of handling large-scale datasets for businesses but needs to reduce its prices
Pros and Cons
- "I am satisfied with the performance of the product."
- "The high price of the product is an area of concern where improvements are required."
What is our primary use case?
I use the solution in my company for our data warehouse and databases.
What needs improvement?
The high price of the product is an area of concern where improvements are required.
For how long have I used the solution?
I have been using Amazon Redshift for three years.
Which solution did I use previously and why did I switch?
My company uses IBM Cognos as a reporting portal on top of Amazon Redshift.
My company moved from Netezza to Amazon Redshift since the latter is available on the cloud platform. My company used some migration tools to shift from Netezza to Amazon Redshift, and it took almost a year.
What was our ROI?
Amazon Redshift was not helpful in improving our organization's functioning, and I believe that previously, we had a better database named Netezza in place.
What's my experience with pricing, setup cost, and licensing?
It is an expensive product.
What other advice do I have?
The tool can handle some large-scale datasets for our company since we use data shares.
I am satisfied with the performance of the product.
The product is able to fulfill my company's needs associated with data analytics.
Amazon Redshift is used as a storage tool.
I rate the tool a seven out of ten.
Disclosure: I am a real user, and this review is based on my own experience and opinions.
Last updated: Jun 18, 2024
Flag as inappropriateSolution Architect at Capgemini
Plenty of features, high availability, and elaborate documentation
Pros and Cons
- "For the on-premises version of Amazon Redshift, we need to start from scratch. However, with the cloud version whenever you want to deploy, you can scale up, and down, and it has a data warehousing capability. Redshift has many features."
- "We are using third-party tools to integrate Amazon Redshift, they should create their own interface on their own for it to be easily connected on the AWS itself."
What is most valuable?
For the on-premises version of Amazon Redshift, we need to start from scratch. However, with the cloud version whenever you want to deploy, you can scale up, and down, and it has a data warehousing capability. Redshift has many features.
They have enriched and elaborate documentation that is helpful.
What needs improvement?
We are using third-party tools to integrate Amazon Redshift, they should create their own interface on their own for it to be easily connected on the AWS itself.
For how long have I used the solution?
I have been using Amazon Redshift for one year.
What do I think about the stability of the solution?
Amazon Redshift is reliable and has high availability.
What do I think about the scalability of the solution?
The scalability of Amazon Redshift is good.
We have approximately 20 people using this solution in my organization.
Which solution did I use previously and why did I switch?
Before we were using Amazon Redshift, we were working with Postgres, Greenplum, and Oracle SQL. These were on-premises databases, and we migrated to the cloud.
What's my experience with pricing, setup cost, and licensing?
The price of Amazon Redshift is reasonable because it depends on the usage that you use and for DWH for the long term.
What other advice do I have?
I would advise others that if they have a large set of data where you have a less number of updates, then choose Amazon Redshift. If there is more update and fewer inserts, then do not use Amazon Redshift.
I rate Amazon Redshift an eight out of ten.
Which deployment model are you using for this solution?
Private Cloud
Disclosure: I am a real user, and this review is based on my own experience and opinions.
VP, Data and Insights at a tech company with 201-500 employees
Secure and reliable data warehouse for transactional and clickstream data
Pros and Cons
- "If the analyst knows SQL, which is comfortable and easy to use to go between all of these tool stacks, I think it's reliable. It's a secure and reliable data warehouse."
- "There are physically too many pipelines for a company of this size to maintain. For a data scientist, it's very difficult to learn the data in all of these different environments."
What is our primary use case?
We use this solution for production and customer data for one of the lines of the business, which is basically one of the warehouses. I oversee the whole architecture infrastructure.
What is most valuable?
The price isn't bad for the performance for a cloud data warehouse. It's also connected to Databricks but uses SQL. It's comparable to BigQuery. If the analyst knows SQL, which is comfortable and easy to use to go between all of these tool stacks, I think it's reliable. It's a secure and reliable data warehouse.
The performance is good, and it's pretty fast. We also have Looker and MOLE connected to it for visualization, which works seamlessly. We're storing a lot of data. There's a lot of transactional data, clickstream data, and telemetry about the customer, what they're purchasing, call logs, and marketing data.
The analysts are familiar with SQL, and they're able to do this. Even the data scientists who aren't that savvy in Python, because they are very strong in SQL, are able to interact with it very quickly. I'm able to bring in more analysts for support.
What needs improvement?
There are physically too many pipelines for a company of this size to maintain. For a data scientist, it's very difficult to learn the data in all of these different environments. It's easier to train people in just one environment to start with, like Snowflake or Databricks. It's difficult to have so many technologies that are very comparable, and each comes with a price tag.
For how long have I used the solution?
I have used this solution for seven months in my current company, but I have also used it in a couple of my previous companies.
What do I think about the stability of the solution?
The stability is good. For a mid-size company, it's very stable.
What do I think about the scalability of the solution?
It's very scalable.
How are customer service and support?
I would rate the technical support as 10 out of 10.
We had one or two tickets, and they responded extremely quickly. Technical support is good, but that's because we aren't running into many issues. The solution is pretty stable. The individuals who set it up did a very good job.
How was the initial setup?
It wasn't too complex because only a few people were needed to set up the solution.
What's my experience with pricing, setup cost, and licensing?
It's not very pricey compared to other tools. I would rate the price as 5 out of 10.
Which other solutions did I evaluate?
In my organization, we also use GCP, Amazon EC2, Databricks, and Snowflake. We also have Delta Lakehouse. I'm going to move everything into the Delta Lakehouse. For a company of this size, it's a lot of tools to physically maintain with a small data engineering team.
I think Snowflake has a few more features. In Redshift, you need to write a bit more SQL in some instances, but it's very user-friendly and fast. It can be used as a data warehouse solution as well. It can also do some analytics.
Redshift is comparable to other solutions. I wanted to go with Amazon EC2 because we also have Databricks, and I think I can cover some of those features with the combination of that.
What other advice do I have?
I would rate this solution as eight out of ten. It's a very strong solution.
My advice is to do your research and see if it makes sense for you. You can always request a demo from Redshift.
Disclosure: I am a real user, and this review is based on my own experience and opinions.
Senior System Engineer at a computer software company with 10,001+ employees
Has good data updates and latency between data-refreshes
Pros and Cons
- "In terms of valuable features, I like the columnar storage that Redshift provides. The storage is one of the key features that we're looking for. Also, the data updates and the latency between the data-refreshes."
- "Pricing is one of the things that it could improve. It should be more competitive."
What is our primary use case?
My primary use for Amazon Redshift is for analytical purposes.
What is most valuable?
In terms of valuable features, I like the columnar storage that Redshift provides. The storage is one of the key features that we're looking for. Also, the data updates and the latency between the data-refreshes are valuable.
What needs improvement?
Pricing is one of the concerns that I have because if you compare Snowflake with Redshift, it provides some of the same services, but at a much cheaper rate. So pricing is one of the things that it could improve. It should be more competitive.
Otherwise, everything else looks good, especially the data storage and analytical processes.
For how long have I used the solution?
I just started using Amazon Redshift. I'm still learning about the product and about how the pricing and the dynamic skilling are done.
What do I think about the stability of the solution?
I understand that Amazon Redshift is quite stable and it has features like higher readability. So I think it should be a good product to be used.
How are customer service and technical support?
I have not been in touch with support but I will be needing their support in the near future to help me set up the environment.
How was the initial setup?
In terms of the initial setup, as I said I've still not started using it, so I pray that it's going to be pretty easy for me to set it up.
What's my experience with pricing, setup cost, and licensing?
Of course I would advise others to choose Amazon Redshift, as long as pricing is not a concern for them.
What other advice do I have?
On a scale of 1 - 10, where 1 is the worst and 10 is the best, I'd give it an 8.
Which deployment model are you using for this solution?
Public Cloud
Disclosure: My company has a business relationship with this vendor other than being a customer: Partner
Senior Data Scientist at a tech services company with 51-200 employees
The solution works fast and we use it for marketing analytics
Pros and Cons
- "The solution is scalable. It handles different loads very well."
- "They should provide a better way to work with interim data in a structured way than to store it in parquet files locally."
What is our primary use case?
We mainly use the solution for marketing analytics.
What is most valuable?
The solution works fast. I use Redshift to clear a lot of web page data. I use it mainly as an extraction tool to obtain the information I need for a project and store it in parquet files on a disk. Later, I work on the data using Python. I write back all my final results to Redshift and store the temporary files on a local machine.
What needs improvement?
They should provide a structured way to work with interim data than to store it in parquet files locally. Also, Redshift is unwieldy. There should be better integration between Python and Redshift. It could be more accessible without using so many sequels.
They should make writing and reading the data frames into and from Redshift easier. The performance could be better. I have used Redshift for extensive queries. For the large tables, it's easier to unload to Redshift, but subquery tables that run complex grids are slower for configuration. I have to use the unloaded command to unload the whole table. Further, I have to read the table into a server with extensive memory in Python and process the data ahead. It's not optimal.
For how long have I used the solution?
We have been using the solution for two years.
What do I think about the stability of the solution?
I rate the solution’s stability as a nine.
What do I think about the scalability of the solution?
The solution is scalable. It handles different loads very well. We have 80-100 users using the solution in our company.
How was the initial setup?
The setup was quite complicated. For instance, if you use AWS Glue to automate loads into Redshift, setting up the security for the requirements between the two is complex. I've struggled a lot to set up the cluster on VPC and to get all the endpoints set up correctly with the right access and services. Especially from Glue's endpoints, I had to repeat the same process every time. It consumes a lot of time. In comparison, the CloudOps executives do the setup very quickly.
What's my experience with pricing, setup cost, and licensing?
I have heard complaints about the solution’s pricing, and thus I rate it as a five.
What other advice do I have?
I rate the solution as an eight out of ten.
Disclosure: My company has a business relationship with this vendor other than being a customer: Partner
BI Architect & Developer (contract) at a retailer with 501-1,000 employees
You can configure tables to live in the memory of all of the available cores.
What is most valuable?
Column store and distributed processing is optimized for read access. We grew to 3000+ users with no impact.
Column store is a data compression technique for relational data. I’m using it now in SQL Server 2016. We configured a 16-core VM for handling requests on the DB. The recommendation was to separate inbound data packets into related chunks, which were 1/16th of the size.
This way, the import process could make full use of parallelization, and it worked. We imported 20 million rows of sales facts in less than 15 seconds, and the content was query-able immediately. I’ve never seen that before. This was impressive. This meant that we could completely rebuild the data warehouse to “current” from "scratch" within minutes, assuming that the data was in S3 already.
Tables that would typically be 2GB in size are now about 250MB. This means more data in memory. You can also configure the tables to live in the memory of all of the available cores. This is good for small dimension tables. You can also fragment them across all cores, for the larger fact tables. This allows for distributed query processing. Once you set it up, it just worked. It was all specified in the PG-SQL table statements.
There were two data centers in Sydney that were guaranteeing us a distributed solution. We really didn’t notice this. It was more of a check box situation. At one point, there was an outage at AWS, but it didn’t impact our operations directly.
How has it helped my organization?
This has given us the ability to provide metrics to the large number of company staff on their performance without impacting core systems.
What needs improvement?
I’d like to see these RedShift features arrive in other languages, such as SQL's ColumnStore index.
.
For how long have I used the solution?
I have used this solution for three years.
What do I think about the stability of the solution?
There have been no stability issues.
How are customer service and technical support?
Technical support always met my expectations.
Which solution did I use previously and why did I switch?
I was on a team that was using AWS tools for Dick Smith Electronics (now liquidated). The tools ceased use in February of 2016.
Prior to that, we were using them fully for about 3 years. We loaded data to Redshift according to the best practices included in the online docs and through consultation with the AWS staff. The combination of S3 and Redshift for this purpose was very high in performance. Redshift was used to provide the data model to an instance of MicroStrategy for BI reporting.
We were using MicroStrategy, which generated all the SQL that our reporting services needed.
As such, I could only comment on the data engineering phase. Technically, this was so impressive that I don’t know what to add. I don’t recall feeling that it missed anything. If anything, I was not using all the available features. AWS documentation is great in this regard. You can tell they have put a lot of thought into it.
A lot of the future direction in database technology has to do with memory optimization and concurrency (VoltDB). This is more targeted towards transactional processing, and not data warehousing.
Memory-only data warehousing solves a lot of access issues without having to think too hard about the problem from the consumers' point of view. I am sure that you can already configure this.
Disclosure: I am a real user, and this review is based on my own experience and opinions.
Buyer's Guide
Download our free Amazon Redshift Report and get advice and tips from experienced pros
sharing their opinions.
Updated: February 2025
Product Categories
Cloud Data WarehousePopular Comparisons
Azure Data Factory
Snowflake
Teradata
Microsoft Azure Synapse Analytics
Vertica
Amazon EMR
AWS Lake Formation
Oracle Autonomous Data Warehouse
SAP Business Warehouse
IBM Db2 Warehouse on Cloud
Firebolt
Buyer's Guide
Download our free Amazon Redshift Report and get advice and tips from experienced pros
sharing their opinions.
Quick Links
Learn More: Questions:
- Which ETL or Data Integration tool goes the best with Amazon Redshift?
- What is the major difference between AWS Redshift and Snowflake?
- What is the biggest difference between Amazon Redshift and Vertica
- How does Amazon Redshift compare with Microsoft Azure Synapse Analytics?
- What are the challenges faced during migrating from Netezza to AWS Redshift?
- Which ETL or Data Integration tool goes the best with Amazon Redshift?
- What are the main differences between Data Lake and Data Warehouse?
- What are the benefits of having separate layers or a dedicated schema for each layer in ETL?
- What are the key reasons for choosing Snowflake as a data lake over other data lake solutions?
- Are there any general guidelines to allocate table space quota to different layers in ETL?