One of the most popular comparisons on IT Central Station is Amazon Redshift vs Vertica.
People like you are trying to decide which one is best for their company. Can you help them out?
What is the biggest difference between Amazon Redshift and Vertica? Which of these two solutions would you recommend to a colleague evaluating cloud data warehouses and why?
Thanks for helping your peers make the best decision!
I am assuming you are asking me a question. I have similar points like the one you sent in the link. However, I will narrow it down further with
highlighted ones that matter.
Redshift is cloud-native so not a good idea if you also need a data center. It does not support document store, so that's another limitation.
Vertica works for the hybrid environment and also supports more APIs and supports all familiar languages.
Look at more features as given in the below table and decide. I would go with Vertica if there is a hybrid cloud, needs document store, secondary indexes are a critical need and need support for more APIs (as given in the table). For AWS native environment, go with RedShift.
Amazon Redshift X:
Description: Large scale data warehouse service for use with business intelligence tools.
Primary database model: Relational DBMS
Secondary database models: Key-value store
Website: aws.amazon.com/redshift
Technical documentation: docs.aws.amazon.com/redshift
Developer: Amazon (based on PostgreSQL)
License: commercial
Cloud-based only: Yes
XML Support: No
Secondary indexes: Restricted
SQL: Yes
APIs and other access methods: JDBC, ODBC
Supported programming languages: All languages supporting JDBC/ODBC
Server-side scripts: User-defined functions
Triggers: No
Vertica X:
Description: Columnar relational DBMS designed to handle modern analytic workloads, enabling fast query performance.
Primary database model: Relational DBMS
Secondary database models: Key-value store, Document store
Website: www.vertica.com
Technical documentation: www.vertica.com/documentation/vertica
Developer: Vertica / Micro Focus
License: Commercial
Cloud-based only: No
XML Support: Yes
Secondary indexes: No
SQL: Yes
APIs and other access methods: Kafka, Proprietary protocol, RESTful HTTP API, ADO.NET, JDBC, ODBC
Supported programming languages: C++, Java, Perl, Python, R
Server-side scripts: Yes
Triggers: No
* Flexibility of deployment: Amazon Redshift can only be deployed on the AWS. Redshift lacks support for the hybrid cloud-and-on premises data warehousing combinations. Vertica’s industry-leading deployment options allow customers to run Vertica on AWS, Azure, Google Cloud Platform, on-premise, and on Hadoop.
* Speed: The query execution performance of Vertica and Redshift and some of the differentiators in the two products with Vertica query response times on the 5 TB dataset, 20 concurrent user test, for example, were 3.4 times faster. Also, when over 100 users are simultaneously running queries on the same data, Amazon Redshift’s performance degrades considerably, leading to hours or even days of query response times.
Which of these two solutions would you recommend to a colleague evaluating cloud data warehouses and why? Vertica. As Vertica has flexibility as your data warehouse evolves long term with best in class speed.