Try our new research platform with insights from 80,000+ expert users

Cloudera Distribution for Hadoop vs Vertica comparison

 

Comparison Buyer's Guide

Executive Summary
 

Categories and Ranking

Cloudera Distribution for H...
Average Rating
8.0
Reviews Sentiment
6.4
Number of Reviews
49
Ranking in other categories
Hadoop (2nd), NoSQL Databases (8th)
Vertica
Average Rating
8.2
Reviews Sentiment
7.0
Number of Reviews
86
Ranking in other categories
Data Warehouse (5th), Cloud Data Warehouse (8th)
 

Featured Reviews

Shahan Rehman - PeerSpot reviewer
Can host multiple technologies and help businesses with their AI initiatives
The ease or difficulty in setting up the product depends on the environment of the customer where the tool is deployed. If a banking, industrial, or retail sector firm is taken into concentration, depending on how big of a database is maintained, including the applications that are to be hosted, the deployment process can range from a simple to a very complex phase, depending on the architecture. For Cloudera Distribution for Hadoop, one has to go through the usual deployment process, like for any software product. You have to have different environments before going into production, like pre-production environments, test and dev environments. You install and configure all the components in the test environment and then test them on the pre-production environment. Once UAT is done, you move them to the production environment. In general, it's a critical product deployed in a company.
T Venkatesh - PeerSpot reviewer
Processes query faster through multiple systems simultaneously, but it could support different data types
We use the solution for various tasks, including preparing data marts and generating offers. It helps extract data based on rules from the policy team and provides insights to enhance business operations. We also analyze transactions to target customers and improve business performance The…

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"The most valuable feature is Kubernetes."
"The main advantage is the storage is less expensive."
"I don't see any performance issues."
"The product provides better data processing features than other tools."
"The most valuable feature is Impala, the querying engine, which is very fast."
"The solution's most valuable feature is the enterprise data platform."
"The search function is the most valuable aspect of the solution."
"The tool's most interesting features are the distributed file system and unstructured data processing capability. Because we have a lot of unstructured data, like XML and social media logs, these features make it more valuable than the usual data warehousing solutions."
"Allows us to take volumes and process them at a very high speed."
"Integrated R and geospatial functions are helping us improve efficiency and explore new revenue streams. ​"
"The extensibility and efficiency provided by their C++ SDK."
"The most valuable feature of Vertica is the ability to receive large aggregations at a very quick pace. The use case of subclusters is very good."
"Vertica has a few features that I like. From an architecture standpoint, they have separated compute and storage. So you have low-cost object storage for primary storage and the ability to have several sub-clusters working off the same ObjectStore. So it provides workload isolation."
"Vertica is a columnar database where the query performance is extremely fast and it can be used for real-time integrations for API and other applications. The solution requires zero maintenance which is helpful."
"It's the fastest database I have ever tested. That's the most important feature of Vertica."
"The feature I like best is performance. We use Red Tool and Red Job for the data warehouse and reporting. It's perfect. Performance is good, and it can return ad hoc queries very quickly. Of course, it's a cluster, so it's easy to scale."
 

Cons

"We experienced many issues when we started working with Hadoop 3.0 in the Cloudera 6.0 version, so there is a lot of things that need to improve."
"This is a very expensive solution."
"While the deployed product is generally functional, there are instances where it presents difficulties."
"There are better solutions out there that have more features than this one."
"It could be faster and more user-friendly."
"It would be useful if Cloudera had more tools like SQL Engines that offer the traditional relational database. We have to do a lot of work preparing the data outside Cloudera before getting it into the platform."
"The dashboard could be improved."
"The tool's ability to be deployed on a cloud model is an area of concern where improvements are required."
"Limitations in group by projections is where I would like to see an improvement."
"Metadata for database files scale okay, but metadata related to tables/columns/sequences must be stored on all nodes."
"I would personally like to see extended developer tooling suited to Vertica – think published PowerDesigner SQL dialect support."
"Promotion/marketing must be improved, even though it is a very useful product at very good price, it is not as "popular" as it should be."
"Very bad support, I would rate it two out of 10."
"Pricing could be more competitive."
"I have found that coding support could be simplified."
"Vertica seems to scale well, except for one use case where you are on a multi-node cluster. For example, if you had a nine-node cluster, one node goes down, then the eight nodes don't scale, because the absence of the node is very apparent, which is a problem. If you have nine nodes or multiple nodes, the whole idea is that if one of those nodes goes down, then you should not see an impact on the system if you have enough capacity. Even though we have enough capacity, you can still see the impact of the one node going down."
 

Pricing and Cost Advice

"The tool is not expensive."
"The solution is fairly expensive."
"The pricing must be improved."
"The price could be better for the product."
"I wouldn't recommend CDH to others because of its high cost."
"Cloudera Distribution for Hadoop is expensive, with support costs involved."
"Cloudera requires a license to use."
"The tool is expensive...For the SMB market or customers whose environments are not that complex and do not have multiple systems running, Cloudera might not be a good option."
"Start with license per 1TB. Starting from hundreds of TB there is unlimited licensing to be considered. Move historical data to HDFS/S3 which are significantly cheaper or even free."
"I think it's starting to get a little expensive. Open source products are starting to get more robust, so I think that's something that they need to start looking at in terms of licensing."
"The pricing and licensing depend on the size of your environment and the zone where you want to implement."
"The price of Vertica is less expensive than some competitors, such as Teradata."
"It's difficult today to compete with open-source solutions. In these areas, there is a lot of competition and the price of this solution is a bit pricy."
"From a cost perspective, the software is less than most of its competitors."
"The first TB is free and you can use all the Vertica features. After 1TB you have to pay for licensing. The product is worth it, but be aware of this condition, and plan. The compression ratio is explained in the documentation."
"Work with a vendor, if possible, and take advantage of more aggressive discounts at mid-fiscal year (April) and fiscal year-end (October).​"
report
Use our free recommendation engine to learn which NoSQL Databases solutions are best for your needs.
824,067 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
Financial Services Firm
23%
Computer Software Company
15%
Educational Organization
11%
Manufacturing Company
8%
Financial Services Firm
18%
Computer Software Company
18%
Manufacturing Company
8%
Energy/Utilities Company
5%
 

Company Size

By reviewers
Large Enterprise
Midsize Enterprise
Small Business
 

Questions from the Community

What do you like most about Cloudera Distribution for Hadoop?
The tool can be deployed using different container technologies, which makes it very scalable.
What is your experience regarding pricing and costs for Cloudera Distribution for Hadoop?
The tool is expensive. Overall, it's not a cheap software tool, and that is why only large enterprises who are mature enough and have an architecture that is complex enough opt for Cloudera, as its...
What needs improvement with Cloudera Distribution for Hadoop?
The tool doesn't support reporting, and relational databases are still the major source of reporting data. Apache Iceberg will be launched soon within the Cloudera cluster for analytical purposes. ...
What do you like most about Vertica?
Vertica is easy to use and provides really high performance, stability, and scalability.
What is your experience regarding pricing and costs for Vertica?
The solution is relatively cost-effective. Pricing and licensing are reasonable compared to other solutions.
What needs improvement with Vertica?
The product could improve by adding support for a wider variety of data types and enhancing features to better compete with other databases.
 

Also Known As

No data available
Micro Focus Vertica, HPE Vertica, HPE Vertica on Demand
 

Learn More

 

Overview

 

Sample Customers

37signals, Adconion,adgooroo, Aggregate Knowledge, AMD, Apollo Group, Blackberry, Box, BT, CSC
Cerner, Game Show Network Game, Guess by Marciano, Supercell, Etsy, Nascar, Empirix, adMarketplace, and Cardlytics.
Find out what your peers are saying about Cloudera Distribution for Hadoop vs. Vertica and other solutions. Updated: December 2024.
824,067 professionals have used our research since 2012.