Cloudera Distribution for Hadoop vs Neo4j Graph Database comparison

 

Comparison Buyer's Guide

Categories and Ranking

Cloudera Distribution for H...
Ranking in NoSQL Databases: 7th
Average Rating: 8.0
Reviews Sentiment: 6.4
Number of Reviews: 49
Ranking in other categories: Hadoop (2nd)

Neo4j Graph Database
Ranking in NoSQL Databases: 9th
Average Rating: 8.6
Reviews Sentiment: 7.9
Number of Reviews: 5
Ranking in other categories: none
 

Mindshare comparison

As of November 2024, in the NoSQL Databases category, the mindshare of Cloudera Distribution for Hadoop is 2.4%, down from 2.7% the previous year. The mindshare of Neo4j Graph Database is 3.6%, reported as up from 3.6% the previous year (both figures as published). Mindshare is calculated from PeerSpot user engagement data.
 

Featured Reviews

Shahan Rehman - PeerSpot reviewer
Can host multiple technologies and help businesses with their AI initiatives
The ease or difficulty of setting up the product depends on the customer environment where the tool is deployed. If a banking, industrial, or retail sector firm is taken into consideration, the deployment can range from simple to very complex, depending on the architecture, the size of the database maintained, and the applications to be hosted. For Cloudera Distribution for Hadoop, one has to go through the usual deployment process, as for any software product. You need separate environments before going into production: test and dev environments and a pre-production environment. You install and configure all the components in the test environment and then test them in the pre-production environment. Once UAT is done, you move them to the production environment. In general, it's a critical product deployed in a company.
AR
Easy to use and not so expensive
For first-time users, if you don't know much about the tool, I think you should go with a document-based DBMS tool. I had to use the tool because I learned about it in college. My advice to others is that they need to learn about the tool, nodes, and vertices and then purchase Neo4j Graph Database. It will be a little bit difficult for new users to know what is the meaning of a node, what vertices are, how to use it, or how an application can use it. The tool is easy for new users as it is an intuitive tool. I rate the tool a nine out of ten.

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"We had a data warehouse before all the data. We can process a lot more data structures."
"CDH has a wide variety of proprietary tools that we use, like Impala. So from that perspective, it's quite useful as opposed to something open-source. We get a lot of value from Cloudera's proprietary tools."
"In terms of scalability, if you have enough hardware you can scale out. Scalability doesn't have any issues."
"The most valuable feature is Kubernetes."
"We also really like the Cloudera community. You can have any question and will have your answer within a few hours."
"We experienced many issues when we started working with Hadoop 3.0 in the Cloudera 6.0 version, so there are a lot of things that need to improve. I believe they are working on that."
"The main advantage is the storage is less expensive."
"The tool's most interesting features are the distributed file system and unstructured data processing capability. Because we have a lot of unstructured data, like XML and social media logs, these features make it more valuable than the usual data warehousing solutions."
"The solution's best feature is how it differs from traditional SQL databases. It's hard to map people and find those near me in SQL, which requires long, complex queries. Neo4j Graph Database makes this easier with simpler queries. It also supports more data types, like JSON, which SQL doesn't."
"For now, the tool doesn't break down or stop, so it is quite stable."
"As a graph database, I am surprised at their performance and response time."
"Enables people to understand what the business problem is and how the technology helps."
"Creates the ability to visualize outputs."
 

Cons

"There are multiple bugs when we update."
"The governance aspect of the solution should be improved."
"The initial setup of Cloudera is difficult."
"The areas of improvement depend on the scale of the project. For banking customers, security features and an essential budget for commercial licenses would be the top priority. Data regulation could be the most crucial for a project with extensive data or an extra use case."
"The pricing needs to improve."
"We experienced many issues when we started working with Hadoop 3.0 in the Cloudera 6.0 version, so there are a lot of things that need to improve."
"The dashboard could be improved."
"Currently, we are using many other tools such as Spark and Blade Job to improve the performance."
"There are concerns about performance and whether the tool can necessarily scale to provide the solution."
"The tool could improve by having more resources, especially for Golang, which we use. It lacks good basic libraries and doesn't have an ORM (Object-Relational Mapping) tool, which many NoSQL databases have. We thought about building an ORM for the Neo4j Graph Database but are too busy."
"So far, we have not had any issues and are happy with the product in general."
"For me, when the tool was deployed on an on-premises model, it was a little bit difficult the first time."
 

Pricing and Cost Advice

"The price is very high. The solution is expensive."
"It is an expensive product."
"Compared with Oracle, Sybase, and SQL, it's cheaper. It's not expensive."
"The solution is expensive."
"I believe we pay for a three-year license."
"The tool is expensive...For the SMB market or customers whose environments are not that complex and do not have multiple systems running, Cloudera might not be a good option."
"The pricing must be improved."
"The product’s price varies from project to project."
"The solution is open source so that you can use it for free. They also offer an enterprise version with its billing. If your company is earning well, I suggest using the enterprise version. Otherwise, you can deploy it on your own cloud and pay based on usage."
"The tool is not expensive."
816,406 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
Financial Services Firm: 23%
Computer Software Company: 15%
Educational Organization: 10%
Manufacturing Company: 8%

Computer Software Company: 18%
Financial Services Firm: 15%
Energy/Utilities Company: 9%
University: 8%
 

Company Size

By reviewers
Large Enterprise
Midsize Enterprise
Small Business
No data available
 

Questions from the Community

What do you like most about Cloudera Distribution for Hadoop?
The tool can be deployed using different container technologies, which makes it very scalable.
What is your experience regarding pricing and costs for Cloudera Distribution for Hadoop?
The tool is expensive. Overall, it's not a cheap software tool, and that is why only large enterprises who are mature enough and have an architecture that is complex enough opt for Cloudera, as its...
What needs improvement with Cloudera Distribution for Hadoop?
The tool doesn't support reporting, and relational databases are still the major source of reporting data. Apache Iceberg will be launched soon within the Cloudera cluster for analytical purposes. ...
What is your experience regarding pricing and costs for Neo4j?
The solution is open source so that you can use it for free. They also offer an enterprise version with its billing. If your company is earning well, I suggest using the enterprise version. Otherwi...
What needs improvement with Neo4j Graph Database?
The tool could improve by having more resources, especially for Golang, which we use. It lacks good basic libraries and doesn't have an ORM (Object-Relational Mapping) tool, which many NoSQL databa...
What is your primary use case for Neo4j Graph Database?
We're building a social media platform, which is a great use case for the product. It helps connect people. For example, if we're friends on Facebook, I can get suggestions for people near me or re...
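
The friend-suggestion use case described in the last answer is a friends-of-friends traversal, which is the kind of query the reviewers say is awkward in SQL and short in a graph model. The following is a minimal sketch in plain Python, not Neo4j code: the `FRIENDS` data, the `suggest_friends` function, and the ranking by mutual-friend count are all assumptions made for illustration, and a real deployment would express the same pattern as a graph query against the database.

```python
from collections import Counter

# Hypothetical friendship graph: each person maps to the set of their friends.
# In a graph database these would be nodes connected by relationships.
FRIENDS = {
    "alice": {"bob", "carol"},
    "bob": {"alice", "dave"},
    "carol": {"alice", "dave", "erin"},
    "dave": {"bob", "carol"},
    "erin": {"carol"},
}


def suggest_friends(graph, person):
    """Return (candidate, mutual_friend_count) pairs for friends-of-friends.

    Candidates are people two hops away who are not already direct friends.
    Results are sorted by mutual-friend count (descending), then by name.
    """
    direct = graph.get(person, set())
    mutual_counts = Counter()
    for friend in direct:
        for candidate in graph.get(friend, set()):
            # Skip the person themselves and anyone already a friend.
            if candidate != person and candidate not in direct:
                mutual_counts[candidate] += 1
    return sorted(mutual_counts.items(), key=lambda kv: (-kv[1], kv[0]))


if __name__ == "__main__":
    print(suggest_friends(FRIENDS, "alice"))
```

Each step of the loop follows one relationship hop, so the work grows with the size of the person's neighborhood rather than with the whole dataset. A relational version would typically need a self-join on a friendship table for every hop.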
 


Sample Customers

37signals, Adconion, adgooroo, Aggregate Knowledge, AMD, Apollo Group, Blackberry, Box, BT, CSC
Walmart, Telenor, Wazoku, Adidas, Cerved, GameSys, eBay, Schleich, ICIJ, die Bayerisch, Megree, InfoJobs, LinkedIn
Find out what your peers are saying about Cloudera Distribution for Hadoop vs. Neo4j Graph Database and other solutions. Updated: October 2024.