

Cloudera Distribution for Hadoop and MarkLogic are solutions in the Big Data and database management sectors. Cloudera shows an advantage in scalability and integration, while MarkLogic provides strong data handling capabilities for enterprises needing advanced data management.
Features: Cloudera Distribution for Hadoop offers robust scalability, a comprehensive ecosystem, and strong capabilities for big data analytics. MarkLogic focuses on agility, semantic search, and the ability to manage complex datasets natively, beneficial for rapid data processing and transformation.
Ease of Deployment and Customer Service: Cloudera benefits from extensive community support and documentation, which aids businesses with existing Hadoop infrastructure. MarkLogic provides innovative deployment options and superior customer service, simplifying setup for companies prioritizing customer service.
Pricing and ROI: Cloudera offers a cost-effective solution with its open-source nature, supporting gradual ROI. MarkLogic presents a higher investment but tends to deliver quicker ROI for enterprises focused on complex data operations.
For example, by using MarkLogic to handle semi-structured data directly, I have reduced ETL prep and transformation time by roughly 30 to 40 percent, freeing up engineers to focus on more value-added tasks instead of manual data cleaning.
This led to roughly a thirty to forty percent reduction in backend development effort.
Ultimately, it reduced development complexity and effort noticeably, especially by eliminating the need to manage multiple systems.
The technical support is quite good and better than IBM.
Customer support for MarkLogic provides strong enterprise-level assistance through direct interactions.
MarkLogic support has enterprise-grade support, including ticketing systems and dedicated support channels for customers.
I would rate MarkLogic's customer support an eight due to its responsiveness, especially for higher priority issues.
In production, when you get to know that your data is increasing and you need to add one more node, that is not easy and not straightforward.
MarkLogic is highly scalable and supports horizontal scaling through its clustered architecture.
MarkLogic is designed to scale horizontally, which means you can add more nodes to the cluster to handle increased data and query load.
We faced challenges but overcame those challenges successfully.
It supports ACID transactions, which ensure data consistency and reliability.
The built-in replication and failover features also help maintain uptime, ensuring the system stays operational even during maintenance or updates.
It can be used in different environments and is designed for enterprise use cases involving large volumes of data and complex queries.
Integrating with Active Directory, managing security, and configuration are the main concerns.
You do not need to worry about maintaining your own servers or provisioning your own servers. You simply log in and tell MarkLogic you want a certain number of clusters or nodes in a cluster and what cloud provider you want to use, then click okay, and they will build it for you.
There is a steep learning curve for this technology; XQuery and internal concepts such as indexing and CTS queries take time to learn compared to more common databases such as MongoDB.
Cost and licensing can be a consideration, especially for smaller teams or startups compared to open-source alternatives.
It can be deployed on-premises, unlike competitors' cloud-only solutions.
The initial setup cost is moderate to high, mainly due to infrastructure provisioning, licensing costs, and initial configuration and onboarding efforts.
MarkLogic is quite costly, and they are looking to move away in the longer run for that reason.
MarkLogic follows a licensing model that can be relatively higher compared to open-source databases, making cost an important factor for smaller teams.
This is the only solution that is possible to install on-premise.
It has a very rich search and cts APIs to build search engines on large datasets.
I personally appreciate the built-in search feature because it indexes all data immediately upon ingestion for rapid searching, so we can perform full-text, phrase, or geospatial searches.
MarkLogic provides a Google search-like capability, including full-text search, partial matching, and relevance scoring.
| Product | Mindshare (%) |
|---|---|
| Cloudera Distribution for Hadoop | 4.5% |
| MarkLogic | 2.5% |
| Other | 93.0% |
| Company Size | Count |
|---|---|
| Small Business | 16 |
| Midsize Enterprise | 9 |
| Large Enterprise | 31 |
| Company Size | Count |
|---|---|
| Small Business | 2 |
| Midsize Enterprise | 4 |
| Large Enterprise | 8 |
We monitor all NoSQL Databases reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.