

Find out in this report how the two Hadoop solutions compare in terms of features, pricing, service and support, easy of deployment, and ROI.
We do not feel we're getting value for the investment due to the additional resources needed for integration and maintenance.
I have seen a return on investment with SAP HANA as it typically delivers measurable ROI, especially in finance and month-end close processes.
SAP HANA is a cost-saving solution that helps save money and serves as a time-saving solution that helps save time.
I have received support via newsgroups or guidance on specific discussions, which is what I would expect in an open-source situation.
I would rate the technical support of Apache Spark an eight because when we had questions, we found solutions, and it was straightforward.
The community support is better than the official SAP support.
While issues are resolved eventually, the first level of support is not as good as we would like.
When we raise a ticket for AWS or Azure issues, we find their support is very proactive, while SAP's support is quite passive.
SAP HANA handles large data volumes efficiently, allowing for adding more modules or users without performance issues.
Our operations have grown from a hundred data operations a day to as many in a couple of seconds.
It depends on what needs to change in the structure, as some entities cannot be changed and require a new implementation, while others can be modified.
Apache Spark resolves many problems in the MapReduce solution and Hadoop, such as the inability to run effective Python or machine learning algorithms.
Without a doubt, we have had some crashes because each situation is different, and while the prototype in my environment is stable, we do not know everything at other customer sites.
We recently faced customer data loss during the cluster handover or failover fallback.
We have not had any problems in the last seven to eight years.
Regarding stability, they are using legacy systems and have implemented SAP HANA.
Various tools like Informatica, TIBCO, or Talend offer specific aspects, licensing can be costly;
I find that there really lacks the technical depth to do any recommendations for future updates of Apache Spark.
The main issue is the ecosystem, which lacks the widespread support that SQL enjoys.
The setup process and deployment process for SAP HANA is complex.
Licensing costs with SAP HANA are very high.
It's a recurring subscription model, which is expensive compared to legacy systems with just a maintenance fee.
SAP is not a cheap company, and its licenses are expensive.
I would rate the price for SAP HANA as high.
Not all solutions can make this data fast enough to be used, except for solutions such as Apache Spark Structured Streaming.
The most important part is that everything can be connected, and the data exchange across overseas connections is fast and reliable.
The solution is beneficial in that it provides a base-level long-held understanding of the framework that is not variant day by day, which is very helpful in my prototyping activity as an architect trying to assess Apache Spark, Great Expectations, and Vault-based solutions versus those proposed by clients like TIBCO or Informatica.
This architecture allows for faster data processing and real-time analytics that were not possible with traditional databases.
The concept enhances speed, allowing the database to serve and move data quickly.
One of our dashboards using Excelsius was previously developed on normal BW on Oracle data, which took 10 minutes to open. After developing the same calculation views using those tables and replacing them with calculation views in Excelsius, the dashboard opened in seconds.

| Company Size | Count |
|---|---|
| Small Business | 28 |
| Midsize Enterprise | 16 |
| Large Enterprise | 32 |
| Company Size | Count |
|---|---|
| Small Business | 26 |
| Midsize Enterprise | 12 |
| Large Enterprise | 62 |
Apache Spark is a leading open-source processing tool known for scalability and speed in managing large datasets. It supports both real-time and batch processing and is widely used for building data pipelines, machine learning applications, and analytics.
Apache Spark's strengths lie in its ability to process large data volumes efficiently through real-time and batch capabilities. With in-memory computation, it ensures fast data processing and significant performance gains. Its wide range of APIs, including those for machine learning, SQL, and analytics, make it versatile in handling complex data operations. While popular for ease of use and fault tolerance, Spark's management, debugging, and user-friendliness could benefit from improvements. Better GUIs, integration with BI tools, and enhanced monitoring are desired, alongside shuffling optimization and compatibility with more programming languages.
What are Apache Spark's key features?Organizations use Apache Spark predominantly for in-memory data processing, enabling seamless integration with big data frameworks. It's applied in security analytics, predictive modeling, and helps facilitate secure data transmissions in AI deployments. Industries leverage Spark's speed for sentiment analysis, data integration, and efficient ETL transformations.
SAP HANA is a high-speed in-memory data platform providing real-time analytics and quick data retrieval while supporting enterprise scalability and integration.
Designed for efficient data processing, SAP HANA offers robust security, ease of use, and advanced analytics through an architecture that ensures flexibility and stability. This allows seamless integration with third-party systems, significantly improving data management and decision-making capabilities. Enterprises benefit from data modeling, compression, comprehensive dashboards, and real-time insights. However, users have noted high costs, speed, and scalability issues, as well as complexity in interface and customizations. There's also a need for better integration with third-party tools, clearer documentation, and enhanced connectivity with Java and other platforms.
What are the key features of SAP HANA?SAP HANA is deployed for enterprise resource planning across industries, notably in finance, HR, and sales. It supports real-time analytics, business intelligence, and integrates with SAP ERP and S/4HANA, offering tailored solutions for sectors like supply chain management, CRM, procurement, and financial analysis.
We monitor all Hadoop reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.