Try our new research platform with insights from 80,000+ expert users

Databricks vs Informatica Data Engineering Streaming comparison

 

Comparison Buyer's Guide

Executive Summary
 

Categories and Ranking

Databricks
Ranking in Streaming Analytics
1st
Average Rating
8.2
Number of Reviews
82
Ranking in other categories
Data Science Platforms (1st)
Informatica Data Engineerin...
Ranking in Streaming Analytics
16th
Average Rating
8.0
Number of Reviews
1
Ranking in other categories
No ranking in other categories
 

Mindshare comparison

As of November 2024, in the Streaming Analytics category, the mindshare of Databricks is 14.0%, up from 9.6% compared to the previous year. The mindshare of Informatica Data Engineering Streaming is 1.2%, up from 0.8% compared to the previous year. It is calculated based on PeerSpot user engagement data.
Streaming Analytics
 

Featured Reviews

Dunstan Matekenya - PeerSpot reviewer
Jul 10, 2024
Process large-scale data sets and integrates with Apache Spark with notebook environment
I primarily use Databricks to process large-scale data sets with Apache Spark. My main use case is processing large data sets, such as 600 GB or 800 GB Databricks integrates natively with Apache Spark, which I use as a processing engine for large-scale datasets. This native integration is one of…
DK
May 16, 2024
Helps with real-time data processing and improves decision-making overall
We implement business intelligence solutions, including data warehousing tools, data integration to load data into warehouses, and then creating reports It improves decision-making overall for the company. Informatica is usually the tool for setting up the data, streaming the data into your data…

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"I like that Databricks is a unified platform that lets you do streaming and batch processing in the same place. You can do analytics, too. They have added something called Databricks SQL Analytics, allowing users to connect to the data lake to perform analytics. Databricks also will enable you to share your data securely. It integrates with your reporting system as well."
"Databricks allows me to automate the creation of a cluster, optimized for machine learning and construct AI machine learning models for the client."
"The Delta Lake data type has been the most useful part of this solution. Delta Lake is an opensource data type and it was implemented and invented by Databricks."
"The time travel feature is the solution's most valuable aspect."
"It's easy to increase performance as required."
"The solution is very simple and stable."
"Databricks gives you the flexibility of using several programming languages independently or in combination to build models."
"The tool helps with data processing and analytics with large-scale data or big data since it is associated with managing data at a large scale."
"It improves the performance."
 

Cons

"The product should provide more advanced features in future releases."
"The ability to customize our own pipelines would enhance the product, similar to what's possible using ML files in Microsoft Azure DevOps."
"The stability of the clusters or the instances of Databricks would be better if it was a much more stable environment. We've had issues with crashes."
"CI/CD needs additional leverage and support."
"The solution could be improved by adding a feature that would make it more user-friendly for our team. The feature is simple, but it would be useful. Currently, our team is more familiar with the language R, but Databricks requires the use of Jupyter Notebooks which primarily supports Python. We have tried using RStudio, but it is not a fully integrated solution. To fully utilize Databricks, we have to use the Jupyter interface. One feature that would make it easier for our team to adopt the Jupyter interface would be the ability to select a specific variable or line of code and execute it within a cell. This feature is available in other Jupyter Notebooks outside of Databricks and in our own IDE, but it is not currently available within Databricks. If this feature were added, it would make the transition to using Databricks much smoother for our team."
"Support for Microsoft technology and the compatibility with the .NET framework is somewhat missing."
"The query plan is not easy with Databrick's job level. If I want to tune any of the code, it is not easily available in the blogs as well."
"Would be helpful to have additional licensing options."
"Skill requirement is required. There is a learning curve."
 

Pricing and Cost Advice

"Licensing on site I would counsel against, as on-site hardware issues tend to really delay and slow down delivery."
"Price-wise, I would rate Databricks a three out of five."
"The solution is affordable."
"The solution requires a subscription."
"The cost is around $600,000 for 50 users."
"I do not exactly know the costs, but one of our clients pays between $100 USD and $200 USD monthly."
"It is an expensive tool. The licensing model is a pay-as-you-go one."
"We're charged on what the data throughput is and also what the compute time is."
Information not available
report
Use our free recommendation engine to learn which Streaming Analytics solutions are best for your needs.
814,763 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
Financial Services Firm
16%
Computer Software Company
12%
Manufacturing Company
9%
Healthcare Company
6%
Financial Services Firm
19%
Manufacturing Company
17%
Computer Software Company
16%
Healthcare Company
6%
 

Company Size

By reviewers
Large Enterprise
Midsize Enterprise
Small Business
No data available
 

Questions from the Community

Which do you prefer - Databricks or Azure Machine Learning Studio?
Databricks gives you the option of working with several different languages, such as SQL, R, Scala, Apache Spark, or Python. It offers many different cluster choices and excellent integration with ...
How would you compare Databricks vs Amazon SageMaker?
We researched AWS SageMaker, but in the end, we chose Databricks. Databricks is a Unified Analytics Platform designed to accelerate innovation projects. It is based on Spark so it is very fast. It...
Which would you choose - Databricks or Azure Stream Analytics?
Databricks is an easy-to-set-up and versatile tool for data management, analysis, and business analytics. For analytics teams that have to interpret data to further the business goals of their orga...
What needs improvement with Informatica Data Engineering Streaming?
Skill requirement is required. There is a learning curve.
What is your primary use case for Informatica Data Engineering Streaming?
We implement business intelligence solutions, including data warehousing tools, data integration to load data into warehouses, and then creating reports.
What advice do you have for others considering Informatica Data Engineering Streaming?
Overall, I would rate the solution an eight out of ten. Usually, Informatica is for big clients because of its pricing, and it also requires some skill sets. It requires investment into a proper da...
 

Also Known As

Databricks Unified Analytics, Databricks Unified Analytics Platform, Redash
Big Data Streaming, Informatica Intelligent Streaming, Intelligent Streaming
 

Overview

 

Sample Customers

Elsevier, MyFitnessPal, Sharethrough, Automatic Labs, Celtra, Radius Intelligence, Yesware
Jewelry TV
Find out what your peers are saying about Databricks, Amazon Web Services (AWS), Confluent and others in Streaming Analytics. Updated: October 2024.
814,763 professionals have used our research since 2012.