Try our new research platform with insights from 80,000+ expert users

Databricks vs Informatica PowerCenter comparison

Sponsored
 

Comparison Buyer's Guide

Executive SummaryUpdated on Oct 8, 2024
 

Categories and Ranking

IBM SPSS Statistics
Sponsored
Average Rating
8.0
Reviews Sentiment
6.9
Number of Reviews
37
Ranking in other categories
Data Mining (3rd), Data Science Platforms (9th)
Databricks
Average Rating
8.2
Reviews Sentiment
7.0
Number of Reviews
84
Ranking in other categories
Data Science Platforms (1st), Streaming Analytics (1st)
Informatica PowerCenter
Average Rating
8.0
Reviews Sentiment
6.9
Number of Reviews
80
Ranking in other categories
Data Integration (2nd), Data Visualization (8th)
 

Mindshare comparison

Data Science Platforms
Data Integration
 

Featured Reviews

Md Masudul Hassan - PeerSpot reviewer
Comprehensive data analysis capabilities with a user-friendly interface, providing an efficient and reliable platform for researchers and analysts
I believe that offering short-term SPSS licenses, perhaps when customer sourcing is available, could make it more affordable. These licenses shouldn't include features tailored for universities or large sales organizations. Instead, they could offer discounts or additional facilities for smaller entities to access the software. In developing countries, it would be beneficial to provide certain features to users at no cost initially, while also customizing pricing options. For example, offering basic features to the first hundred users can help them become familiar with the software and its capabilities. This approach encourages users to upgrade to higher tiers as they become more experienced and require additional functionality.
Dunstan Matekenya - PeerSpot reviewer
Process large-scale data sets and integrates with Apache Spark with notebook environment
Databricks integrates natively with Apache Spark, which I use as a processing engine for large-scale datasets. This native integration is one of its strengths. Another strength is that the platform makes it very easy to manage resources. For example, setting up a cluster of five or fifteen nodes is straightforward with Databricks. The notebook environment is also excellent, making it easy to perform various tasks.
Lars Borchers - PeerSpot reviewer
A stable and reliable product that provides a variety of features for data integration
The solution is not for newcomers. It has an old touch. The solution must improve the integration with new services. It was part of the program at Informatica when they moved to their cloud platform. It is integrated. However, from an on-premise perspective, we need to buy licenses for PowerExchange. If we want a native driver to access a special service, we need to extend our license to those services. It is expensive. I don't like that it's not all included in the solution.

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"The features that I have found most valuable are the Bayesian statistics and descriptive statistics."
"The solution has numerous valuable features. We particularly like custom tabs. It's very useful. We end up analyzing a lot of software data, so features related to custom tabs are really helpful."
"Since we are using the software as a statistical tool, I would say the best aspects of it are the regression and segmentation capabilities. That said, I've used it for all sorts of things."
"The best part is that they have an algorithm handbook, so you can open it up and understand how it works, and if it is useful, this is very important."
"IBM SPSS Statistics depends on AI."
"Some of the most valuable features that we are using with some business models are machine learning algorithms, statistical models given to us by the business, and getting data from the database or text files."
"The most valuable features mainly include factor analysis, correlation analysis, and geographic analysis."
"In terms of the features I've found most valuable, I'd say the duration, the correlation, and of course the nonparametric statistics. I use it for reliability and survival analysis, time series, regression models in different solutions, and different types of solutions."
"I work in the data science field and I found Databricks to be very useful."
"The fast data loading process and data storage capabilities are great."
"When we have a huge volume of data that we want to process with speed, velocity, and volume, we go through Databricks."
"I like how easy it is to share your notebook with others. You can give people permission to read or edit. I think that's a great feature. You can also pull in code from GitHub pretty easily. I didn't use it that often, but I think that's a cool feature."
"Automation with Databricks is very easy when using the API."
"It helps integrate data science and machine learning capabilities."
"One of the features provides nice interactive clusters, or compute instances that you don't really need to manage often."
"The technical support is good."
"It provides monitoring and we can therefore be aware of what is happening when we are handling jobs."
"The partitioning and optimization to help enhance our development is a very valuable aspect of Informatica PowerCenter."
"It's a complete package, which is why we use this solution."
"Enterprise-scale ETL solution that's very stable and is easy to scale. It integrates and connects with multiple new systems, both structured and semi-structured."
"The most valuable features are the monitoring tools and the reporting manager."
"It is easy to use, and it is quick for developing things. It is fairly powerful, and it can integrate with a lot of different platforms without much hassle."
"The product's initial setup phase is very easy."
"It is an excellent ETL tool."
 

Cons

"If there is any self-generation data collection plan (DCP), it would be helpful in gathering data. It would also be useful if there is a function to scale it up to, let's say, UiPath and have it consolidate and integrate into a UiPath solution."
"Improvements are needed in the user interface, particularly in terms of user-friendliness."
"This solution is not suitable for use with Big Data."
"There is a learning curve; it's not very steep, but there is one."
"The solution needs to improve forecasting using time series analysis."
"In some cases, the product takes time to load a large dataset. They could improve this particular area."
"The solution needs more planning tools and capabilities."
"Each algorithm could be more adaptable to some industry-specific areas, or, in some cases, adapted for maintenance."
"The integration of data could be a bit better."
"The query plan is not easy with Databrick's job level. If I want to tune any of the code, it is not easily available in the blogs as well."
"I would like to see the integration between Databricks and MLflow improved. It is quite hard to train multiple models in parallel in the distributed fashions. You hit rate limits on the clients very fast."
"It would be great if Databricks could integrate all the cloud platforms."
"Databricks is not geared towards the end-user, but rather it is for data engineers or data scientists."
"Anyone who doesn't know SQL may find the product difficult to work with."
"Pricing is one of the things that could be improved."
"Databricks may not be as easy to use as other tools, but if you simplify a tool too much, it won't have the flexibility to go in-depth. Databricks is completely in the programmer's hands. I prefer flexibility rather than simplicity."
"The developer tool documentation can be enhanced with a more clear explanation of each utility, accompanied by relevant examples, so that developers are able to create programs with ease."
"What I didn't like about it is that the platform itself is not great at distributed processing. When you need high parallel processing, it has some inherent issues. We had to use Java transformation, and it did not go very well. I have heard that it is going to the cloud, but we haven't tried that."
"The UI is a little outdated."
"There is a need to buy a separate license if one wishes to connect with some kind of SAP system, such as SalesForce."
"This product is going to decommission in the next couple of years."
"The solution could have better documentation on basic steps or blocks that specify what to do."
"Informatica PowerCenter could improve by having a single interface because half of the system is still in the legacy interface and many other elements are moved to the developer client. It would be good if there was a single interface for the end user and developers."
"As a connector to big data, it is not well developed. We've had problems connecting Informatica with Hadoop. The functionality to connect Informatica with Hadoop, for me it's not good."
 

Pricing and Cost Advice

"SPSS is an expensive piece of software because it's incredibly complex and has been refined over decades, but I would say it's fairly priced."
"I rate the tool's pricing a five out of ten."
"While the pricing of the product may be higher, the accompanying service and features justify the investment."
"If it requires lot of data processing, maybe switching to IBM SPSS Clementine would be better for the buyer."
"The pricing of the modeler is high and can reduce the utility of the product for those who can not afford to adopt it."
"The price of IBM SPSS Statistics could improve."
"It's quite expensive, but they do a special deal for universities."
"More affordable training for new staff members."
"Licensing on site I would counsel against, as on-site hardware issues tend to really delay and slow down delivery."
"I would rate Databricks' pricing seven out of ten."
"We only pay for the Azure compute behind the solution."
"The licensing costs of Databricks depend on how many licenses we need, depending on which Databricks provides a lot of discounts."
"The solution is affordable."
"We have only incurred the cost of our AWS cloud services. This is because during this period, Databricks provided us with an extended evaluation period, and we have not spent much money yet. We are just starting to incur costs this month, I will know more later on the full cost perspective."
"I rate the price of Databricks as eight out of ten."
"The solution uses a pay-per-use model with an annual subscription fee or package. Typically this solution is used on a cloud platform, such as Azure or AWS, but more people are choosing Azure because the price is more reasonable."
"I rate the solution's pricing a four out of ten. The price is very high, and it doesn't understand the market now."
"The licensing fees are paid on a yearly basis."
"It is for big enterprises. We have leveraged Informatica for big enterprises but not for small and medium enterprises because it is a very costly product as compared to other products. We propose this solution only for enterprise customers. For small to medium enterprises, we would propose the Microsoft solution. Its licensing is currently bundle-wise. It should be features-wise and not bundle-wise."
"Price-wise, it's more expensive than SSIS, but it's a better tool, so it has more features. Licensing is on a yearly basis."
"Compared to other tools, I think PowerCenter is a bit expensive. When I compare it to Oracle, if you want to use Oracle databases, you can easily get an ODI tool, so it's easier to handle. Informatica is a standalone tool—it's an independent company—and there are no databases around them, so it's quite expensive to use. Generally, large companies use PowerCenter because of the price. If companies want to expand their usage areas, they try to consider if it's easy to implement and easy to understand the pricing. I think the pricing is a barrier for Informatica."
"Licensing costs are excessive and pose an obstacle to someone who lacks familiarity with the solution and wishes to have a proper understanding of it."
"Pricing for Informatica PowerCenter isn't cheap, but if I compare it with IBM, it's as expensive as IBM, however, Informatica PowerCenter is more innovative, especially when compared to a giant company such as IBM that has thousands of products. Informatica PowerCenter is limited only to data management, but it has new features that come out every quarter. Points for ease of use and flexibility go to Informatica PowerCenter, but price-wise, IBM and Informatica are equal because they're both expensive."
"The pricing is a little expensive, but in the same range as IBM and other competitors."
report
Use our free recommendation engine to learn which Data Science Platforms solutions are best for your needs.
824,053 professionals have used our research since 2012.
 

Comparison Review

it_user90069 - PeerSpot reviewer
Feb 20, 2014
Informatica PowerCenter vs. Microsoft SSIS - each technology has its advantages but also have similarities
Technology has made it easier for businesses to organize and manipulate data to get a clearer picture of what’s going on with their business. Notably, ETL tools have made managing huge amounts of data significantly easier and faster, boosting many organizations’ business intelligence operations…
 

Top Industries

By visitors reading reviews
Financial Services Firm
17%
Computer Software Company
9%
University
8%
Manufacturing Company
8%
Financial Services Firm
16%
Computer Software Company
11%
Manufacturing Company
9%
Healthcare Company
6%
Financial Services Firm
18%
Computer Software Company
12%
Manufacturing Company
7%
Insurance Company
7%
 

Company Size

By reviewers
Large Enterprise
Midsize Enterprise
Small Business
 

Questions from the Community

What do you like most about IBM SPSS Statistics?
The software offers consistency across multiple research projects helping us with predictive analytics capabilities.
What is your experience regarding pricing and costs for IBM SPSS Statistics?
The cost of IBM SPSS Statistics is managed by organizations, not individual researchers. It is a very expensive produ...
What needs improvement with IBM SPSS Statistics?
IBM SPSS Statistics does not keep you close to your data like KNIME. In KNIME, at every stage, you can see the result...
Which do you prefer - Databricks or Azure Machine Learning Studio?
Databricks gives you the option of working with several different languages, such as SQL, R, Scala, Apache Spark, or ...
How would you compare Databricks vs Amazon SageMaker?
We researched AWS SageMaker, but in the end, we chose Databricks. Databricks is a Unified Analytics Platform designe...
Which would you choose - Databricks or Azure Stream Analytics?
Databricks is an easy-to-set-up and versatile tool for data management, analysis, and business analytics. For analyti...
How does Azure Data Factory compare with Informatica PowerCenter?
Azure Data Factory is flexible, modular, and works well. In terms of cost, it is not too pricey. It offers the stabil...
Which is better - SSIS or Informatica PowerCenter?
SSIS PowerPack is a group of drag and drop connectors for Microsoft SQL Server Integration Services, commonly called ...
Which Informatica product would you choose - PowerCenter or Cloud Data Integration?
Complex transformations can easily be achieved using PowerCenter, which has all the features and tools to establish a...
 

Also Known As

SPSS Statistics
Databricks Unified Analytics, Databricks Unified Analytics Platform, Redash
PowerCenter
 

Learn More

Video not available
 

Overview

 

Sample Customers

LDB Group, RightShip, Tennessee Highway Patrol, Capgemini Consulting, TEAC Corporation, Ironside, nViso SA, Razorsight, Si.mobil, University Hospitals of Leicester, CROOZ Inc., GFS Fundraising Solutions, Nedbank Ltd., IDS-TILDA
Elsevier, MyFitnessPal, Sharethrough, Automatic Labs, Celtra, Radius Intelligence, Yesware
University of Texas MD Anderson Cancer Center, LexisNexis, Rabobank
Find out what your peers are saying about Databricks vs. Informatica PowerCenter and other solutions. Updated: February 2023.
824,053 professionals have used our research since 2012.