
Informatica Intelligent Data Management Cloud (IDMC) vs StreamSets comparison

 

Comparison Buyer's Guide

Executive Summary
Updated on Dec 19, 2024

 

Categories and Ranking

Informatica Intelligent Dat...
Ranking in Data Integration: 3rd
Average Rating: 7.8
Reviews Sentiment: 6.8
Number of Reviews: 182
Ranking in other categories: Data Quality (1st), Business Process Management (BPM) (6th), Business-to-Business Middleware (3rd), API Management (7th), Cloud Data Integration (3rd), Data Governance (2nd), Test Data Management (3rd), Cloud Master Data Management (MDM) Solutions (1st), Data Management Platforms (DMP) (1st), Data Masking (2nd), Metadata Management (1st), Test Data Management Services (2nd), Product Information Management (PIM) (1st), Data Observability (2nd)

StreamSets
Ranking in Data Integration: 10th
Average Rating: 8.4
Reviews Sentiment: 7.1
Number of Reviews: 20
Ranking in other categories: none
 

Mindshare comparison

As of February 2025, in the Data Integration category, the mindshare of Informatica Intelligent Data Management Cloud (IDMC) is 5.0%, down from 7.8% the previous year. The mindshare of StreamSets is 1.6%, up from 1.2% the previous year. Mindshare is calculated from PeerSpot user engagement data.
 

Featured Reviews

Raj Sethupathi - PeerSpot reviewer
Offers profiling and address standardization but can be complicated
Informatica Data Quality has its own data warehouse, primarily using Oracle and some SQL databases. You need a database to host the data. The cleansed version of the data is stored in the data warehouse. It integrates with PowerCenter and other Informatica tools. The integration details can be complex, but a regional setup is involved in this process. Profiling smaller datasets, such as 10,000-50,000 records, worked fine. However, unexpected issues could arise with larger datasets, such as thousands of records or more, especially with tables containing many columns. Handling tables with fifty or more columns can be challenging, even in Excel. A mismatch in data types could cause the entire system to crash. Continual enhancements are being made to address these issues, which can be unique to specific industries like finance and healthcare.
Reyansh Kumar - PeerSpot reviewer
We no longer need to hire highly skilled data engineers to create and monitor data pipelines
The things I like about StreamSets are its overall user interface, efficiency, and product features, which are all good. Also, the scheduling within the data engineering pipeline is very much appreciated, and it has a wide range of connectors for connecting to any data sources like SQL Server, AWS, Azure, etc. We have used it with Kafka, Hadoop, and Azure Data Factory Datasets. Connecting to these systems with StreamSets is very easy. You just need to configure the data sources, the paths and their configurations, and you are ready to go. It is very efficient and very easy to use for ETL pipelines. It is a GUI-based interface in which you can easily create or design your own data pipelines with just a few clicks. As for moving data into modern analytics systems, we are using it with Microsoft Power BI, AWS, and some on-premises solutions, and it is very easy to get data from StreamSets into them. No hardcore coding or special technical expertise is required. It is also a no-code platform in which you can configure your data sources and data output for easy configuration of your data pipeline. This is a very important aspect because if a tool requires code development, we need to hire software developers to get the task done. By using StreamSets, it can be done with a few clicks.

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"I do quite a lot of data transformations, and the fact that I can do them without changing any of my SQL queries from the code, using the inbuilt tools, is very helpful."
"I like that Informatica MDM has robust matching technology. Informatica MDM is also porting the external Java applications for validations. I can consider that a must-have. It is also exposed to REST API calls, and we can engage in real-time integrations with any third-party systems."
"It is one of the best tools available for data integration."
"It is a scalable solution. Scalability-wise, I rate the solution a nine out of ten."
"I have rated the stability a ten out of ten due to a high level of satisfaction."
"The Mapping Designer allows for declarative ETL development (visual scripting) that leverages a wide array of different transformations."
"I know that there are two good features, APN and ServiceNow but we haven't explored all of its features yet."
"The solution is applicable for both technical and business users."
"What I love the most is that StreamSets is very light. It's a containerized application. It's easy to use with Docker. If you are a large organization, it's very easy to use Kubernetes."
"The most valuable feature is the pipelines because they enable us to pull in and push out data from different sources and to manipulate and clean things up within them."
"The most valuable features are the option of integration with a variety of protocols, languages, and origins."
"The ease of configuration for pipes is amazing. It has a lot of connectors. Mainly, we can do everything with the data in the pipe. I really like the graphical interface too."
"It is really easy to set up and the interface is easy to use."
"I really appreciate the numerous ready connectors available on both the source and target sides, the support for various media file formats, and the ease of configuring and managing pipelines centrally."
"The most valuable would be the GUI platform that I saw. I first saw it at a special session that StreamSets provided towards the end of the summer. I saw the way you set it up and how you have different processes going on with your data. The design experience seemed to be pretty straightforward to me in terms of how you drag and drop these nodes and connect them with arrows."
"For me, the most valuable features in StreamSets have to be the Data Collector and Control Hub, but especially the Data Collector. That feature is very elegant and seamlessly works with numerous source systems."
 

Cons

"This solution is hard to set up and its interface is not user-friendly. It's also not as stable, and the technical support takes a lot of time to solve simple problems."
"The tool should provide a unified user interface to manage the data objects."
"The UI is terrible and not user-friendly."
"They could improve technical support because it is not good enough at the moment."
"It would be helpful if there was a GenAI feature integrated into the system, especially regarding the data quality."
"Its pricing model can be improved. The response time from technical support can also be improved."
"Once the data is masked, we won't be able to reverse it back to its original value."
"Enhancements on the UI front, such as multiple templates and improved grid views, would be beneficial."
"We often faced problems, especially with SAP ERP. We struggled because many columns weren't integers or primary keys, which StreamSets couldn't handle. We had to restructure our data tables, which was painful. Also, pipeline failures were common, and data drifting wasn't addressed, which made things worse. Licensing was another issue we encountered."
"Sometimes, it is not clear at first how to set up nodes. A site with an explanation of how each node works would be very helpful."
"The logging mechanism could be improved. If I am working on a pipeline, then create a job out of it and it is running, it will generate constant logs. So, the logging mechanism could be simplified. Now, it is a bit difficult to understand and filter the logs. It takes some time."
"StreamSets works great for batch processing but we are looking for something that is more real-time. We need latency in numbers below milliseconds."
"One area for improvement could be the cloud storage server speed, as we have faced some latency issues here and there."
"The software is very good overall. Areas for improvement are the error logging and the version history. I would like to see better, more detailed error logging information."
"Currently, we can only use the query to read data from SAP HANA. What we would like to see, as soon as possible, is the ability to read from multiple tables from SAP HANA. That would be a really good thing that we could use immediately. For example, if you have 100 tables in SQL Server or Oracle, then you could just point it to the schema or the 100 tables and ingest the information. However, you can't do that in SAP HANA since StreamSets currently is lacking in this. They do not have a multi-table feature for SAP HANA. Therefore, a multi-table origin for SAP HANA would be helpful."
"The design experience is the bane of our existence because their documentation is not the best. Even when they update their software, they don't publish the best information on how to update and change your pipeline configuration to make it conform to current best practices. We don't pay for the added support. We use the 'freeware version.' The user community, as well as the documentation they provide for the standard user, are difficult, at best."
 

Pricing and Cost Advice

"The solution's pricing model is easy, but it is very expensive."
"It's pretty high for us. It's more on the higher side, like low to middle high."
"The pricing is quite flexible."
"The pricing is high compared to other tools on the market."
"The product is very expensive."
"Informatica Cloud Data Integration is famously known for its high price. The vendor targets large enterprises, and not medium or small companies. These large companies, and organizations, handle large amounts of data. If you go into any large bank, such as American or Canadian banks, these banks use this solution because it is more reliable, secure, and has more functionality."
"Our customers sometimes are able to negotiate a much better price for Informatica Cloud Data Integration based on their relationship with the vendor."
"Informatica MDM's price tag should come down. They have to cut some costs."
"Its pricing is pretty much up to the mark. For smaller enterprises, it could be a big price to pay at the initial stage of operations, but the moment you have the Seed B or Seed C funding and you want to scale up your operations and aren't much worried about the funds, at that point in time, you would need a solution that could be scaled."
"There are two editions, Professional and Enterprise, and there is a free trial. We're using the Professional edition and it is competitively priced."
"We are running the community version right now, which can be used free of charge."
"StreamSets is an expensive solution."
"The licensing is expensive, and there are other costs involved too. I know from using the software that you have to buy new features whenever there are new updates, which I don't really like. But initially, it was very good."
"We use the free version. It's great for a public, free release. Our stance is that the paid support model is too expensive to get into. They should honestly reevaluate that."
"It's not so favorable for small companies."
"StreamSets Data Collector is open source. One can utilize the StreamSets Data Collector, but the Control Hub is the main repository where all the jobs are present. Everything happens in Control Hub."
838,713 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
Financial Services Firm: 17%
Computer Software Company: 12%
Manufacturing Company: 10%
Government: 6%

Financial Services Firm: 17%
Computer Software Company: 11%
Manufacturing Company: 10%
Insurance Company: 8%
 

Company Size

By reviewers: Large Enterprise, Midsize Enterprise, Small Business
 

Questions from the Community

How does Azure Data Factory compare with Informatica Cloud Data Integration?
Azure Data Factory is a solid product offering many transformation functions. It has pre-load and post-load transformations, allowing users to apply transformations either in code by using Power Q...
Which Informatica product would you choose - PowerCenter or Cloud Data Integration?
Complex transformations can easily be achieved using PowerCenter, which has all the features and tools to establish a real data governance strategy. Additionally, PowerCenter is able to manage huge...
What are the biggest benefits of using Informatica Cloud Data Integration?
When it comes to cloud data integration, this solution can provide you with multiple benefits, including: overhead reduction by integrating data on any cloud in various ways; effective integration ...
What do you like most about StreamSets?
The best thing about StreamSets is its plugins, which are very useful and work well with almost every data source. It's also easy to use, especially if you're comfortable with SQL. You can customiz...
What needs improvement with StreamSets?
We often faced problems, especially with SAP ERP. We struggled because many columns weren't integers or primary keys, which StreamSets couldn't handle. We had to restructure our data tables, which ...
What is your primary use case for StreamSets?
StreamSets is used for data transformation rather than ETL processes. It focuses on transforming data directly from sources without handling the extraction part of the process. The transformed data...
 

Also Known As

ActiveVOS, Active Endpoints, BPM, Address Verification, Persistent Data Masking, Cloud Test Data Management, PIM, Enterprise Data Catalog, Data Integration Hub, Cloud Data Integration, Data Quality, Cloud API and App Integration
No data available
 


Sample Customers

The Travel Company, Carbonite
Availity, BT Group, Humana, Deluxe, GSK, RingCentral, IBM, Shell, SamTrans, State of Ohio, TalentFulfilled, TechBridge
Find out what your peers are saying about Informatica Intelligent Data Management Cloud (IDMC) vs. StreamSets and other solutions. Updated: February 2025.