Try our new research platform with insights from 80,000+ expert users

Informatica Intelligent Data Management Cloud (IDMC) vs StreamSets comparison

 

Comparison Buyer's Guide

Executive Summary
 

Categories and Ranking

Informatica Intelligent Dat...
Ranking in Data Integration
3rd
Average Rating
7.8
Number of Reviews
180
Ranking in other categories
Data Quality (1st), Business Process Management (BPM) (5th), Business-to-Business Middleware (3rd), API Management (8th), Cloud Data Integration (3rd), Data Governance (2nd), Test Data Management (3rd), Cloud Master Data Management (MDM) Solutions (1st), Data Management Platforms (DMP) (2nd), Data Masking (2nd), Metadata Management (1st), Test Data Management Services (3rd), Product Information Management (PIM) (1st), Data Observability (2nd)
StreamSets
Ranking in Data Integration
9th
Average Rating
8.4
Number of Reviews
24
Ranking in other categories
No ranking in other categories
 

Featured Reviews

MP
Jul 24, 2024
Powerful tool to create data warehouse solution with accessibility and features
We use the solution for ETL It's a powerful tool to create data warehouse solutions for our clients. We take unstructured data and use Informatica Cloud to structure it in various ways, making it usable on a reporting front end. This enables clients to make informed business decisions based on…
JM
Mar 30, 2023
Enables us to create streams and pipelines that our analytics team can utilize to identify areas for improvement
We use StreamSets' ability to connect to enterprise data stores such as Kafka. It is easy and simple to connect enterprise data stores as long as we follow the documentation. We use StreamSets' ability to move data into the analytic platforms easily because we can use the template provided to extract data from the pipeline. Being able to use Transformer for Snowflake to design both simple and complex transformation logic is important because it helps us break out a live amount of data interfaces that can be understood by the analytics team and identify areas of improvement. As the Transformer for Snowflake operates as a serverless engine, we can reduce our costs as we no longer need to purchase servers. StreamSets enables us to create streams and pipelines that our analytics team can utilize to identify areas for improvement. Additionally, our marketing team can leverage the data generated from these reports to understand how we can integrate our products and services to benefit our brand. StreamSets' data drift resilience is effective and user-friendly. We can use templates or use them from scratch. Data drift resilience saves us around 35 percent of the time fixing duplicates. StreamSets has helped us break down data silos within our organization by providing a clear path forward and enhancing our productivity by breaking down a large amount of data that we can understand. StreamSets saved us around 40 percent of our time. We can use a small team using StreamSets to create data pipelines that would normally require an expert that costs around $500 per month. StreamSets helps us scale our operations because we understand the quality of the data we have and how we can integrate the data into our marketing needs.

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"The interface has a great look and feel, and the functionality is so easy."
"I like that Informatica MDM has robust matching technology. Informatica MDM is also porting the external Java applications for validations. I can consider that a must-have. It is also exposed to Rest API calls, and we can engage in real-time integrations with any third-party systems."
"It has improved our organization because it has made our data more reliable. Data is the most important asset these days, and in order to trust your data, you need these tools to make sure that your data is clean and reliable."
"The solution's initial setup is quite straightforward."
"It can automatically connect or associate business terms with various options, providing flexibility beyond general capabilities."
"The MDM solution is capable of integrating multiple systems, so it helped us to solve the purpose of centralizing the depository as well as the standardization of mass data. It takes away all the ambiguity around data integrity issues or all the process challenges which happen when every stage of a process uses a different source as master data."
"We can see all our information on a single screen."
"The interface is really good."
"The most valuable would be the GUI platform that I saw. I first saw it at a special session that StreamSets provided towards the end of the summer. I saw the way you set it up and how you have different processes going on with your data. The design experience seemed to be pretty straightforward to me in terms of how you drag and drop these nodes and connect them with arrows."
"The ability to have a good bifurcation rate and fewer mistakes is valuable."
"The scheduling within the data engineering pipeline is very much appreciated, and it has a wide range of connectors for connecting to any data sources like SQL Server, AWS, Azure, etc. We have used it with Kafka, Hadoop, and Azure Data Factory Datasets. Connecting to these systems with StreamSets is very easy."
"The UI is user-friendly, it doesn't require any technical know-how and we can navigate to social media or use it more easily."
"StreamSets data drift feature gives us an alert upfront so we know that the data can be ingested. Whatever the schema or data type changes, it lands automatically into the data lake without any intervention from us, but then that information is crucial to fix for downstream pipelines, which process the data into models, like Tableau and Power BI models. This is actually very useful for us. We are already seeing benefits. Our pipelines used to break when there were data drift changes, then we needed to spend about a week fixing it. Right now, we are saving one to two weeks. Though, it depends on the complexity of the pipeline, we are definitely seeing a lot of time being saved."
"Important features include that it comprises lots of functionality to connect data from various sources through connector availability, scheduling pipelines at any time, and integration with third-party and security solutions for encryption."
"It is a very powerful, modern data analytics solution, in which you can integrate a large volume of data from different sources. It integrates all of the data and you can design, create, and monitor pipelines according to your requirements. It is an all-in-one day data ops solution."
"The best thing about StreamSets is its plugins, which are very useful and work well with almost every data source. It's also easy to use, especially if you're comfortable with SQL. You can customize it to do what you need. Many other tools have started to use features similar to those introduced by StreamSets, like automated workflows that are easy to set up."
 

Cons

"The cloud version could be better in terms of pricing."
"The tool should provide a unified user interface to manage the data objects."
"The configurations could be better. It is a bit confusing because we must develop two tools when building a data model in Informatica MDM. Even though Informatica MDM is a single tool, we have our hub console plus the provisioning tool within that. Whatever data model we are building in the hub console, we have to develop it in the provisioning tool again. It is double the work to create a data model. We are also using external calls or the Java custom plans functions. This can be both positive and negative. Since MDM as a client does not support any complex validation, we have to depend on the external call or a Java call. Every time we deployed, the entire solution was impacted if something went wrong."
"Certain shortcomings in the product's UI make it an area where improvements are required."
"Its look and feel needs improvement. It has a lousy look and feel. Informatica PIM is designed specifically for the retail industry. They need to make sure that it is also applicable to all the other industries and verticals."
"The biggest pain point for us is the documentation. Typically, you have to go through knowledge forums and knowledge groups to find out about the syntax issues for interfacing with new products. Typically, you've got to deal with someone who has been through the pain before. Their documents are not really up to date with current innovations happening in the industry. As big as they are, you can't really expect it, but that's our pain point."
"It is more complicated to extract data using the product compared to Visio. The system could display the details on the screen."
"It would be great if the cloud version could match the full range of capabilities available on-premise."
"The monitoring visualization is not that user-friendly. It should include other features to visualize things, like how many records were streamed from a source to a destination on a particular date."
"I would like to see further improvement in the UI. In addition, upgrades are not automatic and they should be automated. Currently, we have to manually upgrade versions."
"We often faced problems, especially with SAP ERP. We struggled because many columns weren't integers or primary keys, which StreamSets couldn't handle. We had to restructure our data tables, which was painful. Also, pipeline failures were common, and data drifting wasn't addressed, which made things worse. Licensing was another issue we encountered."
"The software is very good overall. Areas for improvement are the error logging and the version history. I would like to see better, more detailed error logging information."
"Sometimes, when we have large amounts of data that is very efficiently stored in Hadoop or Kafka, it is not very efficient to run it through StreamSets, due to the lack of efficiency or the resources that StreamSets is using."
"Sometimes, it is not clear at first how to set up nodes. A site with an explanation of how each node works would be very helpful."
"There aren't enough hands-on labs, and debugging is also an issue because it takes a lot of time. Logs are not that clear when you are debugging, and you can only select a single source for a pipeline."
"We've seen a couple of cases where it appears to have a memory leak or a similar problem."
 

Pricing and Cost Advice

"I rate the product's pricing a five on a scale of one to ten, where one is cheap and ten is expensive."
"The product has a high price point."
"There is no doubt that it is very expensive, but the brand value comes at a cost. Other MDM solutions in the market that haven't proven themselves like Informatica are also pretty expensive. We need to understand that MDM itself is very expensive to implement. So, Informatica is also pretty expensive. I would rate it a two out of five for being pretty expensive."
"I have no idea what the price actually is. It is probably not going to be the cheapest, but it is a pretty stable and robust platform from the backend standpoint."
"You pay for this solution based on IPUs, Informatica Processing Units. This depends on how much data you process and how much memory you consume from the cloud provider, and you pay as you go."
"The price is very high and has become a big concern for our customers who require the solution in order for their business to function smoothly."
"Informatica Axon is a costly solution. I rate Informatica Axon a four out of ten for its pricing."
"Licensing is difficult to understand, but the team is always available to explain anything. They are very helpful."
"There are different versions of the product. One is the corporate license version, and the other one is the open-source or free version. I have been using the corporate license version, but they have recently launched a new open-source version so that anybody can create an account and use it. The licensing cost varies from customer to customer. I don't have a lot of input on that. It is taken care of by PMO, and they seem fine with its pricing model. It is being used enterprise-wide. They seem to have got a good deal for StreamSets."
"The pricing is good, but not the best. They have some customized plans you can opt for."
"We use the free version. It's great for a public, free release. Our stance is that the paid support model is too expensive to get into. They should honestly reevaluate that."
"Its pricing is pretty much up to the mark. For smaller enterprises, it could be a big price to pay at the initial stage of operations, but the moment you have the Seed B or Seed C funding and you want to scale up your operations and aren't much worried about the funds, at that point in time, you would need a solution that could be scaled."
"StreamSets is an expensive solution."
"It's not expensive because you pay per month, and the tasks you can perform with it are huge. It's reliable and cost-effective."
"The pricing is too fixed. It should be based on how much data you need to process. Some businesses are not so big that they process a lot of data."
"We are running the community version right now, which can be used free of charge."
report
Use our free recommendation engine to learn which Data Integration solutions are best for your needs.
801,394 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
Financial Services Firm
17%
Computer Software Company
13%
Manufacturing Company
10%
Insurance Company
6%
Financial Services Firm
17%
Computer Software Company
13%
Manufacturing Company
10%
Government
7%
 

Company Size

By reviewers
Large Enterprise
Midsize Enterprise
Small Business
 

Questions from the Community

How does Azure Data Factory compare with Informatica Cloud Data Integration?
Azure Data Factory is a solid product offering many transformation functions; It has pre-load and post-load transformations, allowing users to apply transformations either in code by using Power Q...
Which Informatica product would you choose - PowerCenter or Cloud Data Integration?
Complex transformations can easily be achieved using PowerCenter, which has all the features and tools to establish a real data governance strategy. Additionally, PowerCenter is able to manage huge...
What are the biggest benefits of using Informatica Cloud Data Integration?
When it comes to cloud data integration, this solution can provide you with multiple benefits, including: Overhead reduction by integrating data on any cloud in various ways Effective integration ...
What do you like most about StreamSets?
The best thing about StreamSets is its plugins, which are very useful and work well with almost every data source. It's also easy to use, especially if you're comfortable with SQL. You can customiz...
What needs improvement with StreamSets?
We often faced problems, especially with SAP ERP. We struggled because many columns weren't integers or primary keys, which StreamSets couldn't handle. We had to restructure our data tables, which ...
What is your primary use case for StreamSets?
StreamSets is used for data transformation rather than ETL processes. It focuses on transforming data directly from sources without handling the extraction part of the process. The transformed data...
 

Also Known As

ActiveVOS, Active Endpoints, BPM, Address Verification, Persistent Data Masking, Cloud Test Data Management, PIM, , Enterprise Data Catalog, Data Integration Hub, Cloud Data Integration, Data Quality, Cloud API and App Integration
No data available
 

Learn More

Video not available
 

Overview

 

Sample Customers

The Travel Company, Carbonite
Availity, BT Group, Humana, Deluxe, GSK, RingCentral, IBM, Shell, SamTrans, State of Ohio, TalentFulfilled, TechBridge
Find out what your peers are saying about Informatica Intelligent Data Management Cloud (IDMC) vs. StreamSets and other solutions. Updated: September 2024.
801,394 professionals have used our research since 2012.