Try our new research platform with insights from 80,000+ expert users

Informatica Intelligent Data Management Cloud (IDMC) vs StreamSets comparison

 

Comparison Buyer's Guide

Executive Summary
 

Categories and Ranking

Informatica Intelligent Dat...
Ranking in Data Integration
3rd
Average Rating
8.0
Reviews Sentiment
6.9
Number of Reviews
181
Ranking in other categories
Data Quality (1st), Business Process Management (BPM) (7th), Business-to-Business Middleware (3rd), API Management (7th), Cloud Data Integration (3rd), Data Governance (2nd), Test Data Management (3rd), Cloud Master Data Management (MDM) Solutions (1st), Data Management Platforms (DMP) (2nd), Data Masking (2nd), Metadata Management (1st), Test Data Management Services (4th), Product Information Management (PIM) (1st), Data Observability (2nd)
StreamSets
Ranking in Data Integration
9th
Average Rating
8.4
Reviews Sentiment
7.5
Number of Reviews
24
Ranking in other categories
No ranking in other categories
 

Featured Reviews

Raj Sethupathi - PeerSpot reviewer
Offers profiling and address standardization but can be complicated
Informatica Data Quality has its data warehouse, primarily using Oracle and some SQL databases. You need a database to host the data. The cleansed version of the data is stored in the data warehouse. It integrates with PowerCenter and other Informatica tools. The integration details can be complex, but a regional setup is involved in this process. Profiling smaller datasets, such as 10,000-50,000 records, worked fine. However, unexpected issues could arise with larger datasets, such as thousands of records or more, especially with tables containing many columns. Handling tables with fifty or more columns can be challenging, even in Excel. A mismatch in data types could cause the entire system to crash. Continual enhancements are being made to address these issues, which can be unique to specific industries like finance and healthcare.
Reyansh Kumar - PeerSpot reviewer
We no longer need to hire highly skilled data engineers to create and monitor data pipelines
The things I like about StreamSets are its * overall user interface * efficiency * product features, which are all good. Also, the scheduling within the data engineering pipeline is very much appreciated, and it has a wide range of connectors for connecting to any data sources like SQL Server, AWS, Azure, etc. We have used it with Kafka, Hadoop, and Azure Data Factory Datasets. Connecting to these systems with StreamSets is very easy. You just need to configure the data sources, the paths and their configurations, and you are ready to go. It is very efficient and very easy to use for ETL pipelines. It is a GUI-based interface in which you can easily create or design your own data pipelines with just a few clicks. As for moving data into modern analytics systems, we are using it with Microsoft Power BI, AWS, and some on-premises solutions, and it is very easy to get data from StreamSets into them. No hardcore coding or special technical expertise is required. It is also a no-code platform in which you can configure your data sources and data output for easy configuration of your data pipeline. This is a very important aspect because if a tool requires code development, we need to hire software developers to get the task done. By using StreamSets, it can be done with a few clicks.

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"It's a stable product without any bugs or glitches."
"Informatica Cloud Data Integration is stable."
"The most valuable feature of Informatica Axon is that it is flexible and user-friendly."
"It allows the creation of identical tasks that can be applied across multiple tables, ranging from 100 to even 1,000 tables."
"Performance and flexibility-wise, they're very user-friendly."
"The best thing about Informatica Axon is that it integrates with the Electronic Data Capture and the Axon system."
"The user interface which is very easy to use if we have any problems to solve."
"The feature of auto-onboarding of the assets, enterprise assets via EDC is good."
"Important features include that it comprises lots of functionality to connect data from various sources through connector availability, scheduling pipelines at any time, and integration with third-party and security solutions for encryption."
"The Ease of configuration for pipes is amazing. It has a lot of connectors. Mainly, we can do everything with the data in the pipe. I really like the graphical interface too"
"The most valuable features are the option of integration with a variety of protocols, languages, and origins."
"It is really easy to set up and the interface is easy to use."
"The ETL capabilities are very useful for us. We extract and transform data from multiple data sources, into a single, consistent data store, and then we put it in our systems. We typically use it to connect our Apache Kafka with data lakes. That process is smooth and saves us a lot of time in our production systems."
"StreamSets’ data drift resilience has reduced the time it takes us to fix data drift breakages. For example, in our previous Hadoop scenario, when we were creating the Sqoop-based processes to move data from source to destinations, we were getting the job done. That took approximately an hour to an hour and a half when we did it with Hadoop. However, with the StreamSets, since it works on a data collector-based mechanism, it completes the same process in 15 minutes of time. Therefore, it has saved us around 45 minutes per data pipeline or table that we migrate. Thus, it reduced the data transfer, including the drift part, by 45 minutes."
"I have used Data Collector, Transformer, and Control Hub products from StreamSets. What I really like about these products is that they're very user-friendly. People who are not from a technological or core development background find it easy to get started and build data pipelines and connect to the databases. They would be comfortable like any technical person within a couple of weeks."
"The most valuable feature is the pipelines because they enable us to pull in and push out data from different sources and to manipulate and clean things up within them."
 

Cons

"Data integration should be improved."
"Informatica is very expensive."
"Informatica Axon needs to improve its interface."
"Metadata querying is right now not there in Informatica Cloud Data Integration."
"The scalability is tough."
"Informatica Axon needs more integration connectors so that it can connect to systems and different kinds of datasets."
"Informatica MDM has a complex user interface, which could be improved."
"We'd like to see the microservices, which don't run yet because the solution is not yet fully cloud-based."
"The execution engine could be improved. When I was at their session, they were using some obscure platform to run. There is a controller, which controls what happens on that, but you should be able to easily do this at any of the cloud services, such as Google Cloud. You shouldn't have any issues in terms of how to run it with their online development platform or design platform, basically their execution engine. There are issues with that."
"In terms of the product, I don't think there is any room for improvement because it is very good. One small area of improvement that is very much needed is on the knowledge base side. Sometimes, it is not very clear how to set up a certain process or a certain node for a person who's using the platform for the first time."
"They need to improve their customer care services. Sometimes it has taken more than 48 hours to resolve an issue. That should be reduced. They are aware of small or generic issues, but not the more technical or deep issues. For those, they require some time, generally 48 to 72 hours to respond. That should be improved."
"The monitoring visualization is not that user-friendly. It should include other features to visualize things, like how many records were streamed from a source to a destination on a particular date."
"The logging mechanism could be improved. If I am working on a pipeline, then create a job out of it and it is running, it will generate constant logs. So, the logging mechanism could be simplified. Now, it is a bit difficult to understand and filter the logs. It takes some time."
"The documentation is inadequate and has room for improvement because the technical support does not regularly update their documentation or the knowledge base."
"Visualization and monitoring need to be improved and refined."
"Sometimes, when we have large amounts of data that is very efficiently stored in Hadoop or Kafka, it is not very efficient to run it through StreamSets, due to the lack of efficiency or the resources that StreamSets is using."
 

Pricing and Cost Advice

"It's offers value for money. They're more competitive with respect to pricing and offerings."
"Informatica Cloud Data Integration is famously known for its high price. The vendor targets large enterprises, and not medium or small companies. These large companies, and organizations, handle large amounts of data. If you go into any large bank, such as American or Canadian banks, these banks use this solution because it is more reliable, secure, and has more functionality."
"Informatica Axon is a costly solution. I rate Informatica Axon a four out of ten for its pricing."
"The price is neither too high nor too low."
"Pricing is determined by the number of licensed users as well as the number of Core CPUs."
"The licensing price of the product depends on the organization's requirements."
"My understanding is that Informatica is quite expensive compare to other tools that are available in the market."
"Informatica MDM's pricetag should come down. They have to cut some costs."
"The licensing is expensive, and there are other costs involved too. I know from using the software that you have to buy new features whenever there are new updates, which I don't really like. But initially, it was very good."
"We are running the community version right now, which can be used free of charge."
"It's not so favorable for small companies."
"There are different versions of the product. One is the corporate license version, and the other one is the open-source or free version. I have been using the corporate license version, but they have recently launched a new open-source version so that anybody can create an account and use it. The licensing cost varies from customer to customer. I don't have a lot of input on that. It is taken care of by PMO, and they seem fine with its pricing model. It is being used enterprise-wide. They seem to have got a good deal for StreamSets."
"Its pricing is pretty much up to the mark. For smaller enterprises, it could be a big price to pay at the initial stage of operations, but the moment you have the Seed B or Seed C funding and you want to scale up your operations and aren't much worried about the funds, at that point in time, you would need a solution that could be scaled."
"StreamSets Data Collector is open source. One can utilize the StreamSets Data Collector, but the Control Hub is the main repository where all the jobs are present. Everything happens in Control Hub."
"It's not expensive because you pay per month, and the tasks you can perform with it are huge. It's reliable and cost-effective."
"We use the free version. It's great for a public, free release. Our stance is that the paid support model is too expensive to get into. They should honestly reevaluate that."
report
Use our free recommendation engine to learn which Data Integration solutions are best for your needs.
816,406 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
Financial Services Firm
17%
Computer Software Company
13%
Manufacturing Company
10%
Insurance Company
6%
Financial Services Firm
17%
Computer Software Company
13%
Manufacturing Company
8%
Insurance Company
6%
 

Company Size

By reviewers
Large Enterprise
Midsize Enterprise
Small Business
 

Questions from the Community

How does Azure Data Factory compare with Informatica Cloud Data Integration?
Azure Data Factory is a solid product offering many transformation functions; It has pre-load and post-load transformations, allowing users to apply transformations either in code by using Power Q...
Which Informatica product would you choose - PowerCenter or Cloud Data Integration?
Complex transformations can easily be achieved using PowerCenter, which has all the features and tools to establish a real data governance strategy. Additionally, PowerCenter is able to manage huge...
What are the biggest benefits of using Informatica Cloud Data Integration?
When it comes to cloud data integration, this solution can provide you with multiple benefits, including: Overhead reduction by integrating data on any cloud in various ways Effective integration ...
What do you like most about StreamSets?
The best thing about StreamSets is its plugins, which are very useful and work well with almost every data source. It's also easy to use, especially if you're comfortable with SQL. You can customiz...
What needs improvement with StreamSets?
We often faced problems, especially with SAP ERP. We struggled because many columns weren't integers or primary keys, which StreamSets couldn't handle. We had to restructure our data tables, which ...
What is your primary use case for StreamSets?
StreamSets is used for data transformation rather than ETL processes. It focuses on transforming data directly from sources without handling the extraction part of the process. The transformed data...
 

Also Known As

ActiveVOS, Active Endpoints, BPM, Address Verification, Persistent Data Masking, Cloud Test Data Management, PIM, , Enterprise Data Catalog, Data Integration Hub, Cloud Data Integration, Data Quality, Cloud API and App Integration
No data available
 

Learn More

Video not available
 

Overview

 

Sample Customers

The Travel Company, Carbonite
Availity, BT Group, Humana, Deluxe, GSK, RingCentral, IBM, Shell, SamTrans, State of Ohio, TalentFulfilled, TechBridge
Find out what your peers are saying about Informatica Intelligent Data Management Cloud (IDMC) vs. StreamSets and other solutions. Updated: November 2024.
816,406 professionals have used our research since 2012.