Try our new research platform with insights from 80,000+ expert users

Matillion Data Productivity Cloud vs StreamSets comparison

 

Comparison Buyer's Guide

Executive SummaryUpdated on Mar 31, 2025

Review summaries and opinions

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Categories and Ranking

Matillion Data Productivity...
Average Rating
8.4
Reviews Sentiment
7.4
Number of Reviews
26
Ranking in other categories
Cloud Data Integration (6th)
StreamSets
Average Rating
8.4
Reviews Sentiment
7.0
Number of Reviews
21
Ranking in other categories
Data Integration (15th)
 

Featured Reviews

Jacques Du Preez - PeerSpot reviewer
Offers good user interface and easy to navigate
Our experience is mainly around using Snowflake Data Cloud with Matillion. And the two of them combined, offer superior performance and price point benefits. Using them together is very efficient because Matillion's processing and Snowflake's own scalability and consumption based on pushing down code work well together. They are very efficient. We've used Azure Data Factory (ADF) for integration. So that's an alternative, depending on customer choice for the integration. And then on AWS, a few of the other services, like Terraform, and S3 storage, and that's about it. It's on a case-by-case basis. If they've chosen ADF as a technology, then we implement using that choice.
Nantabo Jackie - PeerSpot reviewer
Simplified pipelines and helped us break down data silos within our organization
The design experience when implementing batch streaming or ECL pipelines is very easy and straightforward. When we initially attempted to integrate StreamSets with Kafka, it was somewhat challenging until we consulted the documentation, after which it became straightforward. We use StreamSets to move data into modern analytics platforms. Moving the data into modern analytics platforms is still complex. It requires a lot of understanding of logic. StreamSets enables us to build data pipelines without knowing how to code. StreamSets' ability to build data pipelines without requiring us to know complex programming is very important, as it allows us to focus on our projects without spending time writing code. StreamSets' Transformer for Snowflake is simple to use for designing both simple and complex transformation logic. StreamSets' Transformer for Snowflake is extremely important to me as it helps me to connect external data sources and keep my internal workflow organized. Transformer for Snowflake's functionality is a perfect ten out of ten. It is important and cost-effective that Transformer for Snowflake is a serverless engine embedded within the platform, as without this feature, it would be very expensive. This feature helps us to sell at lower budget costs, which would otherwise be at a high cost with other servers. StreamSets has helped improve our organization. StreamSets simplified pipelines for our organization. It is easier to complete a project when we know where and how to start, and working with the team remotely makes it more efficient. This helps us to save time and be more organized when creating data pipelines. Being a structured company that produces reliable resources for our application benefits both our clients and contacts. StreamSets' built-in data drift resilience plays a part in our ETL operations. With prior knowledge, the built-in data drift resilience is very effective, but it can be challenging to implement without the preexisting knowledge. The built-in data drift resilience reduced the time it takes us to fix data drift breakages by 45 percent. StreamSets helped us break down data silos within our organization. The use of StreamSets to break down data silos enabled us to be confident in the services and products we provide, as well as the real-time streaming we offer. This has had a positive impact on our business, as it allowed us to accurately determine the analytics we need to present to stakeholders, clients, and our sources while ensuring that the process is secure and transparent. StreamSets saved us time because anyone can use StreamSets not just developers. We can save around 40 percent of our time. StreamSets' reusable assets helped us reduce workload by around 25 percent. StreamSets saved us money by not having to hire developers with specialized skills. We saved around $2,000 US. StreamSets helped us scale our data operations. Since StreamSets makes it easy to scale our data operations, it enabled us to know exactly where to start at any time. We are aware of the timeline for completing the project, and depending on our familiarity with the software, we can come up with a solution quickly.

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"It is an incredibly user-friendly and intuitive tool, making the learning curve quite smooth"
"The technical support treats us well. They already have a support portal, and they are responsive, which helps."
"The product is quite stable and can handle complex data integration tasks well."
"The solution's most valuable feature is the CDC (Change Data Capture) component."
"It's been able to do everything we require."
"Matillion ETL has great Git integration that is perfect and convenient to use."
"The product's initial setup phase was easy."
"The most valuable feature of Matillion ETL is the ETL. The solution is open-source which provides advantages, such as good performance and high efficiency. Additionally, it supports three data types which eliminates predefining the data, and we can write script models in Python."
"The most valuable feature is the pipelines because they enable us to pull in and push out data from different sources and to manipulate and clean things up within them."
"The most valuable features are the option of integration with a variety of protocols, languages, and origins."
"The Ease of configuration for pipes is amazing. It has a lot of connectors. Mainly, we can do everything with the data in the pipe. I really like the graphical interface too"
"The entire user interface is very simple and the simplicity of creating pipelines is something that I like very much about it. The design experience is very smooth."
"The most valuable would be the GUI platform that I saw. I first saw it at a special session that StreamSets provided towards the end of the summer. I saw the way you set it up and how you have different processes going on with your data. The design experience seemed to be pretty straightforward to me in terms of how you drag and drop these nodes and connect them with arrows."
"For me, the most valuable features in StreamSets have to be the Data Collector and Control Hub, but especially the Data Collector. That feature is very elegant and seamlessly works with numerous source systems."
"I really appreciate the numerous ready connectors available on both the source and target sides, the support for various media file formats, and the ease of configuring and managing pipelines centrally."
"In StreamSets, everything is in one place."
 

Cons

"One of the features that's in development is data privacy in the cloud, along with further SAP integration. For connectivity to SAP systems."
"Matillion’s on-premises capabilities don’t allow you to build something customized."
"Matillion ETL should include more enhanced capabilities for extracting data from the SAP systems."
"The improvement area could be possible if the tool provides better integration capabilities with other ecosystems, including governance tools or data cataloging tools, as it is currently an area where the solution is lacking."
"Performance can be improved for efficiency, and it can be made faster."
"Unlike Snowflake which automatically takes care of upgrading to the latest version and includes additional features, with Matillion ETL we need to do this ourselves."
"To complete the pipeline, they might want to include some connectors which would put the data into different platforms. This would be helpful."
"Sometimes, we have issues with the solution's stability and need to restart it for three weeks or more."
"There aren't enough hands-on labs, and debugging is also an issue because it takes a lot of time. Logs are not that clear when you are debugging, and you can only select a single source for a pipeline."
"Visualization and monitoring need to be improved and refined."
"In terms of the product, I don't think there is any room for improvement because it is very good. One small area of improvement that is very much needed is on the knowledge base side. Sometimes, it is not very clear how to set up a certain process or a certain node for a person who's using the platform for the first time."
"StreamSets should provide a mechanism to be able to perform data quality assessment when the data is being moved from one source to the target."
"They need to improve their customer care services. Sometimes it has taken more than 48 hours to resolve an issue. That should be reduced. They are aware of small or generic issues, but not the more technical or deep issues. For those, they require some time, generally 48 to 72 hours to respond. That should be improved."
"The logging mechanism could be improved. If I am working on a pipeline, then create a job out of it and it is running, it will generate constant logs. So, the logging mechanism could be simplified. Now, it is a bit difficult to understand and filter the logs. It takes some time."
"The design experience is the bane of our existence because their documentation is not the best. Even when they update their software, they don't publish the best information on how to update and change your pipeline configuration to make it conform to current best practices. We don't pay for the added support. We use the "freeware version." The user community, as well as the documentation they provide for the standard user, are difficult, at best."
"We create pipelines or jobs in StreamSets Control Hub. It is a great feature, but if there is a way to have a folder structure or organize the pipelines and jobs in Control Hub, it would be great. I submitted a ticket for this some time back."
 

Pricing and Cost Advice

"The prices needs to be lower."
"It is cost-effective. Based on our use case, it's efficient and cheap. It saves a lot of money and our upfront costs are less."
"The price of Matillion ETL is reasonable."
"Its price depends on what you expect. You pay on a monthly basis, but there is a possibility to have special contracts depending on the installation."
"Purchasing it through the AWS Marketplace is pretty convenient. There is a little bit of back and forth in terms of the licensing based on the machine size, but it seems to have worked out well. it is convenient to have it all as part of our AWS billing."
"Matillion ETL is expensive."
"The pricing depends on what edition the customer opts for. For example, the standard edition is priced at $2.00 per credit. And you are only charged when you use it. You're not charged when it's idle."
"A rough estimation of the cost is around 20,000 dollars a month, however, this is dependent on the machine used and how Matillion ETL is used."
"Its pricing is pretty much up to the mark. For smaller enterprises, it could be a big price to pay at the initial stage of operations, but the moment you have the Seed B or Seed C funding and you want to scale up your operations and aren't much worried about the funds, at that point in time, you would need a solution that could be scaled."
"The pricing is too fixed. It should be based on how much data you need to process. Some businesses are not so big that they process a lot of data."
"The licensing is expensive, and there are other costs involved too. I know from using the software that you have to buy new features whenever there are new updates, which I don't really like. But initially, it was very good."
"We are running the community version right now, which can be used free of charge."
"StreamSets Data Collector is open source. One can utilize the StreamSets Data Collector, but the Control Hub is the main repository where all the jobs are present. Everything happens in Control Hub."
"There are two editions, Professional and Enterprise, and there is a free trial. We're using the Professional edition and it is competitively priced."
"The overall cost is very flexible so it is not a burden for our organization... However, the cost should be improved. For small and mid-size organizations it might be a challenge."
"StreamSets is an expensive solution."
report
Use our free recommendation engine to learn which Cloud Data Integration solutions are best for your needs.
845,406 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
Financial Services Firm
18%
Computer Software Company
15%
Manufacturing Company
9%
Energy/Utilities Company
5%
Financial Services Firm
14%
Computer Software Company
11%
Manufacturing Company
10%
Insurance Company
8%
 

Company Size

By reviewers
Large Enterprise
Midsize Enterprise
Small Business
 

Questions from the Community

What do you like most about Matillion ETL?
The new version with the Productivity Cloud is very simple. It's easy to use, navigate, and understand.
What is your experience regarding pricing and costs for Matillion ETL?
The solution's pricing is not based on the licensing cost but on the running hours when the Matillion instance is up and running. Its pricing model is different from the traditional pricing models ...
What needs improvement with Matillion ETL?
Depending on the use case, the solution's pricing could be improved. Matillion ETL should include more enhanced capabilities for extracting data from the SAP systems.
What do you like most about StreamSets?
The best thing about StreamSets is its plugins, which are very useful and work well with almost every data source. It's also easy to use, especially if you're comfortable with SQL. You can customiz...
What needs improvement with StreamSets?
We often faced problems, especially with SAP ERP. We struggled because many columns weren't integers or primary keys, which StreamSets couldn't handle. We had to restructure our data tables, which ...
What is your primary use case for StreamSets?
StreamSets is used for data transformation rather than ETL processes. It focuses on transforming data directly from sources without handling the extraction part of the process. The transformed data...
 

Also Known As

Matillion ETL for Redshift, Matillion ETL for Snowflake, Matillion ETL for BigQuery
No data available
 

Overview

 

Sample Customers

Thrive Market, MarketBot, PWC, Axtria, Field Nation, GE, Superdry, Quantcast, Lightbox, EDF Energy, Finn Air, IPRO, Twist, Penn National Gaming Inc
Availity, BT Group, Humana, Deluxe, GSK, RingCentral, IBM, Shell, SamTrans, State of Ohio, TalentFulfilled, TechBridge
Find out what your peers are saying about Matillion Data Productivity Cloud vs. StreamSets and other solutions. Updated: March 2025.
845,406 professionals have used our research since 2012.