Try our new research platform with insights from 80,000+ expert users

Matillion ETL vs StreamSets comparison

 

Comparison Buyer's Guide

Executive Summary
 

Categories and Ranking

Matillion ETL
Average Rating
8.4
Reviews Sentiment
7.4
Number of Reviews
26
Ranking in other categories
Cloud Data Integration (5th)
StreamSets
Average Rating
8.4
Reviews Sentiment
7.1
Number of Reviews
22
Ranking in other categories
Data Integration (9th)
 

Featured Reviews

AntonHaupt - PeerSpot reviewer
Efficient data integration and transformation with seamless cloud-native integration
In our small business unit, we currently have around four users, with two of them utilizing Matillion within our organization. Considering our growing needs, we're contemplating transitioning to an enterprise SaaS solution where we would share the same instance. Currently, each user is billed individually, but consolidating to a shared instance seems more efficient. Scalability is excellent when using the SaaS solution, easily reaching a rating of ten out of ten. Each data pipeline request is encapsulated within a Docker container and spun off, allowing for instant scalability. Overall, I would rate it a nine out of ten in terms of performance and scalability.
Reyansh Kumar - PeerSpot reviewer
We no longer need to hire highly skilled data engineers to create and monitor data pipelines
The things I like about StreamSets are its * overall user interface * efficiency * product features, which are all good. Also, the scheduling within the data engineering pipeline is very much appreciated, and it has a wide range of connectors for connecting to any data sources like SQL Server, AWS, Azure, etc. We have used it with Kafka, Hadoop, and Azure Data Factory Datasets. Connecting to these systems with StreamSets is very easy. You just need to configure the data sources, the paths and their configurations, and you are ready to go. It is very efficient and very easy to use for ETL pipelines. It is a GUI-based interface in which you can easily create or design your own data pipelines with just a few clicks. As for moving data into modern analytics systems, we are using it with Microsoft Power BI, AWS, and some on-premises solutions, and it is very easy to get data from StreamSets into them. No hardcore coding or special technical expertise is required. It is also a no-code platform in which you can configure your data sources and data output for easy configuration of your data pipeline. This is a very important aspect because if a tool requires code development, we need to hire software developers to get the task done. By using StreamSets, it can be done with a few clicks.

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"The loading of data is the most valuable feature of Matillion ETL."
"It has helped us to get onto the cloud quickly."
"It has improved the costs of managing my customer’s data."
"The product is quite stable and can handle complex data integration tasks well."
"The technical support treats us well. They already have a support portal, and they are responsive, which helps."
"It takes less than five minutes to set up and delivers results. It is much quicker than traditional ETL technologies."
"The simplicity of this tool is nice. It has a good graphical user interface. You can also do a lot of generic stuff in the tool. If there is good connectivity to a cloud database, such as Snowflake, and you can have a lot of Snowflake functionality in the tool."
"It is an incredibly user-friendly and intuitive tool, making the learning curve quite smooth"
"The most valuable would be the GUI platform that I saw. I first saw it at a special session that StreamSets provided towards the end of the summer. I saw the way you set it up and how you have different processes going on with your data. The design experience seemed to be pretty straightforward to me in terms of how you drag and drop these nodes and connect them with arrows."
"I have used Data Collector, Transformer, and Control Hub products from StreamSets. What I really like about these products is that they're very user-friendly. People who are not from a technological or core development background find it easy to get started and build data pipelines and connect to the databases. They would be comfortable like any technical person within a couple of weeks."
"In StreamSets, everything is in one place."
"One of the things I like is the data pipelines. They have a very good design. Implementing pipelines is very straightforward. It doesn't require any technical skill."
"The scheduling within the data engineering pipeline is very much appreciated, and it has a wide range of connectors for connecting to any data sources like SQL Server, AWS, Azure, etc. We have used it with Kafka, Hadoop, and Azure Data Factory Datasets. Connecting to these systems with StreamSets is very easy."
"The ETL capabilities are very useful for us. We extract and transform data from multiple data sources, into a single, consistent data store, and then we put it in our systems. We typically use it to connect our Apache Kafka with data lakes. That process is smooth and saves us a lot of time in our production systems."
"What I love the most is that StreamSets is very light. It's a containerized application. It's easy to use with Docker. If you are a large organization, it's very easy to use Kubernetes."
"For me, the most valuable features in StreamSets have to be the Data Collector and Control Hub, but especially the Data Collector. That feature is very elegant and seamlessly works with numerous source systems."
 

Cons

"Sometimes, we have issues with the solution's stability and need to restart it for three weeks or more."
"While the UI is good, it could be improved in its efficiency and made easier to use."
"Ideally, I would like it to integrate with Secrets Manager as well as the AWS."
"The cost of the solution is high and could be reduced."
"The product must enhance its near-real-time data capture feature."
"In the next release, we would like to have connections to more databases."
"To complete the pipeline, they might want to include some connectors which would put the data into different platforms. This would be helpful."
"I am looking forward to seeing the expansion of the source range for their data loader product."
"They need to improve their customer care services. Sometimes it has taken more than 48 hours to resolve an issue. That should be reduced. They are aware of small or generic issues, but not the more technical or deep issues. For those, they require some time, generally 48 to 72 hours to respond. That should be improved."
"Visualization and monitoring need to be improved and refined."
"Using ETL pipelines is a bit complicated and requires some technical aid."
"We've seen a couple of cases where it appears to have a memory leak or a similar problem."
"One thing that I would like to add is the ability to manually enter data. The way the solution currently works is we don't have the option to manually change the data at any point in time. Being able to do that will allow us to do everything that we want to do with our data. Sometimes, we need to manually manipulate the data to make it more accurate in case our prior bifurcation filters are not good. If we have the option to manually enter the data or make the exact iterations on the data set, that would be a good thing."
"If you use JDBC Lookup, for example, it generally takes a long time to process data."
"The execution engine could be improved. When I was at their session, they were using some obscure platform to run. There is a controller, which controls what happens on that, but you should be able to easily do this at any of the cloud services, such as Google Cloud. You shouldn't have any issues in terms of how to run it with their online development platform or design platform, basically their execution engine. There are issues with that."
"The monitoring visualization is not that user-friendly. It should include other features to visualize things, like how many records were streamed from a source to a destination on a particular date."
 

Pricing and Cost Advice

"The prices needs to be lower."
"It was procured through the AWS Marketplace because it keeps things simple. They offer retail-like checkout and bill through your existing Amazon Web Services account."
"The solution is very cheap. You're paying $2.50 an hour and if you set your service up, which you can do, you're not getting charged. Currently, our ETL process is just an overnight process that runs for about an hour. I can start and stop my server just for an hour if I want to and spent $2.50 a day for an ETL solution. There are no additional costs."
"I think it is cost conscious. It used to be very cheap and they have more recently bumped up the pricing, so it is competitive now."
"Matillion ETL is expensive."
"It was very easy to purchase through the AWS Marketplace, but it was also expensive."
"It is not necessarily a cheap solution. However, it's reasonable priced, especially with the smaller machines that we run it on."
"The cost of the solution is high and could be reduced."
"StreamSets is expensive, especially for small businesses."
"There are two editions, Professional and Enterprise, and there is a free trial. We're using the Professional edition and it is competitively priced."
"It's not so favorable for small companies."
"There are different versions of the product. One is the corporate license version, and the other one is the open-source or free version. I have been using the corporate license version, but they have recently launched a new open-source version so that anybody can create an account and use it. The licensing cost varies from customer to customer. I don't have a lot of input on that. It is taken care of by PMO, and they seem fine with its pricing model. It is being used enterprise-wide. They seem to have got a good deal for StreamSets."
"The overall cost is very flexible so it is not a burden for our organization... However, the cost should be improved. For small and mid-size organizations it might be a challenge."
"StreamSets is an expensive solution."
"I believe the pricing is not equitable."
"The pricing is too fixed. It should be based on how much data you need to process. Some businesses are not so big that they process a lot of data."
report
Use our free recommendation engine to learn which Cloud Data Integration solutions are best for your needs.
824,067 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
Financial Services Firm
16%
Computer Software Company
14%
Manufacturing Company
9%
Government
6%
Financial Services Firm
17%
Computer Software Company
10%
Manufacturing Company
9%
Insurance Company
6%
 

Company Size

By reviewers
Large Enterprise
Midsize Enterprise
Small Business
 

Questions from the Community

What do you like most about Matillion ETL?
The new version with the Productivity Cloud is very simple. It's easy to use, navigate, and understand.
What is your experience regarding pricing and costs for Matillion ETL?
The solution's pricing is not based on the licensing cost but on the running hours when the Matillion instance is up and running. Its pricing model is different from the traditional pricing models ...
What needs improvement with Matillion ETL?
Depending on the use case, the solution's pricing could be improved. Matillion ETL should include more enhanced capabilities for extracting data from the SAP systems.
What do you like most about StreamSets?
The best thing about StreamSets is its plugins, which are very useful and work well with almost every data source. It's also easy to use, especially if you're comfortable with SQL. You can customiz...
What needs improvement with StreamSets?
We often faced problems, especially with SAP ERP. We struggled because many columns weren't integers or primary keys, which StreamSets couldn't handle. We had to restructure our data tables, which ...
What is your primary use case for StreamSets?
StreamSets is used for data transformation rather than ETL processes. It focuses on transforming data directly from sources without handling the extraction part of the process. The transformed data...
 

Comparisons

 

Also Known As

Matillion ETL for Redshift, Matillion ETL for Snowflake, Matillion ETL for BigQuery
No data available
 

Learn More

Video not available
 

Overview

 

Sample Customers

Thrive Market, MarketBot, PWC, Axtria, Field Nation, GE, Superdry, Quantcast, Lightbox, EDF Energy, Finn Air, IPRO, Twist, Penn National Gaming Inc
Availity, BT Group, Humana, Deluxe, GSK, RingCentral, IBM, Shell, SamTrans, State of Ohio, TalentFulfilled, TechBridge
Find out what your peers are saying about Matillion ETL vs. StreamSets and other solutions. Updated: December 2024.
824,067 professionals have used our research since 2012.