
Equalum vs StreamSets comparison

 

Comparison Buyer's Guide

Executive Summary
Updated on Dec 19, 2024
 

Categories and Ranking

Equalum
Ranking in Data Integration: 53rd
Average Rating: 9.2
Reviews Sentiment: 7.1
Number of Reviews: 7
Ranking in other categories: Data Replication (17th), Cloud Data Integration (28th)

StreamSets
Ranking in Data Integration: 9th
Average Rating: 8.4
Reviews Sentiment: 7.1
Number of Reviews: 22
Ranking in other categories: none
 

Mindshare comparison

As of December 2024, in the Data Integration category, Equalum's mindshare is 0.1%, down from 0.3% a year earlier. StreamSets' mindshare is 1.8%, up from 1.4% a year earlier. Mindshare is calculated from PeerSpot user engagement data.
 

Featured Reviews

reviewer1525674 - PeerSpot reviewer
Frees staff to focus on data workflow and on what can be done with data, and away from the details of the technology
There are areas they can do better in, like most software companies that are still relatively young. They need to expand their capabilities in some of the targets, as well as source connectors, and native connectors for a number of large data sources and databases. That's a huge challenge for every company in this area, not just Equalum. If I had the wherewithal to create a tool that could allow for all that connectivity, it would be massive, out-of-the-box. There are all the updates every month. An open source changes constantly, so compatibility for these sources or targets is not easy. And a lot of targets are proprietary and they actually don't want you to connect with them in real time. They want to keep that connectivity for their own competitive tool. What happens is that a customer will say, "Okay, I've got Oracle, and I've got MariaDB, and I've got SQL Server over here, and I've got something else over there. And I want to aggregate that, and put it into Google Cloud Platform." Having connectors to all of those is extremely difficult, as is maintaining them. So there are major challenges to keeping connectivity to those data sources, especially at a CDC level, because you've got to maintain your connectors. And every change that's made with a new version that comes out means they've got to upgrade their version of the connector. It's a real challenge in the industry. But one good thing about Equalum is that they're up for the challenge. If there's a customer opportunity, they will develop and make sure that they update a connector to meet the needs of the customer. They'll also look at custom development of connectors, based on the customer opportunity. It's a work in progress. Everybody in the space is in the same boat. And it's not just ETL tools. It's everybody in the Big Data space. It's a challenge. The other area for improvement, for Equalum, is their documentation of the product. 
But that comes with being a certain size and having a marketing team of 30 or 40 people and growing as an organization. They're getting there and I believe they know what the deficiencies are. Maintaining and driving a channel business, like Equalum is doing, is really quite a different business model than the direct-sales model. It requires a tremendous amount of documentation, marketing information, and educational information. It's not easy.
Reyansh Kumar - PeerSpot reviewer
We no longer need to hire highly skilled data engineers to create and monitor data pipelines
The things I like about StreamSets are its overall user interface, efficiency, and product features, which are all good. Also, the scheduling within the data engineering pipeline is very much appreciated, and it has a wide range of connectors for connecting to any data sources like SQL Server, AWS, Azure, etc. We have used it with Kafka, Hadoop, and Azure Data Factory Datasets. Connecting to these systems with StreamSets is very easy. You just need to configure the data sources, the paths and their configurations, and you are ready to go. It is very efficient and very easy to use for ETL pipelines. It is a GUI-based interface in which you can easily create or design your own data pipelines with just a few clicks. As for moving data into modern analytics systems, we are using it with Microsoft Power BI, AWS, and some on-premises solutions, and it is very easy to get data from StreamSets into them. No hardcore coding or special technical expertise is required. It is also a no-code platform in which you can configure your data sources and data output for easy configuration of your data pipeline. This is a very important aspect because if a tool requires code development, we need to hire software developers to get the task done. By using StreamSets, it can be done with a few clicks.

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"Equalum has resulted in system performance improvements in our organization. Now, I am ingressing data off of multiple S3 sources, doing data processing, and formatting a schema. This would usually take me a couple of days, but now it takes me hours."
"It's got it all, from end-to-end. It's the glue. There are a lot of other products out there, good products, but there's always a little bit of something missing from the other products. Equalum did its research well and understood the requirements of large enterprise and governments in terms of one tool to rule them all, from a data migration integration perspective."
"The main impact for Oracle LogMiner is the performance. Performance is drastically reduced if you use the solution’s Oracle Binary Log Parser. So, if we have 60 million records, initially it used to take a minute. Now, it takes a second to do synchronization from the source and target tables."
"All our architectural use cases are on a single platform, not multiple platforms. You don't have to dump into different modules because it is the same module everywhere."
"Equalum is real-time. If you are moving from an overnight process to a real-time process, there is always a difference in what reports and analytics show compared to what our operational system shows. Some of our organizations, especially finance, don't want those differences to be shown. Therefore, going to a real-time environment makes the data in one place match the data in another place. Data accuracy is almost instantaneous with this tool."
"I found two features in Equalum that I consider the most valuable. One is that Equalum is a no-code tool. You can do your activities on its graphical interface, which doesn't require complex knowledge of extracting, changing, or loading data. Another feature of Equalum that I like the most is that it monitors the data transfers and tells you if there's any issue so that you can quickly check and correct it. Equalum also tells you where the problem lies, for example, if it's a hardware or communication issue."
"Equalum provides a single platform for core architectural use cases, including CDC replication, streaming ETL, and batch ETL. That is important to our clients because there is no other single-focus product that covers these areas in that much detail, and with this many features on the platform. The fact that they are single-minded and focused on CDC and ETL makes this such a rich solution. Other solutions cover these things a little bit in their multi-function products, but they don't go as deep."
"It's a really powerful platform in terms of the combination of technologies they've developed and integrated together, out-of-the-box. The combination of Kafka and Spark is, we believe, quite unique, combined with CDC capabilities. And then, of course, there are the performance aspects. As an overall package, it's a very powerful data integration, migration, and replication tool."
"StreamSets data drift feature gives us an alert upfront so we know that the data can be ingested. Whatever the schema or data type changes, it lands automatically into the data lake without any intervention from us, but then that information is crucial to fix for downstream pipelines, which process the data into models, like Tableau and Power BI models. This is actually very useful for us. We are already seeing benefits. Our pipelines used to break when there were data drift changes, then we needed to spend about a week fixing it. Right now, we are saving one to two weeks. Though, it depends on the complexity of the pipeline, we are definitely seeing a lot of time being saved."
"It's very easy to integrate. It integrates with Snowflake, AWS, Google Cloud, and Azure. It's very helpful for DevOps, DataOps, and data engineering because it provides a comprehensive solution, and it's not complicated."
"I have used Data Collector, Transformer, and Control Hub products from StreamSets. What I really like about these products is that they're very user-friendly. People who are not from a technological or core development background find it easy to get started and build data pipelines and connect to the databases. They would be comfortable like any technical person within a couple of weeks."
"One of the things I like is the data pipelines. They have a very good design. Implementing pipelines is very straightforward. It doesn't require any technical skill."
"The entire user interface is very simple and the simplicity of creating pipelines is something that I like very much about it. The design experience is very smooth."
"The best thing about StreamSets is its plugins, which are very useful and work well with almost every data source. It's also easy to use, especially if you're comfortable with SQL. You can customize it to do what you need. Many other tools have started to use features similar to those introduced by StreamSets, like automated workflows that are easy to set up."
"What I love the most is that StreamSets is very light. It's a containerized application. It's easy to use with Docker. If you are a large organization, it's very easy to use Kubernetes."
"Also, the intuitive canvas for designing all the streams in the pipeline, along with the simplicity of the entire product are very big pluses for me. The software is very simple and straightforward. That is something that is needed right now."
 

Cons

"I should be able to see only my project versus somebody else's garbage. That is something that would be good in future. Right now, the security is by tenants, but I would like to have it by project, e.g., this project has this source and flows in these streams, and I have access to this on this site."
"If you need to use the basic features of Equalum, for example, you don't even need data integration, then many competitors in the market can give you basic features. For instance, if you need batch ETL, you can pick among solutions in the market that have been around longer than Equalum. What needs improvement in Equalum is replication, as it could be faster. Equalum also needs better integration with specific databases such as Oracle and Microsoft SQL Server."
"Their UI could use some work. Also, they could make it just a little faster to get around their user interface. It could be a bit more intuitive with things like keyboard shortcuts."
"The deployment of their flows needs improvement. It doesn't work with a typical Git branching and CI/CD deployment strategy."
"Right now, they have a good notification system, but it is in bulk. For example, if I have five projects running and I put a notification, the notification comes back to me for all five projects. I would like the notification to come back only for one project."
"There is not enough proven integration with other vendors. That is what needs to be worked on. Equalum hasn't tested anything between vendors, which worries our clients. We need more proven vendor integration. It is an expensive product and it needs to support a multi-vendor approach."
"They need to expand their capabilities in some of the targets, as well as source connectors, and native connectors for a number of large data sources and databases. That's a huge challenge for every company in this area, not just Equalum."
"We create pipelines or jobs in StreamSets Control Hub. It is a great feature, but if there is a way to have a folder structure or organize the pipelines and jobs in Control Hub, it would be great. I submitted a ticket for this some time back."
"The execution engine could be improved. When I was at their session, they were using some obscure platform to run. There is a controller, which controls what happens on that, but you should be able to easily do this at any of the cloud services, such as Google Cloud. You shouldn't have any issues in terms of how to run it with their online development platform or design platform, basically their execution engine. There are issues with that."
"We often faced problems, especially with SAP ERP. We struggled because many columns weren't integers or primary keys, which StreamSets couldn't handle. We had to restructure our data tables, which was painful. Also, pipeline failures were common, and data drifting wasn't addressed, which made things worse. Licensing was another issue we encountered."
"Using ETL pipelines is a bit complicated and requires some technical aid."
"They need to improve their customer care services. Sometimes it has taken more than 48 hours to resolve an issue. That should be reduced. They are aware of small or generic issues, but not the more technical or deep issues. For those, they require some time, generally 48 to 72 hours to respond. That should be improved."
"The design experience is the bane of our existence because their documentation is not the best. Even when they update their software, they don't publish the best information on how to update and change your pipeline configuration to make it conform to current best practices. We don't pay for the added support. We use the "freeware version." The user community, as well as the documentation they provide for the standard user, are difficult, at best."
"If you use JDBC Lookup, for example, it generally takes a long time to process data."
"Currently, we can only use the query to read data from SAP HANA. What we would like to see, as soon as possible, is the ability to read from multiple tables from SAP HANA. That would be a really good thing that we could use immediately. For example, if you have 100 tables in SQL Server or Oracle, then you could just point it to the schema or the 100 tables and ingestion information. However, you can't do that in SAP HANA since StreamSets currently is lacking in this. They do not have a multi-table feature for SAP HANA. Therefore, a multi-table origin for SAP HANA would be helpful."
 

Pricing and Cost Advice

"Equalum is rather expensive compared to its competitors. So, you have to make up that cost in time savings, and we usually do that. If we are saving money, it is because we are reducing our development time."
"They have a very simple approach to licensing. They don't get tied up with different types of connectivity to different databases. If you need more connectors or if you need more CPU, you just add on. It's component-based pricing."
"Equalum licensing costs vary, but I won't be able to give information on its fees."
"As soon as you have more than six users, Equalum is lower in cost [than Talend] and if the group gets bigger, it's quite a big delta. If more users want to use it, you don't end up with an increase in licensing costs, so that makes it very easy. And if you need more licenses or more sources, it's a very simple upgrade methodology."
"Equalum was reasonably priced. It is not like those million dollar tools, such as Informatica."
"We use the free version. It's great for a public, free release. Our stance is that the paid support model is too expensive to get into. They should honestly reevaluate that."
"Its pricing is pretty much up to the mark. For smaller enterprises, it could be a big price to pay at the initial stage of operations, but the moment you have the Seed B or Seed C funding and you want to scale up your operations and aren't much worried about the funds, at that point in time, you would need a solution that could be scaled."
"I believe the pricing is not equitable."
"The overall cost is very flexible so it is not a burden for our organization... However, the cost should be improved. For small and mid-size organizations it might be a challenge."
"StreamSets Data Collector is open source. One can utilize the StreamSets Data Collector, but the Control Hub is the main repository where all the jobs are present. Everything happens in Control Hub."
"It's not so favorable for small companies."
"We are running the community version right now, which can be used free of charge."
"There are different versions of the product. One is the corporate license version, and the other one is the open-source or free version. I have been using the corporate license version, but they have recently launched a new open-source version so that anybody can create an account and use it. The licensing cost varies from customer to customer. I don't have a lot of input on that. It is taken care of by PMO, and they seem fine with its pricing model. It is being used enterprise-wide. They seem to have got a good deal for StreamSets."
824,053 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
Manufacturing Company: 20%
Computer Software Company: 17%
Financial Services Firm: 8%
Government: 7%

Financial Services Firm: 17%
Computer Software Company: 10%
Manufacturing Company: 10%
Insurance Company: 7%
 

Company Size

By reviewers: Large Enterprise, Midsize Enterprise, Small Business
 

Questions from the Community

Why should I use Equalum instead of LogMiner?
You'd want to use the Equalum Oracle Binary Log Parser because it's just better than the LogMiner. Sure, LogMiner is made by Oracle and probably the team knows some insight to make it efficient th...
Is Equalum compatible with all databases?
I'm using Equalum's data replication software for Oracle because that's the one database it's designed for. While it may sound limiting, when you find out how many solutions it can provide for you ...
Can I use Equalum for free?
No, it's not free, but you can benefit from a free trial. There's an option to try their platform for a limited amount of time, so that may be useful to help you decide if you want to contin...
What do you like most about StreamSets?
The best thing about StreamSets is its plugins, which are very useful and work well with almost every data source. It's also easy to use, especially if you're comfortable with SQL. You can customiz...
What needs improvement with StreamSets?
We often faced problems, especially with SAP ERP. We struggled because many columns weren't integers or primary keys, which StreamSets couldn't handle. We had to restructure our data tables, which ...
What is your primary use case for StreamSets?
StreamSets is used for data transformation rather than ETL processes. It focuses on transforming data directly from sources without handling the extraction part of the process. The transformed data...
 


Sample Customers

SIEMENS, GSK, Wal-Mart, T Systems
Availity, BT Group, Humana, Deluxe, GSK, RingCentral, IBM, Shell, SamTrans, State of Ohio, TalentFulfilled, TechBridge
Find out what your peers are saying about Equalum vs. StreamSets and other solutions. Updated: November 2024.