Try our new research platform with insights from 80,000+ expert users

Pentaho Data Integration and Analytics vs SSIS comparison

 

Comparison Buyer's Guide

Executive SummaryUpdated on Dec 19, 2024
 

Categories and Ranking

Pentaho Data Integration an...
Ranking in Data Integration
24th
Average Rating
8.0
Reviews Sentiment
6.9
Number of Reviews
52
Ranking in other categories
No ranking in other categories
SSIS
Ranking in Data Integration
4th
Average Rating
7.6
Reviews Sentiment
6.7
Number of Reviews
72
Ranking in other categories
No ranking in other categories
 

Mindshare comparison

As of December 2024, in the Data Integration category, the mindshare of Pentaho Data Integration and Analytics is 1.5%, up from 0.6% compared to the previous year. The mindshare of SSIS is 8.8%, down from 9.5% compared to the previous year. It is calculated based on PeerSpot user engagement data.
Data Integration
 

Q&A Highlights

RS
Sep 09, 2021
 

Featured Reviews

Ryan Ferdon - PeerSpot reviewer
Low-code makes development faster than with Python, but there were caching issues
If you're working with a larger data set, I'm not so sure it would be the best solution. The larger things got the slower it was. It was kind of buggy sometimes. And when we ran the flow, it didn't go from a perceived start to end, node by node. Everything kicked off at once. That meant there were times when it would get ahead of itself and a job would fail. That was not because the job was wrong, but because Pentaho decided to go at everything at once, and something would process before it was supposed to. There were nodes you could add to make sure that, before this node kicks off, all these others have processed, but it was a bit tedious. There were also caching issues, and we had to write code to clear the cache every time we opened the program, because the cache would fill up and it wouldn't run. I don't know how hard that would be for them to fix, or if it was fixed in version 10. Also, the UI is a bit outdated, but I'm more of a fan of function over how something looks. One other thing that would have helped with Pentaho was documentation and support on the internet: how to do things, how to set up. I think there are some sites on how to install it, and Pentaho does have a help repository, but it wasn't always the most useful.
Sakiru Dosumu - PeerSpot reviewer
Its ability to transform and transport data is extremely valuable
he ability of SSIS to transform and transport data is extremely valuable to me. It allows for intelligent extraction and manipulation of data during the process. Improved error handling would enhance ETL processes further. I haven't directly utilized the data flow components but they seem capable of supporting complex data integration needs.

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"The area where Lumada has helped us is in the commercial area. There are many extractions to compose reports about our sales team performance and production steps. Since we are using Lumada to gather data from each industry in each country. We can get data from Argentina, Chile, Brazil, and Colombia at the same time. We can then concentrate and consolidate it in only one place, like our data warehouse. This improves our production performance and need for information about the industry, production data, and commercial data."
"Sometimes, it took a whole team about two weeks to get all the data to prepare and present it. After the optimization of the data, it took about one to two hours to do the whole process. Therefore, it has helped a lot when you talk about money, because it doesn't take a whole team to do it, just one person to do one project at a time and run it when you want to run it. So, it has helped a lot on that side."
"Provides a good open source option."
"It makes it pretty simple to do some fairly complicated things. Both I and some of our other BI developers have made stabs at using, for example, SQL Server Integration Services, and we found them a little bit frustrating compared to Data Integration. So, its ease of use is right up there."
"We use Lumada’s ability to develop and deploy data pipeline templates once and reuse them. This is very important. When the entire pipeline is automated, we do not have any issues in respect to deployment of code or with code working in one environment but not working in another environment. We have saved a lot of time and effort from that perspective because it is easy to build ETL pipelines."
"I absolutely love Hitachi. I'm one of the forefront supporters of Hitachi for my firm. It's so easy to integrate within our environments. In terms of being able to quickly build ETL jobs, transform, and then automate them, it's really easy to integrate throughout for data analytics."
"We can schedule job execution in the BA Server, which is the front-end product we're using right now. That scheduling interface is nice."
"Flexible deployment, in any environment, is very important to us. That is the key reason why we ended up with these tools. Because we have a very highly secure environment, we must be able to install it in multiple environments on multiple different servers. The fact that we could use the same tool in all our environments, on-prem and in the cloud, was very important to us."
"The setup was easy. All Microsoft products are easy to set up."
"The performance and stability are good."
"The simplicity of the solution is great. The solution also offers excellent integration."
"Its compatibility with Microsoft products has been very valuable to our company. It fits well within the architecture."
"The ability of SSIS to transform and transport data is extremely valuable to me. It allows for intelligent extraction and manipulation of data during the process. Improved error handling would enhance ETL processes further. I haven't directly utilized the data flow components but they seem capable of supporting complex data integration needs."
"It has good data integration and good processes."
"It is easy to set up the solution."
"It's a competent product."
 

Cons

"It's not very stable, at least not in the case of the community edition. I'm working with the community edition right now and I think perhaps it is because of that it is not very stable, it causes the system to sometimes hang. I'm not sure if this is the case for pair tiers."
"​There is not a data quality or MDM solution in the Pentaho DI suite.​"
"I would like to see improvements made for real-time data processing."
"One thing that I don't like, just a little, is the backward compatibility."
"In terms of the flexibility to deploy in any environment, such as on-premise or in the cloud, we can do the cloud deployment only through virtual machines. We might also be able to work on different environments through Docker or Kubernetes, but we don't have an Azure app or an AWS app for easy deployment to the cloud. We can only do it through virtual machines, which is a problem, but we can manage it. We also work with Databricks because it works with Spark. We can work with clustered servers, and we can easily do the deployment in the cloud. With a right-click, we can deploy Databricks through the app on AWS or Azure cloud."
"I would like to see support for some additional cloud sources. It doesn't support Azure, for example. I was trying to do a PoC with Azure the other day but it seems they don't support it."
"The web interface is rusty, and the biggest problem with Pentaho is debugging and troubleshooting. It isn't easy to build the pipeline incrementally. At least in our case, it's hard to find a way to execute step by step in the debugging mode."
"Lumada could have more native connectors with other vendors, such as Google BigQuery, Microsoft OneDrive, Jira systems, and Facebook or Instagram. We would like to gather data from modern platforms using Lumada, which is a better approach. As a comparison, if you open Power BI to retrieve data, then you can get data from many vendors with cloud-native connectors, such as Azure, AWS, Google BigQuery, and Athena Redshift. Lumada should have more native connectors to help us and facilitate our job in gathering information from these new modern infrastructures and tools."
"I come from a coding background and this tool is graphically based. Sometimes I think it's cumbersome to do mapping graphically. If there was a way to provide a simple script, it would be helpful and make it easier to use."
"There are a lot of things that Microsoft could improve in relation to SSIS. One major problem we faced was when attempting to move some Excel files to our SQL Server. The Excel provider has a limitation that prevents importing more than 255 columns from a particular Excel file to the database. This restriction posed a significant issue for us."
"Integration and the user interface are areas with certain shortcomings that require some improvements."
"When I compare Talend and SSIS, Talend provides more features. With Talend, we can handle a large volume of data. Talend is usually used to treat a large volume of data, which makes it better than SSIS on the data side. Talend also has a very good Talend Management Console to schedule the jobs and do other things. It can also be easily connected to version control tools such as GitHub or SVN. The last time I used SSIS, it was connected through TSS for the Windows Console version. I am not sure it has been improved or not. If it is not improved, Microsoft should improve it. They should change the product to provide another console."
"There is connectivity with other databases, however, this is the most significant issue that has to be addressed."
"The high prices attached to the product can be an area of concern where improvements are required."
"In terms of its performance, it could be better. That could be something that would be easy and welcomed as an upgrade."
"SSIS can improve by the minimum code requirements in stored procedures and exporting data is difficult. They could make it easier, it should be as easy as it is to import data."
 

Pricing and Cost Advice

"Sometimes we provide the licenses or the customer can procure their own licenses. Previously, we had an enterprise license. Currently, we are on a community license as this is adequate for our needs."
"There was a cost analysis done and Pentaho did favorably in terms of cost."
"It does seem a bit expensive compared to the serverless product offering. Tools, such as Server Integration Services, are "almost" free with a database engine. It is comparable to products like Alteryx, which is also very expensive."
"When we first started with it, it was much cheaper. It has gone up drastically, especially since Hitachi bought out Pentaho."
"I mostly used the open-source version. I didn't work with a license."
"The price of the regular version is not reasonable and it should be lower."
"You don't need the Enterprise Edition, you can go with the Community Edition. That way you can use it for free and, for free, it's a pretty good tool to use."
"We are using the Community Edition. We have been trying to use and sell the Enterprise version, but that hasn't been possible due to the budget required for it."
"The solution comes free of cost."
"The solution is economical. You don't have to worry about the pricing as long as you're installing both services on the same server."
"It comes bundled with other solutions, which makes it difficult to get the price on the specific product."
"It would be beneficial if the solution had a less costly cloud offering."
"This solution has provided an inexpensive tool, and it is easy to find experienced developers."
"This solution is included with the MSSQL server package."
"SSIS' licensing is a little high, but it gives good value for money."
"Depending on the arrangement that a certain company has with Microsoft, it may supply the permanent license that is included in the SQL server license, or it may be a time-bound license if it is a partner license or other enterprise license."
report
Use our free recommendation engine to learn which Data Integration solutions are best for your needs.
824,053 professionals have used our research since 2012.
 

Comparison Review

it_user90069 - PeerSpot reviewer
Feb 20, 2014
Informatica PowerCenter vs. Microsoft SSIS - each technology has its advantages but also have similarities
Technology has made it easier for businesses to organize and manipulate data to get a clearer picture of what’s going on with their business. Notably, ETL tools have made managing huge amounts of data significantly easier and faster, boosting many organizations’ business intelligence operations…
 

Answers from the Community

RS
Sep 9, 2021
Sep 9, 2021
There are two products I know about * TimeXtender : Microsoft based, Transformation logic is quiet good and can easily be extended with T-SQL , Has a semantic layer that generates metat data for cubes . price approx 40K$, works with tables . Attunity (Bought by Qlik) : technology agnostic , nice web interface , expensive > 100K€. Works with transaction logs There are many other pure ETL ...
2 out of 12 answers
SS
Apr 6, 2020
We usually use Talend. Look here: https://community.talend.com/t5/Design-and-Development/The-way-from-OLTP-to-OLAP/td-p/116719
RS
Apr 7, 2020
Thanks Stefan !!! May I know if Talend is opensource /free ? It is available in all 3 editions - basic , standard end enterprise? it is capable to read data from all heterogeneous system like various types of data bases and files etc..?
 

Top Industries

By visitors reading reviews
Financial Services Firm
23%
Computer Software Company
15%
Government
8%
Comms Service Provider
5%
Financial Services Firm
18%
Computer Software Company
12%
Government
8%
Healthcare Company
6%
 

Company Size

By reviewers
Large Enterprise
Midsize Enterprise
Small Business
 

Questions from the Community

Which ETL tool would you recommend to populate data from OLTP to OLAP?
Hi Rajneesh, yes here is the feature comparison between the community and enterprise edition : https://www.hitachivantara.com/en-us/pdf/brochure/leverage-open-source-benefits-with-assurance-of-hita...
What do you think can be improved with Hitachi Lumada Data Integrations?
In my opinion, the reporting side of this tool needs serious improvements. In my previous company, we worked with Hitachi Lumada Data Integration and while it does a good job for what it’s worth, ...
What do you use Hitachi Lumada Data Integrations for most frequently?
My company has used this product to transform data from databases, CSV files, and flat files. It really does a good job. We were most satisfied with the results in terms of how many people could us...
Which is better - SSIS or Informatica PowerCenter?
SSIS PowerPack is a group of drag and drop connectors for Microsoft SQL Server Integration Services, commonly called SSIS. The collection helps organizations boost productivity with code-free compo...
What do you like most about SSIS?
The product's deployment phase is easy.
 

Also Known As

Hitachi Lumada Data Integration, Kettle, Pentaho Data Integration
SQL Server Integration Services
 

Overview

 

Sample Customers

66Controls, Providential Revenue Agency of Ro Negro, NOAA Information Systems, Swiss Real Estate Institute
1. Amazon.com 2. Bank of America 3. Capital One 4. Coca-Cola 5. Dell 6. E*TRADE 7. FedEx 8. Ford Motor Company 9. Google 10. Home Depot 11. IBM 12. Intel 13. JPMorgan Chase 14. Kraft Foods 15. Lockheed Martin 16. McDonald's 17. Microsoft 18. Morgan Stanley 19. Nike 20. Oracle 21. PepsiCo 22. Procter & Gamble 23. Prudential Financial 24. RBC Capital Markets 25. SAP 26. Siemens 27. Sony 28. Toyota 29. UnitedHealth Group 30. Visa 31. Walmart 32. Wells Fargo
Find out what your peers are saying about Pentaho Data Integration and Analytics vs. SSIS and other solutions. Updated: November 2024.
824,053 professionals have used our research since 2012.