
Pentaho Data Integration and Analytics vs SSIS comparison

 

Comparison Buyer's Guide

Executive Summary
 

Categories and Ranking

Pentaho Data Integration an...
Ranking in Data Integration: 30th
Average Rating: 8.0
Reviews Sentiment: 5.8
Number of Reviews: 51
No ranking in other categories

SSIS
Ranking in Data Integration: 4th
Average Rating: 7.8
Reviews Sentiment: 6.9
Number of Reviews: 71
No ranking in other categories
 

Mindshare comparison

As of November 2024, in the Data Integration category, the mindshare of Pentaho Data Integration and Analytics is 1.4%, up from 0.5% a year earlier. The mindshare of SSIS is 8.6%, down from 9.6% a year earlier. Mindshare is calculated based on PeerSpot user engagement data.
 

 

Featured Reviews

Ryan Ferdon - PeerSpot reviewer
Low-code makes development faster than with Python, but there were caching issues
If you're working with a larger data set, I'm not so sure it would be the best solution. The larger things got the slower it was. It was kind of buggy sometimes. And when we ran the flow, it didn't go from a perceived start to end, node by node. Everything kicked off at once. That meant there were times when it would get ahead of itself and a job would fail. That was not because the job was wrong, but because Pentaho decided to go at everything at once, and something would process before it was supposed to. There were nodes you could add to make sure that, before this node kicks off, all these others have processed, but it was a bit tedious. There were also caching issues, and we had to write code to clear the cache every time we opened the program, because the cache would fill up and it wouldn't run. I don't know how hard that would be for them to fix, or if it was fixed in version 10. Also, the UI is a bit outdated, but I'm more of a fan of function over how something looks. One other thing that would have helped with Pentaho was documentation and support on the internet: how to do things, how to set up. I think there are some sites on how to install it, and Pentaho does have a help repository, but it wasn't always the most useful.
Sakiru Dosumu - PeerSpot reviewer
Its ability to transform and transport data is extremely valuable
The ability of SSIS to transform and transport data is extremely valuable to me. It allows for intelligent extraction and manipulation of data during the process. Improved error handling would enhance ETL processes further. I haven't directly utilized the data flow components but they seem capable of supporting complex data integration needs.

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"Flexible deployment, in any environment, is very important to us. That is the key reason why we ended up with these tools. Because we have a very highly secure environment, we must be able to install it in multiple environments on multiple different servers. The fact that we could use the same tool in all our environments, on-prem and in the cloud, was very important to us."
"It has a really friendly user interface, which is its main feature. The process of automating or combining SQL code with some databases and doing the automation is great and really convenient."
"The fact that it enables us to leverage metadata to automate data pipeline templates and reuse them is definitely one of the features that we like the best. The metadata injection is helpful because it reduces the need to create and maintain additional ETLs. If we didn't have that feature, we would have lots of duplicated ETLs that we would have to create and maintain. The data pipeline templates have definitely been helpful when looking at productivity and costs."
"It is easy to use, install, and start working with."
"Its drag-and-drop interface lets me and my team implement all the solutions that we need in our company very quickly. It's a very good tool for that."
"The area where Lumada has helped us is in the commercial area. There are many extractions to compose reports about our sales team performance and production steps. Since we are using Lumada to gather data from each industry in each country. We can get data from Argentina, Chile, Brazil, and Colombia at the same time. We can then concentrate and consolidate it in only one place, like our data warehouse. This improves our production performance and need for information about the industry, production data, and commercial data."
"Provides a good open source option."
"This solution allows us to create pipelines using a minimal amount of custom coding."
"It has the ability to be deployed into the cloud through Data Factory, and run completely as a software as a service in the cloud."
"The performance is good."
"It is also easy to learn and user-friendly. Microsoft is also good in terms of technical support. They have built a large community all over the world."
"The technical support is very good."
"I like that this solution is very scalable, accommodating large datasets and various types of servers. It integrates with most common database servers and allows for customization through coding, including complex scripts. Compared to Alteryx Designer, SSIS offers more customization. Its data cleaning capabilities are highly accurate because we can run tests as data is loaded, ensuring it meets all requirements before reaching the final destination. The ability to write custom SQL and C# code within SSIS packages is its greatest feature."
"The setup is straightforward. It's very easy to install."
"The ability of SSIS to transform and transport data is extremely valuable to me. It allows for intelligent extraction and manipulation of data during the process. Improved error handling would enhance ETL processes further. I haven't directly utilized the data flow components but they seem capable of supporting complex data integration needs."
"The simplicity of the solution is great. The solution also offers excellent integration."
 

Cons

"The performance could be improved. If they could have analytics perform well on large volumes, that would be a big deal for our products."
"The web interface is rusty, and the biggest problem with Pentaho is debugging and troubleshooting. It isn't easy to build the pipeline incrementally. At least in our case, it's hard to find a way to execute step by step in the debugging mode."
"I would like to see more improvements with AS400 DB2."
"The product needs more plugins."
"The support for the Enterprise Edition is okay, but what they have done in the last three or four years is move more and more things to that edition. The result is that they are breaking the Community Edition. That's what our impression is."
"I'm still in the very recent stage concerning Pentaho Data Integration, but it can't really handle what I describe as "extreme data processing" i.e. when there is a huge amount of data to process. That is one area where Pentaho is still lacking."
"I work with different databases. I would like to work with more connectors to new databases, e.g., DynamoDB and MariaDB, and new cloud solutions, e.g., AWS, Azure, and GCP. If they had these connectors, that would be great. They could improve by building new connectors. If you have native connections to different databases, then you can make instructions more efficient and in a more natural way. You don't have to write any scripts to use that connector."
"Some of the scheduling features about Lumada drive me buggy. The one issue that always drives me up the wall is when Daylight Savings Time changes. It doesn't take that into account elegantly. Every time it changes, I have to do something. It's not a big deal, but it's annoying."
"The interface could use improvement, as well as the administrative tools. Jobs fail from time to time for different reasons. It's not a problem with Microsoft, or SSIS itself. The problems are external, but to find the problems and analyze them it takes too much time."
"We purchase an add on called task factory primarily to allow bulk delete, update and upsert capability. I'd like to see this be part of the standard package."
"We've had issues in terms of the amount of data that is transferred when we are scheduling."
"When I compare Talend and SSIS, Talend provides more features. With Talend, we can handle a large volume of data. Talend is usually used to treat a large volume of data, which makes it better than SSIS on the data side. Talend also has a very good Talend Management Console to schedule the jobs and do other things. It can also be easily connected to version control tools such as GitHub or SVN. The last time I used SSIS, it was connected through TSS for the Windows Console version. I am not sure it has been improved or not. If it is not improved, Microsoft should improve it. They should change the product to provide another console."
"I would also like to see full integration with our BI because then our full load of data will be available in our organization. They should incorporate an ATL process."
"Improvement as per customer requirements."
"Sometimes, there are compatibility issues with some features. From time to time, I also face issues when trying to migrate. If I misconfigure things when I use Snapshot, the migration will fail.It can take a long time to migrate huge amounts of data, so it would be nice if that could be faster."
"The solution should work on the GPU, graphical processing unit. There should also be piping integration available."
 

Pricing and Cost Advice

"It does seem a bit expensive compared to the serverless product offering. Tools, such as Server Integration Services, are "almost" free with a database engine. It is comparable to products like Alteryx, which is also very expensive."
"I mostly used the open-source version. I didn't work with a license."
"There was a cost analysis done and Pentaho did favorably in terms of cost."
"For most development tasks, the Enterprise edition should be sufficient. It depends on the type of support that you require for your production environment."
"When we first started with it, it was much cheaper. It has gone up drastically, especially since Hitachi bought out Pentaho."
"The pricing has been pretty good. I'm used to using everything open-source or freeware-based. I understand that organizations need to make sure that the solutions are secure, and that's basically where I hit a roadblock in my current organization. They needed to ensure that we had a license and we had a secure way of accessing it so that no outside parties could get access to our data, but in terms of pricing, considering how much other teams are spending on cloud solutions or even their existing solutions, its price point is pretty good. At this time, there are no additional costs. We just have the licensing fees."
"I use it because it is free. I download from their page for free. I don't have to pay for a license. With other tools, I have to pay for the licenses. That is why I use Pentaho."
"The solution reduced our ETL development time by a lot because a whole project used to take about a month to get done previously. After having Lumada, it took just a week. For a big company in Brazil, it saves a team at least $10,000 a month."
"SSIS is a cheaper option compared to the cost of other ETL tools."
"t's incredibly cost effective, easy to learn the basics quickly (although like all ETL tools requires the traditional learning curve to get good at) and has an immense user base."
"We purchased the standard edition of SQL Server and SSIS came with it free of charge."
"It comes bundled with other solutions, which makes it difficult to get the price on the specific product."
"People have to opt for a perpetual-based licensing model."
"SSIS is fairly well-priced - I would rate it at four out of five."
"If you don't want to pay a lot of money, you can go for SSIS, as its open-source version is available. When it comes to licensing, SSIS can be expensive."
"The solution is economical. You don't have to worry about the pricing as long as you're installing both services on the same server."
816,406 professionals have used our research since 2012.
 

Comparison Review

it_user90069 - PeerSpot reviewer
Feb 20, 2014
Informatica PowerCenter vs. Microsoft SSIS - each technology has its advantages, but they also have similarities
Technology has made it easier for businesses to organize and manipulate data to get a clearer picture of what’s going on with their business. Notably, ETL tools have made managing huge amounts of data significantly easier and faster, boosting many organizations’ business intelligence operations…
 

Answers from the Community

RS
Sep 9, 2021
There are two products I know about:
* TimeXtender: Microsoft-based; the transformation logic is quite good and can easily be extended with T-SQL; it has a semantic layer that generates metadata for cubes; the price is approximately $40K; it works with tables.
* Attunity (bought by Qlik): technology-agnostic, with a nice web interface; expensive (> €100K); it works with transaction logs.
There are many other pure ETL ...
2 out of 12 answers
SS
Apr 6, 2020
We usually use Talend. Look here: https://community.talend.com/t5/Design-and-Development/The-way-from-OLTP-to-OLAP/td-p/116719
RS
Apr 7, 2020
Thanks, Stefan! May I know if Talend is open source/free? Is it available in all 3 editions (basic, standard, and enterprise)? Is it capable of reading data from all heterogeneous systems, such as various types of databases and files?
 

Top Industries

By visitors reading reviews
Financial Services Firm: 23%
Computer Software Company: 14%
Government: 7%
Comms Service Provider: 5%

Financial Services Firm: 17%
Computer Software Company: 12%
Government: 8%
Healthcare Company: 6%
 

Company Size

[Chart: company size by reviewers (Large Enterprise, Midsize Enterprise, Small Business)]
 

Questions from the Community

Which ETL tool would you recommend to populate data from OLTP to OLAP?
Hi Rajneesh, yes, here is the feature comparison between the community and enterprise editions: https://www.hitachivantara.com/en-us/pdf/brochure/leverage-open-source-benefits-with-assurance-of-hita...
What do you think can be improved with Hitachi Lumada Data Integrations?
In my opinion, the reporting side of this tool needs serious improvements. In my previous company, we worked with Hitachi Lumada Data Integration and while it does a good job for what it’s worth, ...
What do you use Hitachi Lumada Data Integrations for most frequently?
My company has used this product to transform data from databases, CSV files, and flat files. It really does a good job. We were most satisfied with the results in terms of how many people could us...
Which is better - SSIS or Informatica PowerCenter?
SSIS PowerPack is a group of drag and drop connectors for Microsoft SQL Server Integration Services, commonly called SSIS. The collection helps organizations boost productivity with code-free compo...
What do you like most about SSIS?
The product's deployment phase is easy.
 

Also Known As

Hitachi Lumada Data Integration, Kettle, Pentaho Data Integration
SQL Server Integration Services
 

Overview

 

Sample Customers

66Controls, Providential Revenue Agency of Río Negro, NOAA Information Systems, Swiss Real Estate Institute
Amazon.com, Bank of America, Capital One, Coca-Cola, Dell, E*TRADE, FedEx, Ford Motor Company, Google, Home Depot, IBM, Intel, JPMorgan Chase, Kraft Foods, Lockheed Martin, McDonald's, Microsoft, Morgan Stanley, Nike, Oracle, PepsiCo, Procter & Gamble, Prudential Financial, RBC Capital Markets, SAP, Siemens, Sony, Toyota, UnitedHealth Group, Visa, Walmart, Wells Fargo
Find out what your peers are saying about Pentaho Data Integration and Analytics vs. SSIS and other solutions. Updated: November 2024.