- Compatibility with the rest of Microsoft BI stack
- Scripting component which opens unlimited possibilities: by having the full .Net stack/libraries available, if you need a yellow rubber duck to quack every million rows, it will quack
- Inter-stream dependency mechanism built directly into the tool; the whole ETL solution can be driven off it without having to employ third party software
- Parallel processing of a data flow: let's say there's five steps in the flow and ech step consumes output from the preceding step; once step #1 is completed and its data passed to step #2, the engine will feed the next batch of rows to step #1. Once the first batch of rows reaches the last step, there are five concurrent steps working on five different data sets, maximizing utilization of server resources
- All the necessary types of enumerators (files, xml nodes, rows and many more)
Business Intelligence Consultant at a tech services company with 51-200 employees
All the necessary types of enumerators are available, but writing custom components and sharing them across multiple ETL streams is tricky.
What is most valuable?
How has it helped my organization?
- SSIS packages are stored in XML-like format; we keep them in SVN which makes it very easy to track changes
- Built-in dependency tool is really great; you can not only react to success/failure results, but you can evaluate any expression in the flow and execute (or bypass) any components based on the result of that expression. This gives you full control to build decision trees of any complexity
What needs improvement?
- Writing custom components and sharing them across multiple ETL streams is tricky and requires specialistic voodoo knowledge
- Make the GUI less eye-candy and more responsive; especially the scripting component needs some TLC regarding UI responsiveness
How was the initial setup?
It's easy to deploy, you just need a simple file-copy mechanism that is sufficient for most deployment scenarios.
Buyer's Guide
SSIS
October 2024
Learn what your peers think about SSIS. Get advice and tips from experienced pros sharing their opinions. Updated: October 2024.
816,406 professionals have used our research since 2012.
What other advice do I have?
The good:
- Straightforward, intuitive, quick to learn
- Built-in debugger: variables, data viewers, breakpoints - everything you would expect from a modern software development tool
- Sleek, snappy UI
- Great flexibility with storing and deploying ETL components
The bad:
- Third party plug-ins may become incompatible with future versions of SSIS
- Upgrading is usually painful and time consuming despite what MS says
- Compatibility issues may arise when used with non-Microsoft technologies
Disclosure: I am a real user, and this review is based on my own experience and opinions.
Director at Ingenia
Easy to install, performs well, and provides good data integration functionality
Pros and Cons
- "The performance and stability are good."
- "The security could be improved, as it is more important in our context."
What is our primary use case?
We are a solution provider and SSIS is one of the products that we implement for our clients. I work as an integrator and a data flow developer.
SSIS is primarily used as part of the data flow for loading data into the data warehouse and exchanging data between applications.
What is most valuable?
The most valuable feature is the data integration process. It includes helpful functions such as data mapping, creating a connection, and loading.
The performance and stability are good.
What needs improvement?
The security could be improved, as it is more important in our context.
For how long have I used the solution?
I have been working with SQL Server Integration Services (SSIS) for six years.
What do I think about the stability of the solution?
The stability is good.
What do I think about the scalability of the solution?
We have four customers that are using this product.
How are customer service and support?
I have never been in contact with technical support.
How was the initial setup?
It's easy to install and create the first flow.
What about the implementation team?
We have an in-house team of four engineers for maintenance.
What's my experience with pricing, setup cost, and licensing?
All of my clients have this product included as part of their Microsoft license.
What other advice do I have?
In summary, this is a good product and I recommend it. For people that are working in a Microsoft environment with the SQL Server database, it's the most recommended tool.
I would rate this solution an eight out of ten.
Which deployment model are you using for this solution?
On-premises
Disclosure: My company has a business relationship with this vendor other than being a customer: Integrator
Buyer's Guide
SSIS
October 2024
Learn what your peers think about SSIS. Get advice and tips from experienced pros sharing their opinions. Updated: October 2024.
816,406 professionals have used our research since 2012.
Consultant at Veear Analytics pvt limited
A useful solution, but the stability is a big problem
Pros and Cons
- "The data reader is the most valuable feature."
- "We have a stability problem because when something works, it works one time. The next time, it doesn't work."
What is our primary use case?
I have used this solution for integrating SQL into life insurance projects.
What is most valuable?
The data reader is the most valuable feature.
What needs improvement?
If something is working, and then it doesn't work after it goes live, that creates a big problem. This solution is a useful tool, but the stability is a problem.
For how long have I used the solution?
I have been working with this solution for eight or nine months.
What do I think about the stability of the solution?
We have a stability problem because when something works, it works one time. The next time, it doesn't work. We don't know what is causing that problem.
What do I think about the scalability of the solution?
The solution is scalable. In the current use case, we have about 25 users of the solution.
How was the initial setup?
I found the setup comparatively easy. Within a day, anybody can start using it. Two or three people are more than enough for deployment and maintenance.
What other advice do I have?
The solution is working fine and it is useful. The only problem is the stability.
I rate this solution as a seven out of ten.
Which deployment model are you using for this solution?
Public Cloud
Disclosure: I am a real user, and this review is based on my own experience and opinions.
Assistant Manager at a tech services company with 10,001+ employees
In addition to the ETL challenges, it allows us to do DBA activitities, scripting and many other tasks.
Pros and Cons
- "In SSIS, the scope is not only to handle ETL challenges, but it will allow us to do so many other tasks, such as DBA activities, scripting, calling any .exe or scripts, etc."
- "SSIS can improve in handling different data sources like Salesforce connectivity, Oracle Cloud's connectivity, etc."
How has it helped my organization?
For the full version of the SQL Server, SSIS, SSAS and SSRS will come as additional features for free. Hence, my organization need not spend extra money for other ETL, reporting and analysis tools. This can give very good flexibility.
What is most valuable?
In SSIS, the scope is not only to handle ETL challenges, but it will allow us to do so many other tasks, such as DBA activities, scripting, calling any .exe or scripts, etc.
What needs improvement?
SSIS can improve in handling different data sources like Salesforce connectivity, Oracle Cloud's connectivity, etc. Also, handling of the different data types will be a big challenge here; so expecting improvement in these areas.
What do I think about the stability of the solution?
There were no stability issues.
What do I think about the scalability of the solution?
There were no scalability issues.
How are customer service and technical support?
Technical support is very good.
Which solution did I use previously and why did I switch?
Based on the client's requirements, we switched over to this solution.
How was the initial setup?
The setup is straightforward.
Which other solutions did I evaluate?
We looked at Informatica and DataStage.
What other advice do I have?
Don't worry, go ahead.
Disclosure: My company has a business relationship with this vendor other than being a customer:
Brian DandeneauBusiness Process and Strategy Specialist Advisor at NTTData
Top 5LeaderboardConsultant
Thanks for the review and keep them coming.
I like what you put in here for improvements. However, don't hold your breath for Oracle Cloud integration. Most Oracle ETL/ELT tools don't have direct cloud integration yet and its not on the roadmap for a few years.
V/r,
Brian Dandeneau
CEO Applied Governance
Engineer at a tech services company with 501-1,000 employees
SSIS vs. BizTalk - which is best for integration with Dynamics Ax
During the last couple of years, I have integrated data with Dynamics Ax both with SSIS and BizTalk. A common question I'm asked is what is the difference when every thing is possible in SSIS why do we need BizTalk or what does BizTalk provide different from SSIS. So My answer to this question is something like:
Everything that BizTalk provides can be implemented in SSIS. But the major difference is batch processing. Usually SSIS package are used to migrate large set of data or dataset. BizTalk provide the operations to be perform on one message at time or real time processing. Because everything in BizTalk is XML so BizTalk is very slow on large set of data. BizTalk provides large number of adapters, while In SSIS you have to use direct connection by Oldb, or Sql db to communicate with different database and depend on OlDb connections. In BizTalk large number of Adapter provided to communicate which may or may not be depend on OlDB connection. Build in Tracking system (BAM) and its display on BAM portal is also big advantage on SSIS. For this purpose you have to make a custom tracking system in SSIS which require a lot of coding. Third advantage of BizTalk over SSIS is BRE. Business rule engine. BRE provide the condition whose value can be changed and complete follow of BizTalk application. These BRE roles can be used in multiple biztalk application while these functionality can be achieved on config files in SSIS.
In conclusion, when we required less data integration/migration and require complex decision making we used BizTalk. For example we have to implement complex work flow on single record. BizTalk application also used route data, read from one location, transform it and drop on other location. A simple example of this transactional data, when one transaction is occur in one system and its impact or integration will required on other system we will use BizTalk. BizTalk is a rapid development tool as compare to SSIS.
When we have a large sum of data, we require less complexity and requirement of integrated systems are based on Same technology then we have to use SSIS. Usually SSIS is used to migrate or integrate the non-transaction data or step up data. The delay of migration and integration possible or example Batch processing. SSIS is built for ETL process, it is not rapid integration tool.
Disclosure: I am a real user, and this review is based on my own experience and opinions.
But I think the cost is a big factor to go for biztalk or ssis
Big Data Architect, Integration Specialist at Groundswell Group Inc
Easy to implement and supports many connectors, but performance-tuning requires extensive expertise
Pros and Cons
- "This solution is easy to implement, has a wide variety of connectors, has support for Visual Basic, and supports the C language."
- "Tuning using this solution requires extensive expertise to improve performance."
What is our primary use case?
Our primary use for this solution is to move data between points and applications.
How has it helped my organization?
This is a flexible tool to use and comes with the MSSQL server package.
What is most valuable?
This solution is easy to implement, has a wide variety of connectors, has support for Visual Basic, and supports the C language.
What needs improvement?
Tuning using this solution requires extensive expertise to improve performance.
For how long have I used the solution?
We have been using this solution for fifteen years.
What's my experience with pricing, setup cost, and licensing?
This solution is included with the MSSQL server package.
Disclosure: I am a real user, and this review is based on my own experience and opinions.
Owner at 7Spring Consult
Offers excellent integration and has great simplicity
Pros and Cons
- "The simplicity of the solution is great. The solution also offers excellent integration."
- "The interface could use improvement, as well as the administrative tools. Jobs fail from time to time for different reasons. It's not a problem with Microsoft, or SSIS itself. The problems are external, but to find the problems and analyze them it takes too much time."
What is most valuable?
The simplicity of the solution is great. The solution also offers excellent integration.
What needs improvement?
The interface could use improvement, as well as the administrative tools. Jobs fail from time to time for different reasons. It's not a problem with Microsoft, or SSIS itself. The problems are external, but to find the problems and analyze them it takes too much time. If SSIS could make some new monitors or new features for finding the reasons for the problems in the processes and analyzing it for how to correct the issues, that would be great.
For how long have I used the solution?
I've been using the solution for over six months.
What do I think about the stability of the solution?
The solution is stable.
What do I think about the scalability of the solution?
Right now we're working with Azure, and it's scalable, but it's expensive to do so. If we had features to scale the software part of the solution, it would be great.
How are customer service and technical support?
We have support not only for SSIS but Microsoft as a whole because we're an official partner of Microsoft in data warehousing and have a partnership agreement with the company. However, support seems to be connected with sales, and if I call with technical problems and ask if they have some feature to solve the problem, months can go by with no results.
They do, however, have a professional community online and in 95% of our incidents, we can find the solution there instead of calling.
How was the initial setup?
The initial setup was simple and straightforward.
What other advice do I have?
I would recommend the solution. It's always worked for me and it's usually a top choice for my clients.
I would rate this solution at seven or eight out of ten. It's not a perfect solution, but it works well.
Disclosure: My company has a business relationship with this vendor other than being a customer: Partner.
Architect at a marketing services firm with 51-200 employees
SSIS vs Hadoop
On one corner we have Hadoop, a massively distributed JVM-based data processing engine with a Map & Reduce API and a proven track record in handling huge data-sets. On the other corner we have SSIS, a natively non-distributed ETL engine part of the SQL Server family tool-set with .NET code extensibility features and a drag and drop UI (for the most part anyway). Two sweet technologies, probably shouldn’t be compared to each other but we’re doing it anyway, pitted head to head against a data mapping task to the death (or at least to the recycling of my test VMs)… Now FIGHT!
Recently I have been tasked with building a data processing layer tracking social signals with the following characteristics:
- Input data is flat files. Although initially the amount of data might not be classified under “Big Data” per-say, but certainly had the potential to grow very quickly. Files were very small JSON format (1 KB average).
- Output data is flat files. Delimited file which will be queried through a Hive Warehouse layer.
- Data is only Mapped and not Reduced.Which means data is only extracted from the flat files and processed but never aggregated, and in any case SSIS is not capable of reducing (or aggregating) data in a scale-out architecture without building a custom intermediary layer (such as temporarily placing data in a database).
- Data Latency into Hive is of Paramount Importance.
Both technologies are capable of iterating through a large number of flat files, extracting information and building an output, and when we take the Reduce operation out of the equation, we level the playing field and now both technologies can be scaled out, albeit Hadoop in a perhaps more friendly manner.
Although these technologies have a wider application and usage that they might be better suited to, in this experiment I was only interested in performance figures on this basic task.
In order to test these technologies against the mapping task, I have built two test machines, one for SSIS with SQL Server to support the SSIS Catalogue database, and another for a simple 3 node Hadoop cluster, the technical specification for each scenario is as follows:
Integration Service (SSIS) | Hadoop | |
CPU | 4 Cores / Node | 2 Cores / Node |
RAM | 8 GB / Node | 3 GB / Node |
Nodes | 1 VM | 3 VMs |
OS | Windows Server 2012 | CentOS |
Edition | SQL Server 2012 | Cloudera CDH 4 |
Although the specifications for each test setup is slightly different, which makes the comparison fairly “unscientific”, the over-all processing resources available for each test scenario should be fairly comparable, with the Hadoop cluster gaining a slight edge in terms of over-all CPU cores and RAM. Besides, we are only looking for a really considerable difference in the result to warrant a favouritism of one technology over the other in this business requirement.
I ran two test scenarios:
- Scenario 1: 33,000 small (1KB) JSON input files, each file will have about 5 – 10 values to extract against a key (mapping).
- Scenario 2: 33 input files (every 1,000 files in scenario 1 is concatenated)
The results of the test were as follows:
Scenario 1 (33,000) | Scenario 2 (33) | |
SSIS | 14.5 (Min) | 3.94 (Sec) |
Hadoop Cluster | 957 (Min) | 134 (Sec) |
As can be deduced from the results above, 1 SSIS instance showed up to 66X better performance in handling and processing flat files than the same job running in a Hadoop cluster.
Learnings from SSIS vs Hadoop Test
There are a few key learnings that has been gained by doing this experiment:
- Hadoop has a terrible start time when operating on a file, the processing engine could take up-to 5 seconds before it could actually start processing the file, were SSIS takes less than 0.2 of a second. Java has never been a very agile language in my opinion.
- Hadoop is not intended to handle a large number of small files, instead try combining smaller files into bigger concatenations. Sometimes it is considerably faster to have a pre-processing step that concatenates files into smaller batches.
- Although the number of “Reducers” for a Hadoop job could be easily controlled, it is more difficult to control how many “Mappers” available for a job across the cluster, and Hadoop does not always adhere to the user-set number of Mappers.
- Although SSIS outperforms Hadoop by an average of 50X on this simple task, Hadoop scales in a much more user-friendly manner, and allows users to “Reduce” or aggregate the data across all nodes for a particular job, a feature that is not supported by the out-of-the-box Integration Service.
- Don’t just jump on new technologies, you need to test it and ensure that it is suitable for your particular business requirement, Hadoop is a great distributed processing engine when used in the correct context. It is too easy these days for managers and BI people to band around the term “Hadoop” for everything “Big Data”, from data processing to warehousing, but you need to take the time to separate the wheat from the chaff.
- HDInsight (Microsoft’s Hadoop distribution which runs on Windows and Azure) was another technology that we were investigating at the time, although performance was extremely terrible that it was eliminated from the race fairly quickly.
Disclosure: I am a real user, and this review is based on my own experience and opinions.
Buyer's Guide
Download our free SSIS Report and get advice and tips from experienced pros
sharing their opinions.
Updated: October 2024
Product Categories
Data IntegrationPopular Comparisons
Informatica Intelligent Data Management Cloud (IDMC)
Azure Data Factory
Informatica PowerCenter
Teradata
Oracle Data Integrator (ODI)
Talend Open Studio
IBM InfoSphere DataStage
Oracle GoldenGate
Palantir Foundry
SAP Data Services
Qlik Replicate
StreamSets
Alteryx Designer
Denodo
Fivetran
Buyer's Guide
Download our free SSIS Report and get advice and tips from experienced pros
sharing their opinions.
Quick Links
Learn More: Questions:
- Which ETL or Data Integration tool goes the best with Amazon Redshift?
- Which ETL tool would you recommend to populate data from OLTP to OLAP?
- Microsoft SSIS vs. Informatica PowerCenter - which solution has better features?
- What Is The Biggest Difference Between Informatica PowerCenter and Microsoft SSIS?
- What is the biggest difference between SSIS and Pentaho?
- What are the pros and cons of SSIS?
- Which is better - SSIS or Informatica PowerCenter?
- When evaluating Data Integration, what aspect do you think is the most important to look for?
- Microsoft SSIS vs. Informatica PowerCenter - which solution has better features?
- What are the best on-prem ETL tools?
Thanks Gary. Yes, I am aware of the new SSISDB database and I am slowly migrating some of the projects to it. It is much more convenient, especially the way you can configure your solution on per-environment basis. Plus it is flexible, too. You can have just one instance of the SSISDB database for all your environments or one instance per environment, or even some hybrid mode. You can control and track progress of your executions using purely T-SQL, so lots of goodies. Haven't had a chance to play around with 2016 yet but things seem to be going in the right direction anyway.