Try our new research platform with insights from 80,000+ expert users
Project Manager at Blue Technology
Real User
Top 5
A highly-stable and scalable solution with seamless integration capabilities
Pros and Cons
  • "The concept of integration is a valuable feature of the product."
  • "The graphical user interface (GUI) feels a lot like the interfaces from the 1980s."

What is our primary use case?

We use it for many kinds of projects. For example, we use it for business intelligence, master data management, data quality, data governance, data integration to SAP, Oracle retail, and real-time integration.

What is most valuable?

The concept of integration is a valuable feature of the product. The product excels in this area. In fact, it is one of the two leading products in the integration field. 

Based on my experience, interacting with data through integration has been more profitable than any other products or projects. Data integration is a reliable feature.

What needs improvement?

The graphical user interface (GUI) feels a lot like the interfaces from the 1980s. Regarding IBM, they initially indicated that version 8.11.7 would be their final release. We require more connectors to establish clear connections to cloud services. Connecting to the cloud is not easy or transparent within the product. Although the product has the potential to connect to the cloud, configuring and setting it up is challenging.

In future releases, connecting to the cloud should be easy and transparent.

For how long have I used the solution?

I’ve been using this solution since 2005. I’m currently using version 11.7.

Buyer's Guide
IBM InfoSphere DataStage
January 2025
Learn what your peers think about IBM InfoSphere DataStage. Get advice and tips from experienced pros sharing their opinions. Updated: January 2025.
832,083 professionals have used our research since 2012.

What do I think about the stability of the solution?

The stability of the solution is ten on ten.

What do I think about the scalability of the solution?

The scalability of the solution is nine on ten.

How was the initial setup?

Setting up the system is difficult and requires a lot of technical expertise. It demands a high level of experience during the setup process. 

However, despite the difficulty, our team, possibly the only one in Mexico, is good at quickly completing the setup.

Most of our implementations or installations are on-premises, accounting for approximately 90% of the total. The remaining 10% of our implementations are on the cloud.

What was our ROI?

The product has proven to be exceptional and highly competitive in the market. As a result, I have experienced substantial financial success with the product. There are many examples we have for the ROI.

What's my experience with pricing, setup cost, and licensing?

The price of the product is reasonable, especially for mid-sized companies. We have been successfully selling to several mid-sized companies.

Which other solutions did I evaluate?

The market is indeed filled with numerous solutions. Currently, I am exploring Microsoft's offerings, particularly their integration capabilities. Also, the purpose and functionalities of Azure are pretty interesting. I have been working with Quick Link, and specifically, I have been working with Azure Database.

What other advice do I have?

If you want to learn about integrating data, this product offers a valuable learning opportunity. Then you can consider transitioning to the next product in line. The underlying concept remains the same. 

I believe this solution is more reliable and provides a better understanding of data integration, data quality management, and master data management. It covers all aspects of data management, making it easier to learn. 

This is my perspective, possibly influenced by my extensive experience working with this product for many years. It is the main strength of this solution.

Overall, I would rate the solution a ten out of ten. 

Disclosure: I am a real user, and this review is based on my own experience and opinions.
PeerSpot user
Muharrem Iseri - PeerSpot reviewer
Managing Partner at a tech services company with 11-50 employees
Real User
Top 5Leaderboard
Easy to understand to monitor the data lineage from source to target but pricing could be better
Pros and Cons
  • "IBM is stable and accurate to monitor. It's easy to understand to monitor the data lineage from source to target."
  • "DataStage is quite expensive. It is too hard to find a consultant using DataStage in Turkey."

What is our primary use case?

IBM InfoSphere DataStage is a core ETL tool. We use it with source systems like mainframes. DataStage is perfectly suited for extracting data from mainframes.

What is most valuable?

IBM is stable and accurate to monitor. It's easy to understand to monitor the data lineage from source to target.

What needs improvement?

DataStage is quite expensive. It is too hard to find a consultant using DataStage in Turkey.

For how long have I used the solution?

I have been using IBM InfoSphere DataStage for three years. I also used this solution for two years back in 2009-10.

What do I think about the stability of the solution?

The product is stable.

I rate the solution’s stability a nine out of ten.

What do I think about the scalability of the solution?

I rate the solution’s scalability an eight out of ten.

How are customer service and support?

The quality and response time of support is fine. It's pretty quick.

How would you rate customer service and support?

Positive

Which solution did I use previously and why did I switch?

Informatica is the first choice for me. It's easy to use and not so expensive compared to DataStage.

How was the initial setup?

You install IBM InfoSphere DataStage once you've set it up properly. It's robust and reliable, but initially configuring it can be challenging.

What other advice do I have?

The first consideration is the type of source system they have, whether it is a mainframe or not. Another key indicator for me to suggest DataStage is if the client has other IBM ecosystems, such as data quality or IBM governance tools. This makes it highly suitable because you can easily establish data lineage.

Overall, I rate the solution a seven out of ten.

Which deployment model are you using for this solution?

On-premises
Disclosure: My company has a business relationship with this vendor other than being a customer: Partner
PeerSpot user
Buyer's Guide
IBM InfoSphere DataStage
January 2025
Learn what your peers think about IBM InfoSphere DataStage. Get advice and tips from experienced pros sharing their opinions. Updated: January 2025.
832,083 professionals have used our research since 2012.
Partner at Avydium Data LLC
User
Its parallel processing capability allows you to go through extremely large data sets in no time at all
Pros and Cons
  • "Highly customizable: Allowing you to handle multiple data latencies (scheduled batch, on-demand, and real-time) in the same job."
  • "Working with some of the big data components is good, but I can see improvements are needed."

What is our primary use case?

Complex data integration projects which require integration from multiple data sources.

How has it helped my organization?

I have worked during many implementations using DataStage. All of the projects that I worked on have been successful. This is due mainly to the strict discipline around best practices, and by following a set of standards and templates designed to reduce complexity and improve automation, including strong reference architecture.

What is most valuable?

  • Its parallel processing capability allows you to go through extremely large data sets in no time at all, if you do your job right. 
  • Highly customizable: Allowing you to handle multiple data latencies (scheduled batch, on-demand, and real-time) in the same job. 
  • High scalability: Start small and go big with the same job. You just need to adjust the configuration file, no need to recompile.
  • Strong metadata management: Business, technical, and process metadata can all be managed from a single place.
  • Ease of integration with other tool sets: Easily supports APIs (or build your own) to support data streaming (or batched) from other systems.
  • Data Quality Management from within the tool: Supporting data sampling, including profiling of data, directly from the development canvas.

What needs improvement?

High-cost of ownership: They could take a page from open source software, such as Talend.

Working with some of the big data components is good, but I can see improvements are needed, such as native support for Spark and HBase.

For how long have I used the solution?

More than five years.

What do I think about the stability of the solution?

No issues.

What do I think about the scalability of the solution?

No issues.

How are customer service and technical support?

Support is always good.

Which solution did I use previously and why did I switch?

Have used quite a few ETL tools in my job.

  • Ab Initio: Even pricier, but has a highly competent ETL tool. It is complete, but hard to use. 
  • Informatica: Not as flexible and does not support the same level of complexity in its maps.
  • Talend: It is a good tool suite, extensive, but can be cumbersome to cite all its pieces.
  • ODI: For the Oracle centric world.
  • SSIS: Week when compared to any of the above tool sets.

How was the initial setup?

Depends on type of environment that is being installed. I have seen fairly simple to overly complex initial setups due to the environment, not due to the tool.

What about the implementation team?

Both vendor and in-house team implementations:

IBM has top-notch support and tool services along with other partners as well. Depending on the partner, this can go from installation and configuration to solution development, etc.)

Most in-house teams that I have seen tend to have have good developers, but not always good architects. Like most every data integration project, if you do not have a strong architecture, your solution will eventually fail.

What was our ROI?

Depends on the project.

Which other solutions did I evaluate?

Have done many ETL tool evaluations based on client requirements. DataStage has always been in the top-three. It may not have been selected due to different weights being used for different sections of the evaluation for different clients, but it has always been in the top-three consistently.

What other advice do I have?

If you have the budget and your solution requires industrial/enterprise strength data integration, this product is always a good choice.

Disclosure: I am a real user, and this review is based on my own experience and opinions.
PeerSpot user
Systems Integration Associate Director at a computer software company with 10,001+ employees
Real User
Helpful support, and the Hierarchical Data Stage is good
Pros and Cons
  • "The Hierarchical Data Stage is good."
  • "The interface needs improvement."

What is our primary use case?

We are a consulting company and we use this solution for our clients. We set up the data for them. We have various healthcare-related information from their vendor and business partners. They have integrated them and get data reports from it.

How has it helped my organization?

It improves how our client's organization functions.

What is most valuable?

We mainly use the designer and developer qualities. We use the basic features that we have.

They have many good features. The Hierarchical Data Stage is good.

What needs improvement?

The interface needs improvement. The interface in Informatica is easier than in DataStage.

The licensing can be improved. Many companies are moving away from DataStage because it is expensive.

The biggest issue that is unclear is how are they integrating into DevOps when they are binary files.

We would like to see DataStage integrated with DevOps so that a pipeline can be created for auto-deployment. Right now we are all doing it manually.

For how long have I used the solution?

I have been working with IBM InfoSphere DataStage for seven years.

We have the 11.3 version but have recently migrated to the 11.7 version.

What do I think about the stability of the solution?

It's a stable product, it's not new.

What do I think about the scalability of the solution?

It's very scalable. Our clients are medium-sized companies with a 1.5 billion turnover.

How are customer service and technical support?

We reached out to IBM because the file was not readable, and they resolved the issue.

Technical support is good. I have not found any issues with technical support. I would rate them an eight out of ten.

In some cases, they have a delay in giving suggestions for the configuration.

Which solution did I use previously and why did I switch?

Previously, in another company, I worked with Informatica. There are not a lot of differences but the interface is easier than it is in DataStage.

How was the initial setup?

I don't do the setup, but I think that they have many challenges.

Initially, we had challenges with the configuration. We were trying to use the comparison for Excel, and reading the Excel files from the source, but the files were not readable.

What's my experience with pricing, setup cost, and licensing?

It's very expensive.

Which other solutions did I evaluate?


What other advice do I have?

I am not a developer, I have a team within our company for that.

There is a cloud migration strategy going on, so they are thinking of moving to the cloud. They want a tool that is not heavy and suitable for their budget.

The recommendation for using this tool would depend on the requirements. 

I don't have anything bad to say about this product.

I would rate this solution an eight out of ten.

Which deployment model are you using for this solution?

On-premises
Disclosure: I am a real user, and this review is based on my own experience and opinions.
PeerSpot user
Utkarsh Shrivastava - PeerSpot reviewer
ETL/Solution Architect at Crux
Real User
Good performance optimization and useful for ETL purposes when we're building data warehouses or data marts
Pros and Cons
  • "The performance optimization is quite good in DataStage. It provides parallelism and pipelining mechanisms"
  • "In the future, I would like to see more integration with cloud technologies."

What is our primary use case?

The primary use case is for ETL purposes for when we're building data warehouses or data marts. We use it to get the data from different disparate sources, do some ETL on them, and we use DataStage and then load them into the data warehouse, database, or data mart.

This solution used to be on-premises, but they've recently come out with a hybrid offering.

What is most valuable?

The performance optimization is quite good in DataStage. It provides parallelism and pipelining mechanisms. I have not found those in Informatica or Talend.

What needs improvement?

As a product, it needs to be more stable. It's a legacy product, so even though it's high-performing, it's not very stable compared to other products like Informatica or Talend. The UI also looks dated.

In the future, I would like to see more integration with cloud technologies. Technical support could be improved.

For how long have I used the solution?

I've worked with DataStage for about 9 years.

What do I think about the stability of the solution?

The stability could be better.

What do I think about the scalability of the solution?

It's scalable.

How are customer service and support?

I would rate technical support 6 out of 10.

How was the initial setup?

For the on-prem solution, it was moderately complex. I'm not sure about the hybrid version.

What other advice do I have?

I would rate this solution 8 out of 10.

Which deployment model are you using for this solution?

Hybrid Cloud
Disclosure: I am a real user, and this review is based on my own experience and opinions.
PeerSpot user
reviewer2260647 - PeerSpot reviewer
Technical Data Analyst at a government with 10,001+ employees
Real User
Helps to migrate old databases to newer ones
Pros and Cons
  • "We can view what we want to do. We can transform data and put them on tables."
  • "I want the tool to continue with the on-prem version, not the cloud one."

What is our primary use case?

We migrate data from old databases to new databases. 

What is most valuable?

We can view what we want to do. We can transform data and put them on tables. 

What needs improvement?

I want the tool to continue with the on-prem version, not the cloud one. 

For how long have I used the solution?

I have been using the solution for six years. 

What do I think about the stability of the solution?

I rate the solution's stability a seven out of ten. 

What do I think about the scalability of the solution?

My company has 12 users for IBM InfoSphere DataStage. I rate its scalability a nine out of ten. 

How are customer service and support?

It takes time to get answers from the support. The responsiveness varies. Sometimes it is fast, and other times it is slow. 

How would you rate customer service and support?

Neutral

How was the initial setup?

IBM has good installation documents. The tool's deployment took a day to complete. 

What other advice do I have?

As part of the tool's maintenance, we install the updates and take backups. We rely on two engineers to help with the process.  I rate IBM InfoSphere DataStage an eight out of ten. 

Disclosure: I am a real user, and this review is based on my own experience and opinions.
PeerSpot user
Roshan Jayakodi - PeerSpot reviewer
Consultant - Data Engineering at South Asian Technologies
Reseller
Data integration tool that is scalable and offers good customer support for premium customers
Pros and Cons
  • "When we have needed help from the IBM team, they were helpful. Our company is a premium partner so we get fast responses."
  • "Their web interface is good but the on-prem sites are outdated. The solution could also be improved if they could integrate the data pipeline scheduling part of their interface."

What is our primary use case?

We started using this solution as we needed a system upgrade. We had to move the Db2 data to a AS400 system. 

What is most valuable?


What needs improvement?

Their web interface is good but the on-prem sites are outdated. The solution could also be improved if they could integrate the data pipeline scheduling part of their interface. 

For how long have I used the solution?

We have been using this solution for a couple of months.

What do I think about the scalability of the solution?

This is a scalable solution. 

How are customer service and support?

When we have needed help from the IBM team, they were helpful. Our company is a premium partner so we get fast responses. 

How would you rate customer service and support?

Positive

How was the initial setup?

The infrastructure and the software configuration part was done by one of my teammates. We were able to complete it in two working days. This was only for the installation of DataStage. 

What other advice do I have?

I would advise others to identify the communication between servers and the client tools correctly. If working from a client environment and connecting to the server, configuration should be done correctly, otherwise you may encounter some issues.

I would rate this solution an eight out of ten. 

Which deployment model are you using for this solution?

On-premises
Disclosure: My company has a business relationship with this vendor other than being a customer:
PeerSpot user
it_user953511 - PeerSpot reviewer
Technical Lead at a tech services company with 5,001-10,000 employees
Real User
A powerful solution for complex transactions with an extremely straightforward setup
Pros and Cons
  • "DataStage works better with Linux operating systems when the application services are hosted on Linux system equipment, but it's powerful on Windows too."
  • "I really like this tool, but the administration should be on the same client application because a lot of administration features are not on the client-side, and they usually need to have administrative access. It's quite complicated to force IT teams to have separate administrative access from the developers."

What is most valuable?

DataStage itself is a very powerful tool and you have a lot of transformations that you can do. In comparison to Informatica, you can run very complex transactions on it. It's a precise and powerful environment. And when you have ETLs and you all your data documented on the data manager and you have the rest of management from IBM itself on InfoSphere, it's very powerful, especially when you use the whole suite. It helps with end-user technologies and gives you better imaging. 

It's powerful in administration mode. It works well when you're using solutions like DataConnect, IBM CDC, etc.

DataStage works better with Linux operating systems when the application services are hosted on a Linux system equipment, but it's powerful on Windows too.

Also, no matter what language you use, you can always transform the data.

I'm not sure about the latest versions, however, as I'm on 11.3, which is about five years old.

What needs improvement?

I really like this tool, but the administration should be on the same client application because a lot of administration features are not on the client-side, and they usually need to have administrative access. It's quite complicated to force IT, teams, to have separate administrative access from the developers. 

The platform also needs more stability. It caches a lot. It crashes on the application servers that the host allows on the platform.

The solution needs better online tools for data, or for sourcing data on the internet. They have InfoSphere exchange but it's not as useful for DataStage. 

For how long have I used the solution?

I've been using the solution for three years.

What do I think about the stability of the solution?

The stability is related to the operating system your company uses. It crashes occasionally when you're hosting the application servers, but only in the servers on the operating system. With Linux, because Linux itself is very robust, it crashes less. For example, we have a telecom with 40 million users and using Linux it crashed maybe two times a year. However, when the solution was hosted on the Windows platform, it crashed two to three times a month. 

The crashes are related to memory and if it's not automated, you have to deal with it manually. On Windows, you have to release the cache memories manually, but on Linux, you don't have to do that, which is why you get less crashing. 

What do I think about the scalability of the solution?

The solution is very scalable, but at a certain point, it consumes a lot of resources. In order to scale, you need a lot of memory and a lot of people.

How are customer service and technical support?

The quality of technical support you can expect usually depends on the region. In Egypt, it was not great, but in Jordan, Dubai, and Kuwait, it was good. That was a couple of years ago. I'm not using the solution right now, so I can't say for sure if this is still the case.

How was the initial setup?

The solution has one of the most straightforward setups. It's even easier than Office.

What other advice do I have?

The last version I interacted with was 11.3 because the later versions were cloud-based and usually our customers didn't want to use the solution on the cloud.

In terms of advice, I would give to anyone trying to implement the solution is this: you to have accurate sizing. Clients always do the sizing wrong and they need more experience to get the sizing right. Setting up the environments takes sizing into account but it usually makes a lot of problems if the sizing is poor when it starts to operate. Then you have re-implement and it will require an increase in resources that will change your budget. 

I would rate this solution nine out of ten.

Disclosure: I am a real user, and this review is based on my own experience and opinions.
PeerSpot user
Buyer's Guide
Download our free IBM InfoSphere DataStage Report and get advice and tips from experienced pros sharing their opinions.
Updated: January 2025
Product Categories
Data Integration
Buyer's Guide
Download our free IBM InfoSphere DataStage Report and get advice and tips from experienced pros sharing their opinions.