
IBM InfoSphere DataStage pros and cons

Vendor: IBM
3.9 out of 5
2,020 followers

Pros & Cons summary

Prominent pros & cons

PROS

IBM InfoSphere DataStage offers highly customizable features that handle multiple data latencies such as scheduled batch, on-demand, and real-time in a single job.
It provides robust data management capabilities with high capacity to process large volumes of data, enhancing performance with parallel processing and scalability.
Users appreciate its versatile capacity for data integration and executing complex business rules, proving valuable for data warehousing and ETL processes.
Its error logging is simpler and easier to understand than that of other data integration tools, and its data lineage features make columns and tables easy to trace.
IBM InfoSphere DataStage has improved its connectors, including Kafka connectivity for real-time integration and cloud connectors, which helps prepare data processes for an expected move to the cloud.

CONS

Documentation and in-application help need improvement, particularly for new features and building APIs.
The pricing is high compared to competitors, leading many clients to prefer other tools.
There is a need for enhanced integration with modern data sources and cloud technologies, along with support for missing connectors.
Performance issues exist, such as slow loading processes and the inability to provide real-time data to vendors.
The latest version has a complex architecture, and one reviewer's team faced frequent outages as a result; reliability should be improved.
 

IBM InfoSphere DataStage Pros review quotes

UF
Jul 29, 2019
The product is a stable and powerful data management solution that can run in parallel mode for enhanced speed.
KS
Sep 26, 2019
The data lineage report can be filtered for reporting. The reports are user-friendly and take less time to find what you need.
reviewer1559628 - PeerSpot reviewer
Apr 22, 2021
As a data integration platform, it is easy to use. It is quite robust and useful for volumetric analysis when you have huge volumes of data. We have tested it for up to ten million rows, and it is robust enough to process ten million rows internally with its parallel processing. Its error logging mechanism is far simpler and easier to understand than other data integration tools. The newer version of InfoSphere has the data catalog and IDC lineage. They are helpful in the easy traceability of columns and tables.
Learn what your peers think about IBM InfoSphere DataStage. Get advice and tips from experienced pros sharing their opinions. Updated: December 2024.
824,067 professionals have used our research since 2012.
Yusuf Arslan - PeerSpot reviewer
Apr 15, 2024
DataStage has also improved its connectors, such as connectivity with Kafka for real-time data integration, cloud connectors, and others like Spark and HVAC SaaS. All these processes are expected to shift to the cloud in the next five years. It's a very robust tool, much like PowerCenter.
Tirthankar Roy Chowdhury - PeerSpot reviewer
Aug 3, 2022
The best feature of IBM InfoSphere DataStage for me was that it was very much user-friendly. The solution didn't require that much raw coding because most of its features were drag and drop, plus it had a large number of functionalities.
Murali B - PeerSpot reviewer
Mar 28, 2024
Compared to other ETL tools, DataStage has excellent debugging and development capabilities. The availability of connectors is also good, even though we sometimes have to opt for specific ones. The availability of patches is good as well.
it_user953511 - PeerSpot reviewer
Jul 31, 2019
DataStage works better with Linux operating systems when the application services are hosted on Linux system equipment, but it's powerful on Windows too.
BB
Mar 5, 2021
We are mostly using transformation rules. It has a lot of functions and logic related to transformation. It is a user-friendly tool with built-in functions.
ARTURO MONTIEL - PeerSpot reviewer
Feb 21, 2024
The most valuable feature for our data processing needs is IBM InfoSphere DataStage's capability to handle ETL tasks with large record volumes.
Rahul Saxena - PeerSpot reviewer
Jan 23, 2024
The solution is very easy to use.
 

IBM InfoSphere DataStage Cons review quotes

UF
Jul 29, 2019
The interface needs work to be more user-friendly.
KS
Sep 26, 2019
In future versions, we would like jobs to be able to return several parameters. Currently, a job can return just one parameter.
reviewer1559628 - PeerSpot reviewer
Apr 22, 2021
Its documentation is not up to the mark. While building APIs, we had a lot of problems trying to get around it because it is not very user-friendly. We tried to get hold of API documentation, but the documentation is not very well thought out. It should be more structured and elaborate. In terms of additional features, I would like to see good reporting on performance and performance-tuning recommendations that can be based on AI. I would also like to see better data profiling information being reported on InfoSphere.
Yusuf Arslan - PeerSpot reviewer
Apr 15, 2024
The deployment could be more straightforward.
Tirthankar Roy Chowdhury - PeerSpot reviewer
Aug 3, 2022
What needs improvement in IBM InfoSphere DataStage is its pricing. The pricing for the solution is higher than its competitors', so a lot of the clients my company has worked with prefer other tools over IBM InfoSphere DataStage because of the high price tag. Another area for improvement is connector coverage. Many new types of databases have become available, for example cloud databases and big data stores, and although IBM InfoSphere DataStage is working on connectors for different data sources, it still isn't up to date, meaning that some connectors are missing for modern data sources. The latest version of IBM InfoSphere DataStage also has a complex architecture, so my team faced frequent outages, and that should be improved as well.
Murali B - PeerSpot reviewer
Mar 28, 2024
In terms of intermediate storage, we have some challenges, especially with customers who store data in intermediate locations.
it_user953511 - PeerSpot reviewer
Jul 31, 2019
I really like this tool, but the administration should be on the same client application because a lot of administration features are not on the client-side, and they usually need to have administrative access. It's quite complicated to force IT teams to have separate administrative access from the developers.
BB
Mar 5, 2021
It doesn't have any big data connections. It would be good to have them because most of the systems are moving towards big data. There should also be a user-friendly way to interact with the cloud. Its loading process is very slow. It takes a lot of time for around 5 or 6 million records, and we are not able to provide real-time data to the vendors due to this delay. Its performance needs to be improved. It is also like a legacy system. It is not updated much. In higher versions, they only do small changes. We would like to have new features and new technologies.
ARTURO MONTIEL - PeerSpot reviewer
Feb 21, 2024
Improvements for DataStage could include better integration with modern data sources like cloud solutions and documents, along with enhancing its capability to handle non-structured data.
Rahul Saxena - PeerSpot reviewer
Jan 23, 2024
The troubleshooting guide is very bad.