Introducing G2.ai, the future of software buying.Try now

IBM DataStage Reviews & Product Details

Profile Status

This profile is currently managed by IBM DataStage but has limited features.

Are you part of the IBM DataStage team? Upgrade your plan to enhance your branding and engage with visitors to your profile!

Value at a Glance

Averages based on real user reviews.

Time to Implement

7 months

Return on Investment

26 months

IBM DataStage Media

IBM DataStage Demo - DataStage-Flow-Designer01.png
DataStage Flow Designer
IBM DataStage Demo - DataStage-Flow-Designer02.png
Data Stage Flow Designer
IBM DataStage Demo - DataStage-Flow-Designer03.JPG
DataStage Flow Designer
Product Avatar Image

Have you used IBM DataStage before?

Answer a few questions to help the IBM DataStage community

IBM DataStage Reviews (72)

Reviews

IBM DataStage Reviews (72)

4.0
72 reviews

Search reviews
Filter Reviews
Clear Results
G2 reviews are authentic and verified.
Poojasree M.
PM
Associate Lead
Computer Software
Mid-Market (51-1000 emp.)
"Unmatched Performance and Reliability for Enterprise Data Workloads"
What do you like best about IBM DataStage?

The most impressive aspect of DataStage is its high-performance parallel processing engine, which allows it to handle massive enterprise data volumes with ease. By utilizing "pipelining" and "partitioning," the system can process different stages of a job simultaneously across multiple CPU nodes. This means that instead of waiting for one task to finish before the next begins, data flows through the pipeline like an assembly line, ensuring that even petabyte-scale workloads are completed within tight processing windows.

Furthermore, its visual design environment offers a sophisticated balance between simplicity and power. The drag-and-drop interface allows engineers to build complex ETL logic using pre-built "Stages" for joins, lookups, and transformations without needing to write manual code. However, it remains highly extensible for developers; if a specific requirement isn't met by a standard component, you can integrate custom Python scripts or SQL, making it flexible enough for both standard reporting and complex data science pipelines.

Finally, DataStage excels in enterprise-grade reliability and governance, which is why it remains a staple in highly regulated industries like finance and healthcare. It integrates seamlessly with metadata catalogs to provide end-to-end data lineage, allowing users to track exactly how data has changed from source to target. Combined with robust error-handling and "Reject Links" that capture bad data without crashing the entire job, it provides a level of stability and auditability that many lightweight or open-source tools struggle to match. Review collected by and hosted on G2.com.

What do you dislike about IBM DataStage?

One of the most significant drawbacks of IBM DataStage is its prohibitive cost and complex licensing model, which often makes it inaccessible for small-to-medium businesses. Beyond the high initial purchase price, the "IBM Tax" includes ongoing maintenance and specialized infrastructure requirements that scale aggressively with data volume. Furthermore, because the tool is highly proprietary, organizations face heavy vendor lock-in; migrating logic out of DataStage to a modern, open-source-friendly stack like dbt or Airbyte is notoriously difficult and time-consuming.

From a technical standpoint, many engineers find the platform increasingly clunky and "legacy" compared to agile, cloud-native alternatives. While its parallel engine is powerful, it requires deep, specialized expertise to tune—settings like partition methods and buffer sizes are manual and unintuitive, leading to a steep learning curve for new hires. Additionally, while the newer "Next Gen" versions have improved, the ecosystem is still criticized for being batch-heavy, making it less agile for teams that require modern real-time streaming or "DataOps" automation. Review collected by and hosted on G2.com.

IS
Analista de Processos
Mid-Market (51-1000 emp.)
"Exceptional Performance and Connectivity with Intuitive Interface"
What do you like best about IBM DataStage?

Wide Connectivity, High Performance and Scalability, Intuitive Graphical Interface Review collected by and hosted on G2.com.

What do you dislike about IBM DataStage?

High Learning Curve, Infrastructure Dependency Review collected by and hosted on G2.com.

Max R.
MR
Sócio-proprietário
Mid-Market (51-1000 emp.)
"Data Integration and Quality with DataStage"
What do you like best about IBM DataStage?

Best data integration tool on the market with a wide range of connectors and advanced data integration and quality features. Review collected by and hosted on G2.com.

What do you dislike about IBM DataStage?

I quite like the platform as a whole, but I believe it can improve regarding data lineage (it should indeed improve now with the arrival of Manta to the IBM portfolio). Review collected by and hosted on G2.com.

Kapil K.
KK
Graduate Data Engineer
Mid-Market (51-1000 emp.)
"Using Datastage for ETL"
What do you like best about IBM DataStage?

We use InfoSphere DataStage for ETL in our organisation and as datastage can easily handle large data (Tbs) and we can transform our data easily. It's easier to design our jobs in datastage and to run them. Review collected by and hosted on G2.com.

What do you dislike about IBM DataStage?

As a beginner I found using datastage hard. As there are so many functionalities and hence it takes time to get a hang of it. But once you start practicing it, it becomes easy. Review collected by and hosted on G2.com.

Verified User in Banking
UB
Enterprise (> 1000 emp.)
"IBM Datastage for ETL"
What do you like best about IBM DataStage?

IBM InfoSphere DataStage is simple yet efficient tool for ETL processing.

It has the variety of stages to implement your designs and test the same at runtime.

It has got additional features compared to other ETL tools, which helps in debugging and error handling. Review collected by and hosted on G2.com.

What do you dislike about IBM DataStage?

Datastage is UI is little at the backseat compared to other ETL tools.

Stages could be categorised based on functionalities. Review collected by and hosted on G2.com.

Verified User in Computer Software
UC
Mid-Market (51-1000 emp.)
"Analyzing vendor data"
What do you like best about IBM DataStage?

There are two reasons for us to use it, less cost, and because it's user friendly. Review collected by and hosted on G2.com.

What do you dislike about IBM DataStage?

Customer support is excellent, furthermore there can be some improvement on the number of features.

We did not face any problems during its implementation and its integration.

Frequency of use is not high as we are not just relying on it, but we might in future. Review collected by and hosted on G2.com.

MJ
Architect
Enterprise (> 1000 emp.)
Business partner of the seller or seller's competitor, not included in G2 scores.
"Powerful product"
What do you like best about IBM DataStage?

This tool has many options: a large number of connectors, ease of use of stages (jobs, sequences). It is possible to introduce code and make calls via restApi. Review collected by and hosted on G2.com.

What do you dislike about IBM DataStage?

Now it seems that you have to work from CloudPak as a cartridge.. this makes the solution more expensive Review collected by and hosted on G2.com.

Verified User in Financial Services
UF
Enterprise (> 1000 emp.)
"Data Stage review"
What do you like best about IBM DataStage?

- excellent performance in executing ETL processes for large amounts of data. Review collected by and hosted on G2.com.

What do you dislike about IBM DataStage?

- Lack of documentation and available knowledge for study and learning.

- Lack of support from the supplier (various problems with the product and also lack of support for functionalities like the quality stage).

- Interface is not at all intuitive and difficult to use. Review collected by and hosted on G2.com.

Simran T.
ST
Engineering Analyst
Small-Business (50 or fewer emp.)
"Review on IBM Infosphere Datastage"
What do you like best about IBM DataStage?

DataStage helps us to construct a source model that describes the rules for querying the source database. We have used several stages while making Dimension tables and fact table like transformer, lookup, joins etc. Steps are so easy to use that we must drag and drop the stages required for building the tables. Review collected by and hosted on G2.com.

What do you dislike about IBM DataStage?

The thing that I don't like about IBM Infosphere Datastage application is a plan of it is costly. Also, the Metadata propagation in Jobs is somewhat complex for some users and issues in the processing of XML. Review collected by and hosted on G2.com.

Verified User in Information Technology and Services
CI
Enterprise (> 1000 emp.)
"Good product"
What do you like best about IBM DataStage?

Its speed. It is very fast and responsive. Support is good. Review collected by and hosted on G2.com.

What do you dislike about IBM DataStage?

a little hard to use and implement. hs few bugs Review collected by and hosted on G2.com.

Pricing Insights

Averages based on real user reviews.

Time to Implement

7 months

Return on Investment

26 months

Perceived Cost

$$$$$
IBM DataStage Comparisons
Product Avatar Image
Apache NiFi
Compare Now
Product Avatar Image
Azure Data Factory
Compare Now
Product Avatar Image
IBM Cloud Pak for Integration
Compare Now
IBM DataStage Features
Reporting
Auditing
Extraction
Transformation
Loading
Product Avatar Image
IBM DataStage
View Alternatives