# Pentaho Data Integration Reviews
**Vendor:** Pentaho  
**Category:** [Big Data Integration Platforms](https://www.g2.com/categories/big-data-integration-platforms)  
**Average Rating:** 4.3/5.0  
**Total Reviews:** 17
## About Pentaho Data Integration
More than just ETL (Extract, Transform, Load), Pentaho Data Integration is a codeless data orchestration tool that blends diverse data sets into a single source of truth as a basis for analysis and reporting. It is managed in a drag-and-drop graphical interface, so you can easily track where data is coming from, where it's going, and how it's being transformed.

- Develop and maintain pipeline efficiency
- Scalability, simplicity, and self-service
- Leverage quality and lineage inputs for enhanced data observability and management

## Pentaho Data Integration Pros & Cons
**What users like:**

- Users appreciate the **user-friendly interface** and **abundance of functionalities** in Pentaho Data Integration for efficient ETL processes. (2 reviews)
- Users appreciate the **API integration capabilities** of Pentaho Data Integration, enabling swift data transfer and report generation. (1 review)
- Users value the **fast data transfer capabilities** of Pentaho Data Integration for seamless ETL processes across multiple sources. (1 review)
- Users appreciate the **fast and effective communication** in Pentaho Data Integration for seamless data transfers across sources. (1 review)
- Users value the **seamless connectivity** of Pentaho Data Integration, enabling rapid data transfer from multiple sources. (1 review)
- Users value the **extensive connectors** in Pentaho Data Integration for seamless data transfer across various sources. (1 review)
- Data Accessibility (1 review)
- Data Analysis (1 review)
- Database Management (1 review)
- Data Integration (1 review)

**What users dislike:**

- Users often experience **performance issues** when handling large data volumes, making job modifications frustratingly slow. (2 reviews)
- Users find the **slow learning curve** of Pentaho Data Integration challenging, desiring more tutorials for better understanding. (1 review)
- Users experience **performance issues** with Pentaho Data Integration, especially when handling large data volumes. (1 review)
- Users find that **modifying jobs in Pentaho is slow**, which can hinder efficiency despite fast job runs. (1 review)
- Users find modifying jobs in Pentaho Data Integration **slow and time-consuming**, impacting overall efficiency and productivity. (1 review)
- Users find the **slow processing** of job modifications frustrating, which hinders the overall efficiency with Pentaho Data Integration. (1 review)
- Time Consumption (1 review)

## Pentaho Data Integration Reviews
  ### 1. Pentaho, an ETL tool for business

**Rating:** 3.5/5.0 stars

**Reviewed by:** Sandeep C. | Data Analyst, Enterprise (> 1000 emp.)

**Reviewed Date:** March 12, 2025

**What do you like best about Pentaho Data Integration?**

Pentaho is one of the best ETL tools for extracting, transforming, and loading data among various sources. It just requires database connections and transfers data very fast. It also executes SQL and generates reports into Excel or any other required output. It has all the basic components, such as execute SQL, table input, Excel input, Excel output, text output, and HDFS output.

**What do you dislike about Pentaho Data Integration?**

Pentaho jobs run fast, but modifying a job is time-consuming; it is usually very slow. It would be better if it were as fast as any other application. It also needs some tutorials from Pentaho.

**What problems is Pentaho Data Integration solving and how is that benefiting you?**

It is usually best for transferring huge volumes of data from one source to another, such as Oracle to Hue. It is also well suited to generating reports in Excel using SQL queries.

  ### 2. One of the best ETL tools

**Rating:** 4.5/5.0 stars

**Reviewed by:** Dhiraj D. | Data Engineer, Enterprise (> 1000 emp.)

**Reviewed Date:** August 08, 2025

**What do you like best about Pentaho Data Integration?**

It's open source software, yet it has a very user-friendly interface, and I just love all the functionality. The design steps are very simple, fairly self-explanatory, and easy to use.

**What do you dislike about Pentaho Data Integration?**

It sometimes faces performance issues when the data volume is huge.

**What problems is Pentaho Data Integration solving and how is that benefiting you?**

We use it for our data pipelines. It benefits us by loading the data used by our application, and we also use it for reporting purposes.

  ### 3. Totally worth it!!

**Rating:** 4.5/5.0 stars

**Reviewed by:** Karthick V. | Senior Software Engineer, Enterprise (> 1000 emp.)

**Reviewed Date:** March 31, 2022

**What do you like best about Pentaho Data Integration?**

Best price on the market, Hitachi-sponsored, and high quality in data integration.

**What do you dislike about Pentaho Data Integration?**

Limited features, connector portability issues, and less user-friendly.

**What problems is Pentaho Data Integration solving and how is that benefiting you?**

We used PDI for data integration for designed reports. So far, we have had the best experience.

  ### 4. ETL for Dashboards

**Rating:** 4.0/5.0 stars

**Reviewed by:** Verified User in Information Technology and Services | Mid-Market (51-1000 emp.)

**Reviewed Date:** October 08, 2020

**What do you like best about Pentaho Data Integration?**

Pentaho Data Integration (aka Kettle) is a tool included in the Pentaho suite that we use in our Smart Cities projects to obtain data from various data sources. It has a large number of prebuilt tools for Input, Output, Transform, and so on, that save developers a lot of time. It is easy to use even for inexperienced users.

**What do you dislike about Pentaho Data Integration?**

If we want to have support with the Pentaho suite we should not use its Community version (free), but in some Smart Cities specifications of our clients they require a free and open source tool with associated support.

**Recommendations to others considering Pentaho Data Integration:**

The Pentaho suite has a Community version that is free of charge and is free software, so our recommendation is to download it and test it to verify that this tool meets your requirements. For our part, we recommend it, as we use it practically whenever we need to extract data from a data source quickly and easily.

**What problems is Pentaho Data Integration solving and how is that benefiting you?**

PDI allows us to obtain data from various data sources such as databases, Excel files, CSV, and big data / Hadoop-type databases, and to use preconfigured tools so that obtaining this data is simple and parameterizable. Other languages such as Python require writing complete modules; with PDI, implementation and debugging are integrated through plug-and-play tools.

  ### 5. PDI, best data cleaning tool

**Rating:** 5.0/5.0 stars

**Reviewed by:** Paco T. | Digital Technology Intern, Mid-Market (51-1000 emp.)

**Reviewed Date:** April 21, 2020

**What do you like best about Pentaho Data Integration?**

Pentaho comes in two editions, Enterprise and Community. I have experience with the Community edition, and here are all the advantages I see:

1. It's under the Apache 2.0 license, so as long as you read and work within the agreement, you can have this powerful tool for free
2. It has a very friendly user interface, so anybody, even without strong programming skills, can build some transformations in just minutes
3. It supports a wide variety of data input formats, allowing you to read from simple CSV or Excel files to databases, JSON, and even S3 storage
4. It has a lot of tools for transforming your data without coding
5. If the functions that PDI has integrated aren't enough for you, you can add some scripting steps

**What do you dislike about Pentaho Data Integration?**

I see a strong opportunity to improve their documentation; sometimes it's kind of hard to find examples for all the functionality that PDI offers.

**What problems is Pentaho Data Integration solving and how is that benefiting you?**

I mainly use Pentaho for transforming data in the ETL cycle, so I cleanse data from different sources and store it in a DWH.

  ### 6. ETL with graphical interface

**Rating:** 4.5/5.0 stars

**Reviewed by:** Verified User in Information Technology and Services | Mid-Market (51-1000 emp.)

**Reviewed Date:** June 10, 2020

**What do you like best about Pentaho Data Integration?**

Pentaho data integration is one of the most powerful tools for building ETL processes that we use within our Smart Cities projects. It is a tool with a graphical interface that allows you to debug quickly and easily and has a multitude of preconfigured modules. Furthermore, it combines very well with the Hitachi Pentaho CDE tool for the generation of Dashboards.

**What do you dislike about Pentaho Data Integration?**

When you want to do a very simple development maybe you can choose to use Python source code directly. There are other powerful alternatives like Talend Studio.

**Recommendations to others considering Pentaho Data Integration:**

Pentaho has a suite called Community that is free and available to everyone. In addition, it has many examples and information. We recommend trying it out before deciding if we need to purchase the paid version. It is a great tool and we recommend it.

**What problems is Pentaho Data Integration solving and how is that benefiting you?**

Pentaho Data Integration allows us to collect data from different data sources, including both relational and non-relational databases and Big Data stores (HDFS). It also allows us to bring in information from Excel files and from almost any other source of information we need. Also, its debugging tools save us a lot of time.

  ### 7. Open Source ETL Tools

**Rating:** 4.5/5.0 stars

**Reviewed by:** zahit B. | BI & DWH Consultant, Mid-Market (51-1000 emp.)

**Reviewed Date:** November 18, 2019

**What do you like best about Pentaho Data Integration?**

Pentaho Data Integration (PDI) is a free and open source tool for all users. It is a very high-performance product compared to paid ETL tools. The product is quite simple to use. The panel on the left side of the product has all the components that the user needs (for example, Excel connection, row value, etc.). In my experience, the Logging screen is not descriptive. Sometimes you cannot identify the source of the error. Other than that, I am very satisfied with the PDI tool.

**What do you dislike about Pentaho Data Integration?**

Since there are no detailed explanations of the errors on the logging screen, sometimes we cannot find the cause of the error. Also, the user community is not as strong as those of Microsoft or Oracle.

**What problems is Pentaho Data Integration solving and how is that benefiting you?**

We needed to import data from JSON files into tables in the database. With the Pentaho Data Integration tool, we transferred the JSON files to the database. We designed a daily job with Windows Task Scheduler.

  ### 8. Great Business Intelligence Tool

**Rating:** 5.0/5.0 stars

**Reviewed by:** Senando B. | Management information system, Mid-Market (51-1000 emp.)

**Reviewed Date:** September 16, 2019

**What do you like best about Pentaho Data Integration?**

What I like most about Pentaho Data Integration is that it can handle large volumes, millions of data files, with no hassle. You can extract data from different databases in a small amount of time. After extraction, you can use the reports to build more powerful analytic charts and business intelligence, which helps my colleagues a lot, especially in sales and production, to overcome the problems we may face in the future.

**What do you dislike about Pentaho Data Integration?**

It's not a dislike, but I have observed that when I run Pentaho version 3 the bootup is fast, while on version 5 and above it takes 5 to 7 minutes. I don't know if this is due to the specs of the computer or if the Pentaho version itself has a lot of features to load.

**Recommendations to others considering Pentaho Data Integration:**

When you need a powerful business intelligence tool, Pentaho Data Integration is perfect for you. Its extraction capabilities are great, and it is hassle-free and easy to use.

**What problems is Pentaho Data Integration solving and how is that benefiting you?**

The problem I solve using Pentaho Data Integration is extracting millions of different data records from our databases and turning them into reports and analytic charts that help my sales and production teams analyze problems with our product sales.

  ### 9. Ease of using ETL software

**Rating:** 3.0/5.0 stars

**Reviewed by:** Manju S. | Teaching Assistant, Higher Education, Small-Business (50 or fewer emp.)

**Reviewed Date:** January 19, 2019

**What do you like best about Pentaho Data Integration?**

1. Easy-to-use GUI with a drag-and-drop option.
2. Single view of the complete process.
3. Sharing connections makes it easy for different jobs.
4. Ability to change or modify individual transformations is very helpful.
5. It is a free tool for us, which is an added advantage.


**What do you dislike about Pentaho Data Integration?**

1. When using Kettle environment variables, the application doesn't work as intended, which can be frustrating.
2. Unless you pay for the tool, the implementation is something you have to figure out by yourself.

**Recommendations to others considering Pentaho Data Integration:**

Friendly UI and ease of use

**What problems is Pentaho Data Integration solving and how is that benefiting you?**

A streamlined process for creating a data warehouse, which has been beneficial for completing the tasks assigned to me.

  ### 10. Excellent ETL UI for the non-programmer

**Rating:** 4.0/5.0 stars

**Reviewed by:** Verified User in Internet | Mid-Market (51-1000 emp.)

**Reviewed Date:** April 04, 2019

**What do you like best about Pentaho Data Integration?**

PDI (previously known as Kettle) is an excellent data cleansing and transformation tool for the non-programmer. It has an excellent UI for users to build data flows without knowing how to code!

**What do you dislike about Pentaho Data Integration?**

You are limited to the modules and steps that the tool offers. There are excellent modules already offered, but I have heard there are some that you will not find here.

**Recommendations to others considering Pentaho Data Integration:**

Check out the community version.

**What problems is Pentaho Data Integration solving and how is that benefiting you?**

We use PDI for data cleansing and data prepping, especially when data is across multiple environments.

  ### 11. One of the best data science software packages

**Rating:** 5.0/5.0 stars

**Reviewed by:** Vinicios W. | Director of Academic Affairs, Program Development, Small-Business (50 or fewer emp.)

**Reviewed Date:** December 26, 2018

**What do you like best about Pentaho Data Integration?**

Pentaho Data Integration's graphical ETL simplifies the data pipeline to a level never seen before.

**What do you dislike about Pentaho Data Integration?**

Pentaho Data Integration has the huge advantage of simplifying data preparation, but the UI is a little messy.

**Recommendations to others considering Pentaho Data Integration:**

I heavily recommend Pentaho Data Integration. It is an old player in the data field, used by a huge number of people. The key concept of Pentaho Data Integration is the graphical ETL designer that simplifies the data pipeline; with zero programming experience you can do a lot with it. In my opinion, the UI is a little messy, but this will not get in your way in data processing tasks.

**What problems is Pentaho Data Integration solving and how is that benefiting you?**

Data cleansing, in order to apply a machine learning model.

  ### 12. Used Pentaho for ETL jobs and to run dataflows in Clarabridge

**Rating:** 4.0/5.0 stars

**Reviewed by:** Shishir B. | Linux Engineer, Telecommunications, Enterprise (> 1000 emp.)

**Reviewed Date:** April 11, 2019

**What do you like best about Pentaho Data Integration?**

It is easy to use with graphical options for various things such as transformation.

**What do you dislike about Pentaho Data Integration?**

There should be better documentation for first-time users. When users use the tabs for the first time, it can be confusing to them. Besides that, better community support is required for the application so that enterprises can easily report bugs in the software.

**Recommendations to others considering Pentaho Data Integration:**

Better documentation is needed.

**What problems is Pentaho Data Integration solving and how is that benefiting you?**

Collecting and analyzing customer experience. 

  ### 13. Just what you need to integrate data at low cost

**Rating:** 4.0/5.0 stars

**Reviewed by:** Victor Hugo G. | Sr. Data Scientist, Enterprise (> 1000 emp.)

**Reviewed Date:** March 05, 2019

**What do you like best about Pentaho Data Integration?**

Pentaho Data Integration is a multiplatform (Java) ETL tool (and more). It allows its users to intuitively integrate data across multiple data sources; it is easy to use and really powerful.

**What do you dislike about Pentaho Data Integration?**

The setup is not the most intuitive; you may need to manually configure your Java environment variables and install additional database drivers.

**What problems is Pentaho Data Integration solving and how is that benefiting you?**

Data integration is always challenging and this tool makes it really easy, no matter if you are integrating data over a data warehouse or just migrating data

  ### 14. Excellent BI tool

**Rating:** 5.0/5.0 stars

**Reviewed by:** Carlos Andrés C. | Information Technology Infrastructure Administrator, Small-Business (50 or fewer emp.)

**Reviewed Date:** March 07, 2019

**What do you like best about Pentaho Data Integration?**

Excellent BI tool. I think it is very convenient for publishing reports for clients, and it is also very useful for downloading reports to PDF and Excel.

**What do you dislike about Pentaho Data Integration?**

Version 4 of Pentaho had very interesting features that were lost in the newer versions. The application also had better performance at that time.

**What problems is Pentaho Data Integration solving and how is that benefiting you?**

We use Pentaho as the report orchestrator, where we use it to show our clients sales reports

  ### 15. Pentaho review

**Rating:** 3.5/5.0 stars

**Reviewed by:** Verified User in Information Technology and Services | Enterprise (> 1000 emp.)

**Reviewed Date:** April 12, 2019

**What do you like best about Pentaho Data Integration?**

Pentaho is a BI tool that provides data integration, reporting, statistics dashboards, data mining, and Extract, Transform, Load (ETL) tools.

**What do you dislike about Pentaho Data Integration?**

It turns out to be a bit slow compared to its competitors. Other than that, I do not dislike anything about the tool.

**What problems is Pentaho Data Integration solving and how is that benefiting you?**

We have used it in our project to integrate and reconcile huge volumes of data and also to report the stats.

  ### 16. Great tool. Easy to learn with infinite value

**Rating:** 3.5/5.0 stars

**Reviewed by:** Verified User in Information Technology and Services | Enterprise (> 1000 emp.)

**Reviewed Date:** February 21, 2019

**What do you like best about Pentaho Data Integration?**

I love the visual ETL builder. PDI offers a very large selection of out-of-the-box steps that fortunately have fantastic documentation behind them.

**What do you dislike about Pentaho Data Integration?**

I would really like to see the email notification process be a bit simpler. Parameterizing your email configuration helps, but I still believe it could be easier for users.

**What problems is Pentaho Data Integration solving and how is that benefiting you?**

We are doing a lot with PDI. We have found the tool excellent for migrating data between applications. We have recently started to use it to create monitors on data and applications. 

  ### 17. ETL transformation done right

**Rating:** 5.0/5.0 stars

**Reviewed by:** Verified User in Telecommunications | Mid-Market (51-1000 emp.)

**Reviewed Date:** April 11, 2019

**What do you like best about Pentaho Data Integration?**

Ability to inject custom scripts as part of the transformation process

**What do you dislike about Pentaho Data Integration?**

UI/UX is not so friendly. I guess there is room for improvement on that front.

**What problems is Pentaho Data Integration solving and how is that benefiting you?**

I used it for data migration of millions of customer records for a big organization and I don't know if there was a better way to do it without Pentaho.


## Pentaho Data Integration Discussions
  - [Is there anyway that you can make your own data connector in smartsheet?](https://www.g2.com/discussions/is-there-anyway-that-you-can-make-your-own-data-connector-in-smartsheet) - 1 upvote
  - [The order of the modules in the kettle must be improved.](https://www.g2.com/discussions/13286-the-order-of-the-modules-in-the-kettle-must-be-improved) - 1 upvote

## Pentaho Data Integration Integrations
  - [PostgreSQL](https://www.g2.com/products/postgresql/reviews)

## Pentaho Data Integration Features
**Management**
- Reporting
- Auditing

**Functionality**
- Data Migration
- Data Variety
- Alerts and Logging
- Data Replication

**Functionality**
- Data visualisation
- Data transformation
- Data migration
- Process various formats
- No-code functionality
- Data manipulation
- Analytics
- Reporting

**Database**
- Real-Time Data Collection
- Data Distribution
- Data Lake

**Functionality**
- Extraction
- Transformation
- Loading
- Automation
- Scalability

**Management**
- Backup and Recovery
- Integration Variety
- Access and Security
- Real time Monitoring

**Integrations**
- Hadoop Integration
- Spark Integration

**Security**
- Data Protection

**Platform**
- Machine Scaling
- Data Preparation
- Spark Integration

**Agentic AI - Cloud Migration**
- Autonomous Task Execution
- Multi-step Planning
- Decision Making

**Processing**
- Cloud Processing
- Workload Processing

**Building Reports**
- Data Transformation
- Data Modeling
- WYSIWYG Report Design
- Integration APIs

**Platform**
- Mobile User Support
- Customization 
- User, Role, and Access Management
- Internationalization
- Sandbox / Test Environments
- Performance and Reliability
- Breadth of Partner Applications

## Top Pentaho Data Integration Alternatives
  - [Informatica PowerCenter](https://www.g2.com/products/informatica-powercenter/reviews) - 4.3/5.0 (82 reviews)
  - [IBM DataStage](https://www.g2.com/products/ibm-datastage/reviews) - 4.0/5.0 (62 reviews)
  - [AWS Glue](https://www.g2.com/products/aws-glue/reviews) - 4.3/5.0 (194 reviews)

