Technological partnership

Pentaho: data manipulation and transformation

Discover Pentaho Data Integration (PDI), an open-source ETL tool for designing and executing data manipulation and transformation operations, now in version 5.0.

Pentaho Data Integration (PDI), long known as Kettle, is an open source ETL for designing and executing data manipulation and transformation operations. At the time of writing, Pentaho Data Integration is available in version 5.0.

Thanks to a step-based graphical model, it is possible to create without programming processes composed of data imports and exports, and different transformation operations such as conversions, joins, application of filters , or even executing JavaScript functions. PDI has a large number of connectors, both reading and writing, allowing it to access a large number of databases and all types of files.

In the enterprise version, a scheduler allows you to plan the execution of jobs. A commercial “Agile BI” module also makes it possible to graphically visualize the results of data transformations from the first stages of development.

 

FEATURES

Version studied

  • 5.0

Distributed by

  • Editor (Pentaho)

Licenses

  • Other commercial; LGPL

Technology

  • Java

Pentaho Data Integration is a comprehensive tool with advanced features such as ETL processing clustering.

These features, available from the open source version of PDI, are only found in the commercial versions of competing ETLs.

Pentaho Data Integration is available in LGPL version, the Agile BI module being under commercial license.