Preface

The purpose of this paper is to provide a technical overview of the PDI Terafast Bulkloader Plugin.

Note: We recommend that you use Teradata TPT Insert Upsert Bulk Loader instead of this step. TPT brings combines FastLoad, MultiLoad, FastExport and TPump Teradata bulk loads. Teradata no longer updates the other bulk loaders.

Definitions

PDI

Pentaho Data Integration.

Fastload

Teradata Fastload command line utility.

Terafast

Teradata Fastload PDI Plugin.

Prerequisites

In order to use the Pentaho Terafast Bulkloader Plugin you need the following software installed:

Installation

Extract the ZIP-Archive Terafast-PDI-Piugin.zip and move following files/folders to the correct directory:

The TeraFast Plugin is now installed. It can be found within PDI Transformations in step category "Bulk Loading"

Functional Scope

The Pentaho TeraFast PDI Plugin supports fastloading data into a Teradata database. This functionality is achieved by using the Fastload command line tool from Teradata.
Basically the plugin supports 2 operating modes:

Fastload Control File

Interactive

GUI Usage

  1. Step name ... The PDI name for the Teradata Fastload Step.
  2. Use control file ... Work in control file mode. (see chapter Functional Scope).
  3. Control file ... The path to the control file to be used.
  4. Variable Substitution in control file ... Make use of POI Variables (eg: ${RUN_ID}) in the control file.
  5. Path to fastload ... The path to the fastload command line utility.
  6. Error log ... An optional Error log to be created by Fastload.
  7. Connection ... A connection to the Teradata DB.
  8. Target table ... The table to be loaded.
  9. Truncate table ... Truncate the target table before loading.
  10. Data file ... The name of the temporary data file.
  11. Sessions ... Number of sessions to be used by Fastload.
  12. Error limit ... The error limit for Fastload.
  13. Field mapping ... Definition of POI <-> DB Field mapping.

Note: Options 7 - 13 are only available in interactive mode.

Known Problems and Planned Features

The streaming mode lacks performance at the moment. This is mainly caused by known problems in the file writing process and on Linux (*nix) the Plugin could take advantage of a FIFO pipe that will allow the loading of the file with Fastload while it is still being written.

These improvements to the Plugin will be done later, when we have the time for it.

If these improvements are urgent to you, or need any other PDI-Plugin (or other Pentaho extensions), don’t hesitate to contact us at pentaho@aschauer-edv.at

Get in Contact

For more information on technical considerations or future planning of the Plugin, or if you want to contribute in developing further releases please contact Aschauer EDV via the follow e-mail address: pentaho@aschauer-edv.at

Attached is the documentation as provided by Aschauer EDV

Link to attachment: Terafast_Technical_Overview.pdf