Title: Pentaho Data Integration Steps  
Owner: Former user
Creator: Former user Sept 21, 2007
Last Changed by: Virginia Agnew Aug 13, 2021
Tiny Link: (useful for email) https://pentaho-public.atlassian.net/wiki/x/jQi3FQ
Export As: Word · PDF  
Pentaho Data Integration (3)
    Page: .09 Transformation Steps
    Page: Getting Started
    Home page: Latest Pentaho Data Integration (aka Kettle) Documentation
Hierarchy
Children (251)
    Page: Closure Generator
    Page: Data Validator
    Page: Excel Input Step
    Page: Switch-Case
    Page: XML Join
    Page: Metadata Structure
    Page: Add XML
    Page: Text File Output (Deprecated)
    Page: Generate Random Value
    Page: Text File Input
Labels
Global Labels (2)
Outgoing Links
External Links (3)
    ci.pentaho.com/view/Data%20Integration/job/Kettle/
    https://help.hitachivantara.com/Documentation/Pentaho
    https://wiki.pentaho.com/display/EAI/Abort
Pentaho Data Mining (1)     Page: Using the Univariate Statistics Plugin
Pentaho Data Integration (232)     Page: Greenplum Load
    Page: Salesforce Insert
    Page: String operations
    Page: MonetDB Agile Mart
    Page: Process files
    Page: SSTable Output
    Page: Cassandra Input
    Page: Blocking step
    Page: JSON output
    Page: Salesforce Delete
    Page: XBase Input
    Page: Yaml Input
    Page: Stream Lookup
    Page: Sorted Merge
    Page: GZIP CSV Input
    Page: XML Input
    Page: IBM Websphere MQ Producer (Deprecated)
    Page: RSS Input
    Page: Switch-Case
    Page: Prioritize streams
    Page: IBM Websphere MQ Consumer (Deprecated)
    Page: Split Fields
    Page: Symmetric Cryptography
    Page: Mail (step)
    Page: SQL File Output
    Page: Filter Rows
    Page: HL7 Input
    Page: Transformation Executor
    Page: Teradata TPT Insert Upsert Bulk Loader
    Page: Palo Cell Output (Deprecated)
    Page: MapReduce Output
    Page: JMS Producer (Deprecated)
    Page: Check if webservice is available
    Page: Call Endpoint
    Page: RSS Output
    Page: Add Constants
    Page: Set files in result
    Page: Example plugin (Transformation Step)
    Page: JMS Consumer (Deprecated)
    Page: Get previous row fields
    Page: XML Output
    Page: Table Input
    Page: Palo Dimension Input (Deprecated)
    Page: OpenERP Object Input (Deprecated)
    Page: Streaming XML Input
    Page: Single Threader
    Page: Fuzzy match
    Page: Text File Input
    Page: Reservoir Sampling
    Page: PostgreSQL Bulk Loader
    Page: Salesforce Upsert
    Page: Socket writer
    Page: Clone row
    Page: Email Messages Input
    Page: Property Input
    Page: HBase Output
    Page: HBase Row Decoder
    Page: Add XML
    Page: Execute a process
    Page: Rule Accumulator
    Page: Salesforce Update
    Page: Job Executor
    Page: Multiway Merge Join
    Page: Set field value
    Page: Java Filter
    Page: Dummy (do nothing)
    Page: Call DB Procedure
    Page: Append streams
    Page: Closure Generator
    Page: Insert - Update
    Page: Knowledge Flow
    Page: LDIF Input
    Page: OLAP Input
    Page: Set Session Variables
    Page: Select Values
    Page: Get Files Rows Count
    Page: MySQL Bulk Loader
    Page: Mapping Output
    Page: Output Steps Metrics
    Page: Dynamic SQL row
    Page: Generate Random Value
    Page: Get SubFolder names
    Page: LucidDB bulk loader
    Page: Number range
    Page: Get table names
    Page: MongoDB Input
    Page: Row denormaliser
    Page: HBase Input
    Page: LDAP Input
    Page: HTTP Client
    Page: CSV File Input
    Page: Get File Names
    Page: Merge rows
    Page: Write to log (step)
    Page: Get ID from Slave Server
    Page: Mondrian Input
    Page: Identify last row in a stream
    Page: Execute SQL script
    Page: Table Compare
    Page: Regex Evaluation
    Page: Edi to XML
    Page: Avro Output
    Page: Infobright Loader
    Page: LucidDB Streaming Loader (Deprecated)
    Page: XSL Transformation
    Page: LDAP Output
    Page: Send message to Syslog
    Page: Splunk Input
    Page: Unique Rows (HashSet)
    Page: S3 File Output
    Page: Microsoft Excel Writer
    Page: Delay row
    Page: Row Normaliser
    Page: Run SSH commands
    Page: SAP Input (Deprecated)
    Page: Flattener
    Page: Change file encoding
    Page: Credit card validator
    Page: Simple Mapping
    Page: Get System Info
    Page: OpenERP Object Output (Deprecated)
    Page: ETL Metadata Injection
    Page: Sort rows
    Page: Data Grid
    Page: Access Input
    Page: Palo Dimension Output (Deprecated)
    Page: Group By
    Page: Teradata Fastload Bulk Loader
    Page: SAS Input
    Page: Oracle Bulk Loader
    Page: XML Input Stream (StAX)
    Page: MonetDB bulk loader
    Page: Script
    Page: Analytic Query
    Page: Combination lookup-update
    Page: Rule Executor
    Page: ElasticSearch Bulk Insert
    Page: Vertica Bulk Loader
    Page: Splunk Output
    Page: XSD Validator
    Page: Value Mapper
    Page: Text File Output (Deprecated)
    Page: Access Output
    Page: Table Output
    Page: JSON Input
    Page: Web services lookup
    Page: Metadata Structure of Stream
    Page: Update
    Page: Secret Key Generator
    Page: Modified Java Script Value
    Page: Delete
    Page: Block this step until steps finish
    Page: Cassandra Output
    Page: Check if a column exists
    Page: User Defined Java Expression
    Page: Mail Validator
    Page: Get rows from result
    Page: De-serialize from file
    Page: Hadoop File Input
    Page: Hadoop File Output
    Page: Google Analytics
    Page: Join Rows (Cartesian product)
    Page: Generate Rows
    Page: Zip file (step)
    Page: Database Join
    Page: User Defined Java Class
    Page: Concat Fields
    Page: File exists
    Page: S3 CSV Input
    Page: Add sequence
    Page: R script executor
    Page: Table Agile Mart
    Page: ESRI Shapefile Reader
    Page: Salesforce Input
    Page: Add a checksum
    Page: Pentaho Reporting Output
    Page: Aggregate Rows
    Page: Merge Join
    Page: ARFF Output
    Page: If field value is null
    Page: Get files from result
    Page: Execute row SQL script
    Page: Excel Output
    Page: Sample rows
    Page: Data Validator
    Page: Excel Input (XLS, XLSX) including OpenOffice Workbooks (ODS)
    Page: Greenplum
    Page: Database lookup
    Page: Set field value to a constant
    Page: HTTP Post
    Page: Check if file is locked
    Page: OpenERP Object Delete (Deprecated)
    Page: Detect empty stream
    Page: SAP HANA Bulk Loader
    Page: Get repository names
    Page: Socket reader
    Page: Null If
    Page: Fixed File Input
    Page: Generate random credit card numbers
    Page: Properties Output
    Page: Avro Input (Deprecated)
    Page: Unique Rows
    Page: Load file content in memory
    Page: Strings cut
    Page: Rest Client
    Page: Table Exists
    Page: Synchronize after merge
    Page: Get Data From XML
    Page: Automatic Documentation Output
    Page: Memory Group by
    Page: Mapping
    Page: Split field to rows
    Page: CouchDB Input
    Page: MongoDB Output
    Page: Palo Cell Input (Deprecated)
    Page: Dimension Lookup-Update
    Page: Get Variable
    Page: Set Variables
    Page: SFTP Put
    Page: Add value fields changing sequence
    Page: Serialize to file
    Page: Mapping Input
    Page: Ingres VectorWise Bulk Loader
    Page: MapReduce Input
    Page: Google Docs Input
    Page: Formula
    Page: Calculator
    Page: Copy rows to result
    Page: Replace in String
    Page: XML Join
    Page: Injector
    Page: Get Session Variables