Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

Description

...

 

Use the HTTP job entry to get retrieve a file from a web Web server using the HTTP protocol. This job entry could be used to access data on partner Web sites.

Options

...

Option

...

Description

...

For example, the daily data export or daily list of customers is located at a specified Web site. Also, SaaS providers may give you a URL to locate a report. You can call that URL to retrieve an Excel file or zip file that contains the data. Salesforce requires that you use SOAP APIs to retrieve data.

If HTTP traffic is too heavy in your corporate environment, you may choose to use a proxy server with HTTP authentication.

General

Option

Definition

Name of job entry

The name of the job entry. This name has to be unique in a single job. A job entry can be placed several times on the canvas, however it will be the same job entry.

URL

The HTTP URL to use of the file to retrieve, or the directory name to store an uploaded file to (for example: http://kettle.pentaho.org)

Run for every result row?

Check this if you want to run this job entry for every row that was generated by a previous transformation. Use the "Copy rows to result". If selected, an HTTP request will be made for each result. Otherwise, the file is only retrieved once

Input field which contains URL

The fieldname in the result rows to get the URL from

Target filename

The target filename. If the previous option is selected, the field specified here will determine the file URL for each row

Username

If the site requires authentication, use this username to log in

Password

If a username is defined, this is the password for it

Proxy server for upload

The URL of a proxy server that you want to connect to the HTTP URL through

Proxy port

If a proxy server is defined, this is the port number it listens on

Ignore proxy for hosts

A regular expression list of exceptions for proxy redirection. This may be useful when working on an intranet or VPN

Upload file

If you are uploading a file, this will be its name on the remote server

Target file

If you are downloading a file, this its name on your local filesystem

Append to specified target file Append to ?

If selected, and if the target file if it already exists, PDI will append all new data to the end of the file

Add date and time to target filename Check this if you want to add file name?

If selected, the date and time of the HTTP request (in yyyMMdd_HHmmss format) will be added to the target filename .

Target filename file extension

Specify the target filename extension in case you're adding a date and time to the filename

Upload file

 

Username

The username to authenticate with. For Windows Domains, put the Domain in from of the user like this DOMAIN\Username

Password

The password to authenticate with.

Proxy server for upload

The HTTP proxy server name or IP address

Proxy port

The HTTP proxy port to use (usually 8080)

Ignore proxy for hosts

Specify a regular expression matching the hosts you want to ignore, separated. For example 127\.0\..* If the previous option is selected, this field specifies the extension (letters after the dot) of the target filename

Add filename to result filename

Any files that are copied will appear as a result from this step; shows a list of files that were copied in this step

Headers

Option

Definition

#

Order that the header should be processed

Name

The name of this HTTP header

Value

The actual header value to pass