Sqoop Export
The Sqoop Export job allows you to export data from Hadoop into an RDBMS using Apache Sqoop. This job has two setup modes:
- Quick Mode provides the minimum options necessary to perform a successful Sqoop export.
- Advanced Mode's default view provides options to better control your Sqoop export. Advanced Mode also has a command line view, which allows you to reuse an existing Sqoop command from the command line.
For additional information about Apache Sqoop, visit http://sqoop.apache.org/.
Quick Setup
Option | Definition |
---|---|
Name | The name of this job as it appears in the transformation workspace. |
Namenode Host | Host name or IP address of the Hadoop NameNode. |
Namenode Port | Port number of the Hadoop NameNode. |
Jobtracker Host | Host name of the Hadoop JobTracker. |
Job Tracker Port | Port number of the Hadoop JobTracker. |
Export Directory | Path of the directory within HDFS to export from. |
Database Connection | Select the database connection to export to. Click Edit... to modify an existing connection, or click New... to create a new connection from this dialog. |
Table | Destination table to export into. If the destination database requires it, a schema may be supplied in the format SCHEMA.TABLE_NAME. This table must exist and its structure must match the input data's format. |
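
For orientation, the Quick Setup fields correspond roughly to the arguments of a plain `sqoop export` command. The sketch below only assembles and prints such a command; it does not run it. All host names, ports, the JDBC URL, the user name, and the table name are placeholder assumptions, and the way this job passes the NameNode and JobTracker settings to Sqoop internally is not described here. The `-fs` and `-jt` options are generic Hadoop arguments, which Sqoop expects before any tool-specific arguments.

```shell
#!/bin/sh
# Placeholder values (assumptions for illustration, not defaults of the job).
NAMENODE="hdfs://namenode.example.com:8020"           # Namenode Host and Port
JOBTRACKER="jobtracker.example.com:8021"              # Jobtracker Host and Port
EXPORT_DIR="/user/example/sales"                      # Export Directory
JDBC_URL="jdbc:mysql://db.example.com:3306/reporting" # Database Connection
DB_USER="etl_user"                                    # Database Connection user
TABLE="DAILY_TOTALS"                                  # Table (must already exist)

# Assemble the argument list; generic Hadoop options (-fs, -jt) come first.
# -P makes Sqoop prompt for the password instead of placing it on the command line.
set -- sqoop export \
  -fs "$NAMENODE" \
  -jt "$JOBTRACKER" \
  --connect "$JDBC_URL" \
  --username "$DB_USER" \
  -P \
  --table "$TABLE" \
  --export-dir "$EXPORT_DIR"

# Print the command for review rather than executing it.
CMD_LINE="$*"
echo "$CMD_LINE"
```

Replacing the final `echo` with `"$@"` would execute the command on a machine where Sqoop is installed and the placeholders have been replaced with real values.
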
Advanced Setup
Option | Definition |
---|---|
Default/List view | List of property and value pairs that you can modify to suit your needs, including options to configure an export from Hive or HBase. |
Command line view | Field that accepts command line arguments, typically used to paste an existing Sqoop command line. |
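
As an illustration of the kind of existing command that could be reused in the command line view, the sketch below assembles a `sqoop export` command that uses a few standard Sqoop export arguments (`--input-fields-terminated-by`, `--update-key`, `--update-mode`, `--num-mappers`). The connection details, table, key column, and directory are placeholder assumptions. Whether the field expects the leading `sqoop export` or only the arguments that follow it is not stated here, so treat that as something to verify in the dialog.

```shell
#!/bin/sh
# Placeholder values (assumptions for illustration only).
JDBC_URL="jdbc:mysql://db.example.com:3306/reporting"
DB_USER="etl_user"
TABLE="DAILY_TOTALS"
EXPORT_DIR="/user/example/sales"
KEY_COLUMN="id"   # hypothetical key column used to match existing rows

# --input-fields-terminated-by : field delimiter of the files in HDFS
# --update-key / --update-mode : update matching rows, insert the rest
# --num-mappers                : number of parallel export tasks
set -- sqoop export \
  --connect "$JDBC_URL" \
  --username "$DB_USER" \
  -P \
  --table "$TABLE" \
  --export-dir "$EXPORT_DIR" \
  --input-fields-terminated-by '\t' \
  --update-key "$KEY_COLUMN" \
  --update-mode allowinsert \
  --num-mappers 4

# Print the command so it can be reviewed or copied; nothing is executed.
CMD_LINE="$*"
echo "$CMD_LINE"
```
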