Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.
Info
titleINFO

Secure File Transfer Protocol (SFTP) is a network protocol that provides file accessfile transfer, and file management over any reliable data stream.

Table of Contents

Custom Property Recommendation

Tip
titleTIP

Ingesting many files via the SFTP connection will open and close SSH connections to the server for each file. These connection attempts might be refused with many parallel tasks running, based on the SSH server settings. For that we recommend to set the following custom property:

No Format
'das.cluster.hadoop.fs.cache.scheme.whitelist=scp,sftp'

This allows using the Hadoop FileServer cache on the cluster; with the FileServer cache enabled for SCP/ SFTP scheme, there will be a single SSH connection for each 'das.cluster.hadoop.fs.cache.scheme.whitelist=scp,sftp'.

Prerequisites

Set up the Authentication Mechanism 

...

  1. Click the + (plus) button and select  Connection or right-click in the browser and select Create new > Connection.
  2. From drop-down list, select SFTP as the connection type.
  3. Enter the SFTP host name(s), the port number, authentication credentials, ssh key (if needed), and the root path prefix.
    Select if the connection to be used for import, export, or both.
  4. If required, add a description and click Save.

...