Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

Datameer's Amazon S3 Native connector uses multipart technology to boost the performance. Those using the S3 connector might think about updating to this S3 Native connector.

...

Configuring S3 Native as a Connection

In order to import and export from a custom protocolS3 Native, you must first create a connection.

  1. Click the + (plus) button and select Connection or right-click in the browser and select Create new > Connection.
  2. From drop-down list, select  S3 Native  as the connection type. 
  3. Select the authentication process. 
    If using an access key and secret key, enter those below. 
    If using IAM, Datameer's S3 client uses the instance profile credentials to sign and authenticate the S3 requests.
    Select an option for the algorithm encryption method, enter the AWS region, and enter the S3 bucket name. 

    Anchor
    uploading_parameters
    uploading_parameters
    Enter the uploading parameters.

    Planing for memory usage:

    By default, Datameer starts 3 threads per stream with each stream having a queue capacity of 3 tasks. Each part (the buffer size) is set at 10MB. The result consumes 3x3x10MB = 90MB RAM.

    Size planning:

    Amazon sets a limit of 10,000 parts for any multipart upload job. As a result, each Datameer task can only upload as a 100GB (10,000*10MB = 100GB) file using the default settings. Increase the part size setting if you have a larger file. Note that this is per Datameer task. If you have five exporting tasks, the original worksheet data is divided into five with each having a 100GB limit.

    These settings can be overridden in each export job.

  4. If required, add a description and click Save.

...