Azure Data Lake Storage Gen2

INFO

Azure Data Lake Storage Gen2 is a set of capabilities dedicated to big data analytics, built on Azure Blob storage. Data Lake Storage Gen2 is the result of converging the capabilities of the two existing storage services, Azure Blob storage and Azure Data Lake Storage Gen1.

Prerequisites

Preparing the Azure Data Lake Storage

INFO

Prepare your Azure Data Lake Storage instance in your account under https://portal.azure.com/. Find more information about the preparation here and here.

Having the Plug-In Installed 

INFO

The plug-in 'Azure Blob Storage' must be installed in the 'Admin' tab. This comes automatically with your Datameer X distribution.

Configuring Azure Data Lake Storage Gen2 as a Connection

To configure Azure Data Lake Storage Gen2 as a connection:

  1. Click the "+" button and select "Connection", or right-click in the File Browser and select "Create New" → "Connection". The "New Connection" tab appears in the menu bar.
  2. Select "Azure Blob Storage" from the drop-down and confirm with "Next". The type is displayed in the drop-down.
     
  3. Enter the storage account name. 
     
  4. Enter the name of the storage container.
  5. Enter the storage access key.
  6. Enter the root path prefix. 
  7. Select your data transfer channel from the drop-down. 
  8. Select whether you want to use the connection for import, export, or both, and confirm with "Next". The 'Save Connection' tab opens.

  9. If needed, enter a description and confirm with "Next". The 'Save Connection' dialog opens.
     
  10. Select the folder to save the connection, enter a name in "Save as" and confirm with "Save". The connection is saved. Configuring the Azure Data Lake Storage Gen2 connection is finished.
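
As a rough illustration of how the values from steps 3 to 6 relate to each other, the following Python sketch checks them against Azure's naming rules and assembles the `abfss://` URI form used by the Hadoop ABFS driver. The class `AdlsConnectionSettings` and its methods are hypothetical helpers, not part of Datameer X, and whether Datameer X builds this exact URI internally is an assumption. The account and container names in the usage lines are placeholders.

```python
from __future__ import annotations

import base64
import binascii
import re
from dataclasses import dataclass

# Storage account names: 3-24 characters, lowercase letters and digits only.
_ACCOUNT_RE = re.compile(r"[a-z0-9]{3,24}")
# Container names: 3-63 characters, lowercase letters, digits and single
# hyphens, starting and ending with a letter or digit.
# Special containers such as $root are not covered by this sketch.
_CONTAINER_RE = re.compile(r"(?!.*--)[a-z0-9][a-z0-9-]{1,61}[a-z0-9]")


@dataclass(frozen=True)
class AdlsConnectionSettings:
    """Hypothetical holder for the values entered in steps 3 to 6."""

    account_name: str
    container: str
    access_key: str
    root_path_prefix: str = "/"

    def problems(self) -> list[str]:
        """Return human-readable issues; an empty list means the values look plausible."""
        issues = []
        if not _ACCOUNT_RE.fullmatch(self.account_name):
            issues.append("storage account name must be 3-24 lowercase letters or digits")
        if not _CONTAINER_RE.fullmatch(self.container):
            issues.append("container name must be 3-63 lowercase letters, digits or single hyphens")
        if not self.access_key:
            issues.append("access key is empty")
        else:
            try:
                # Account access keys are Base64 strings, so a decode failure
                # usually indicates a copy-and-paste error.
                base64.b64decode(self.access_key, validate=True)
            except binascii.Error:
                issues.append("access key is not valid Base64")
        return issues

    def abfss_uri(self) -> str:
        """Build the ABFS URI for the root path (assumed mapping of the form fields)."""
        path = self.root_path_prefix.strip("/")
        return f"abfss://{self.container}@{self.account_name}.dfs.core.windows.net/{path}"


# Placeholder values for illustration only; the key is a dummy, not a real secret.
dummy_key = base64.b64encode(b"k" * 64).decode()
settings = AdlsConnectionSettings("mystorageacct", "raw-data", dummy_key, "/landing/sales/")
print(settings.problems())
print(settings.abfss_uri())
```

Running such a check before filling in the dialog can catch typos in the account or container name early, since these values are otherwise only verified when the connection is first used.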

Importing Data with an Azure Data Lake Storage Connector

INFO

Find how to import from Azure Data Lake Storage here.

Exporting Data with an Azure Data Lake Storage Connector

INFO

Find how to export to Azure Data Lake Storage here.