Supported Data Sources

Supported Data Types

Format / System

Importing and exporting

Product tested version

Datameer version availability

Notes

Import

Export

Import

Export

Amazon Redshift

 

 

8.2, 8.4

 

 

Either the native Amazon Redshift JDBC 4.1 driver or a PostgreSQL JDBC driver can be used.

Apache Avro

 

 

 

Supports default compression types.

Apache Web Server Logs

 

 

 

 

Azure Blob Storage

 

 

 

For HDP 2.0, HDP 2.2, and CDH+4 users: contact Datameer services for information.

Cassandra

0.6.5

 

 

Publicly available plugin; see https://github.com/zznate/datameer-cassandra-plugin

COBOL Copybooks

 

 

 

 

CSV/TSV, etc.

 

 

 

Supports default compression types.

DB2

8.1 (since 1.3.9), 9.7 Express-C

 

 

 

Excel Workbooks

 

2007 and after

 

 

 

Fixed Width Text

 

 

 

 

Google BigQuery

 

7.5

 

 

Google Spreadsheets

 

 

 

 

Greenplum

 

HD 1.1, HD 2.1, 4.1.1.1

 

 

 

HBase

0.98.x, 0.96.x, 0.94.x, 0.92.x, 0.90.x

 

 

For HBase versions 0.96.1 through 0.98.0, hbase-protocol.jar must be included in both Hadoop's classpath and the root Datameer classpath (/etc/custom-jars) to satisfy the classloader requirements.

Hive JDBC

 

 

 

 

 

Hive Metastore

 

 

0.13, 0.12, 0.11, 0.10, 0.9, 0.8, 0.7

 

 

Export file format: org.apache.hadoop.hive.ql.io.HiveIgnoreKeyTextOutputFormat

HiveServer2

 

 

0.13, 0.14, 1.1.0.x

 

 

 

HSQL-DB

 

 

 

 

HTML

 

 

 

 

 

JSON

 

 

 

 

 

Log4j Log File

 

 

 

 

 

 

MBOX (email archive files)

 

 

 

 

MS IIS Web Server Logs

 

 

 

 

MSSQL

SQL Server Express 2005 and 2008

 

 

 

MySQL

5.5, 5.6, 5.7

 

 

 

Netezza

 

 

6.0.6

 

 

 

Oracle

10g XE, 11g XE

 

 

 

OpenStack Swift

 

 

 

 

ORC (Optimized Row Columnar)

 

 

7.1

  • Only supported in conjunction with Hive.

  • Supports default compression types.

  • Export supported only for existing partitioned Hive tables.

Parquet

 

 

Parquet 2.1

 

 

PostgreSQL

 

 

9.0.x

 

 

 

PowerBI

 

 

 

 

 

RCFile (Record Columnar File)

 

 

 

7.1

Export is supported only in conjunction with Hive, and only for existing partitioned Hive tables.

Sequence Files with Metadata

 

 

 

Supports default compression types.

Snowflake

 

7.1

7.1

 

Spark

 

 

 

 

Sybase IQ

 

 

12.7

 

 

 

Tableau TDE

 

 

 

 

Tableau TDSX

 

 

 

 

Tableau Hyper

 

 

7.4

Hadoop cluster native system library requirements:

  • GNU C Library (libc6) version >= 2.15

  • GNU Standard C++ Library v3 (libstdc++6) version >= 6.1.0

Teradata

 

12 and 13

 

 

The Teradata database must be configured to support the appropriate character set.

Teradata Aster

5.0

 

 

 

Text Files

 

 

 

Supports default compression types.

Vertica

5, 6.0, 6.1

 

 

 

XML

 

 

 

 

Supports default compression types.

 

Supported File Protocols

Protocol

Input

Output

File

HDFS

SSH (SCP and SFTP)

S3
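
The Tableau Hyper entry in the data sources table lists minimum native library versions for the Hadoop cluster (GNU C Library >= 2.15). As a rough aid, the following sketch shows one way a node could be checked against the glibc minimum. The helper names parse_version and meets_minimum and the constant MIN_GLIBC are hypothetical and not part of Datameer; the check relies only on Python's standard platform.libc_ver() call and assumes the reported version is a plain dotted number.

```python
import platform


def parse_version(text):
    """Turn a dotted version string such as '2.15' into a tuple of ints."""
    return tuple(int(part) for part in text.split("."))


def meets_minimum(found, minimum):
    """Return True when the dotted version `found` is at least `minimum`."""
    a, b = parse_version(found), parse_version(minimum)
    # Pad the shorter tuple with zeros so '2.15' and '2.15.0' compare equal.
    width = max(len(a), len(b))
    a += (0,) * (width - len(a))
    b += (0,) * (width - len(b))
    return a >= b


# Minimum taken from the Tableau Hyper row (hypothetical constant name).
MIN_GLIBC = "2.15"

if __name__ == "__main__":
    # libc_ver() returns ('', '') when the C library cannot be detected.
    lib, version = platform.libc_ver()
    if lib == "glibc" and version:
        status = "OK" if meets_minimum(version, MIN_GLIBC) else "too old"
        print(f"glibc {version}: {status} (need >= {MIN_GLIBC})")
    else:
        print("Could not detect glibc on this machine.")
```

The versions are compared as integer tuples rather than as strings because a plain string comparison would rank 2.9 above 2.15. The same meets_minimum helper could be applied to a libstdc++6 package version obtained by other means, for example against the 6.1.0 minimum.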