Supported Data Types
Format / System | Importing and exporting | Product tested version | Datameer version availability | Notes | ||
---|---|---|---|---|---|---|
Import | Export | Import | Export | |||
Amazon Redshift | 8.2, 8.4 | The native Amazon Redshift JDBC 4.1 driver or a PostgreSQL JDBC driver can be used. | ||||
Apache Avro | Supports default compression types. | |||||
Apache Web Server Logs | ||||||
Azure Blob Storage | (For HDP2.0, 2.2 and CDH+4 users) Contact Datameer services for info. | |||||
Cassandra | 0.6.5 | Publicly available plugin; see https://github.com/zznate/datameer-cassandra-plugin | ||||
Cobol Copybooks | ||||||
CSV/TSV, etc. | Supports default compression types. | |||||
DB2 | 8.1 (since 1.3.9), 9.7 Express-C | |||||
Excel Workbooks | 2007 and after | |||||
Fixed Width Text | ||||||
Google BigQuery | 7.5 | |||||
Google Spreadsheets | ||||||
Greenplum | HD 1.1, HD 2.1, 4.1.1.1 | |||||
HBase | 0.98.x, 0.96.x, 0.94.x, 0.92.x, 0.90.x | For versions 0.96.1 to 0.98.0, hbase-protocol.jar must be included in both Hadoop's classpath and the root Datameer classpath (/etc/custom-jars) to satisfy the classloader requirements. | ||||
Hive JDBC | ||||||
Hive Metastore | 0.13, 0.12, 0.11, 0.10, 0.9, 0.8, 0.7 | Export file format: org.apache.hadoop.hive.ql.io.HiveIgnoreKeyTextOutputFormat | ||||
HiveServer2 | 0.13, 0.14, 1.1.0.x | |||||
HSQL-DB | ||||||
HTML | ||||||
JSON | ||||||
Log4j Log File | ||||||
MBOX (email archive files) | ||||||
MS IIS Web Server Logs | ||||||
MSSQL | SQL Express 2005 and 2008 | |||||
MySQL | 5.5, 5.6, 5.7 | |||||
Netezza | 6.0.6 | |||||
Oracle | 10g XE, 11g XE | |||||
OpenStack Swift | ||||||
ORC (Optimized Row Columnar) | 7.1 | |||||
Parquet | Parquet 2.1 | |||||
PostgreSQL | 9.0.x | |||||
PowerBI | ||||||
RCFile (Record Columnar File) | 7.1 | Export supported in conjunction with Hive only for existing partitioned Hive tables. | ||||
Sequence Files with Metadata | Supports default compression types. | |||||
Snowflake | 7.1 | 7.1 | ||||
Spark | ||||||
Sybase IQ | 12.7 | |||||
Tableau TDE | ||||||
Tableau TDSX | ||||||
Tableau Hyper | 7.4 | Hadoop cluster native system library requirements:
| ||||
Teradata | 12 & 13 | The Teradata database must be configured to support the appropriate character set. | ||||
Teradata Aster | 5.0 | |||||
Text Files | Supports default compression types. | |||||
Vertica | 5, 6.0, 6.1 | |||||
XML | Supports default compression types. | |||||
Supported File Protocols
Protocol | Input | Output |
---|---|---|
File | ||
HDFS | ||
SSH (SCP and SFTP) | ||
S3 |
Note: Datameer supports Bitvise SSH Server/Client on the Windows platform. When creating the connection, specify root paths in a form like: /c:/mydata/folder1
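The root path format shown in the note can be derived from an ordinary Windows path. The following is a minimal sketch, not part of Datameer: `to_ssh_root_path` is a hypothetical helper name, and lowercasing the drive letter is an assumption that simply mirrors the `/c:/mydata/folder1` example.

```python
from pathlib import PureWindowsPath


def to_ssh_root_path(windows_path: str) -> str:
    """Convert an absolute Windows path to the /c:/dir/subdir form.

    Hypothetical helper for illustration only.
    """
    path = PureWindowsPath(windows_path)
    # Require a plain drive letter (e.g. "C:") and an absolute path.
    if len(path.drive) != 2 or not path.root:
        raise ValueError(f"expected an absolute drive path, got {windows_path!r}")
    # Assumption: the drive letter is lowercased, as in the example path.
    drive = path.drive[0].lower() + ":"
    # path.parts[0] is the anchor ("C:\\"); the rest are the folder names.
    folders = path.parts[1:]
    return "/" + "/".join((drive, *folders))


if __name__ == "__main__":
    print(to_ssh_root_path(r"C:\mydata\folder1"))
```

Relative paths and UNC shares are rejected here, since the note only describes drive-based root paths.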
Codec | Input | Output | Default compression | Notes |
---|---|---|---|---|
.gz | ||||
.bz2 | ||||
.lzo | Additional native libraries are required. | |||
Snappy | ||||
.zip | ||||
.Z |
* LZO: tested with Fedora lzo.i386 v2.02-3.fc8 and http://github.com/kevinweil/hadoop-lzo (as of 2010-Jun-20)
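As a rough illustration of how the codec table might be used to pre-check file names before an import, the sketch below matches a file's final suffix against the extensions listed above. It is not Datameer's own detection logic. `listed_codec_suffix` is a hypothetical name, the match is assumed to be case-sensitive (so `.Z` stays distinct), and Snappy is left out because the table gives no file extension for it.

```python
from pathlib import PurePosixPath
from typing import Optional

# Extensions exactly as they appear in the codec table (Snappy has none listed).
LISTED_SUFFIXES = (".gz", ".bz2", ".lzo", ".zip", ".Z")


def listed_codec_suffix(filename: str) -> Optional[str]:
    """Return the file's final suffix if it is one of the listed codecs."""
    suffix = PurePosixPath(filename).suffix
    return suffix if suffix in LISTED_SUFFIXES else None


if __name__ == "__main__":
    for name in ("events.csv.gz", "dump.Z", "notes.txt"):
        print(name, "->", listed_codec_suffix(name))
```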
Supported File Systems
Customer | File system | Special Hadoop configuration |
---|---|---|
appistry | storage:/ | fs.storage.impl=org.apache.hadoop.fs.appistry.FabricStorageFileSystem |
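The "Special Hadoop configuration" column gives a `key=value` setting. Hadoop configuration files express such settings as `<property>` elements, so the sketch below renders the table entry in that form. Where the property should actually be placed (for example a cluster's core-site.xml or Datameer's custom Hadoop properties) is not stated in the table and is left open here; `property_xml` is a hypothetical helper name.

```python
import xml.etree.ElementTree as ET


def property_xml(setting: str) -> str:
    """Render a 'key=value' setting as a Hadoop-style <property> element."""
    name, sep, value = setting.partition("=")
    if not sep or not name or not value:
        raise ValueError(f"expected key=value, got {setting!r}")
    prop = ET.Element("property")
    ET.SubElement(prop, "name").text = name.strip()
    ET.SubElement(prop, "value").text = value.strip()
    return ET.tostring(prop, encoding="unicode")


if __name__ == "__main__":
    print(property_xml(
        "fs.storage.impl=org.apache.hadoop.fs.appistry.FabricStorageFileSystem"
    ))
```

The `fs.<scheme>.impl` key pattern is how Hadoop maps a URI scheme (here `storage`) to a FileSystem implementation class.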