Supported Data Sources
Supported Data Types
Format / System | Importing and exporting | Product tested version | Datameer version availability | Notes | ||
---|---|---|---|---|---|---|
Import | Export | Import | Export | |||
Amazon Redshift | Â | Â | 8.2, 8.4 | Native Amazon Redshift JDBC 4.1 driver or a PostgreSQL jdbc driver can be used. | ||
Apache Avro | Supports default compression types. | |||||
Apache Web Server Logs | ||||||
Azure Blob Storage | (For HDP2.0, 2.2 and CDH+4 users) Contact Datameer services for info. | |||||
Cassandra | 0.6.5 | Public available plugin, see https://github.com/zznate/datameer-cassandra-plugin | ||||
Cobol Copybooks | ||||||
CSV/TSV, etc. | Supports default compression types. | |||||
DB2 | 8.1 (since 1.3.9), 9.7 Express-C | |||||
Excel Workbooks | Â | 2007 and after | ||||
Fixed Width Text | ||||||
Google BigQuery | 7.5 | |||||
Google Spreadsheets | ||||||
Greenplum | Â | HD 1.1, HD 2.1, 4.1.1.1 | ||||
HBase | 0.98.x, 0.96.x, 0.94.x, 0.92.x, 0.90.x | In order to satisfy the classloader requirements, hbase-protocol.jar must be included in Hadoop's classpath and the root Datameer classpath (/etc/custom-jars) for version 0.96.1 to 0.98.0
| ||||
Hive JDBC | Â | |||||
Hive Metastore | Â | Â | 0.13, 0.12, 0.11, 0.10, 0.9, 0.8, 0.7 | Export file format: org.apache.hadoop.hive.ql.io.HiveIgnoreKeyTextOutputFormat | ||
HiveServer2 | Â | Â | 0.13, 0.14, 1.1.0.x | |||
HSQL-DB | ||||||
HTML | Â | |||||
JSON | Â | |||||
Log4j Log File | Â | Â | ||||
MBOX (email archive files) | ||||||
MS IIS Web Server Logs | ||||||
MSSQL | SQL Express 2005 and 2008 | |||||
MySQL | 5.5, 5.6, 5.7 | |||||
Netezza | Â | Â | 6.0.6 | |||
Oracle | 10g XE, 11g XE | |||||
OpenStack Swift | ||||||
ORC (Optimized Row Columnar) | 7.1 |
| ||||
Parquet | Â | Â | Parquet 2.1 |
| ||
PostgreSQL | Â | Â | 9.0.x | |||
PowerBI | Â | |||||
RCFile (Record Columnar File) | Â | 7.1 | Export supported in conjunction with Hive only for existing partitioned Hive tables. | |||
Sequence Files with Metadata | Supports default compression types. | |||||
Snowflake | 7.1 | 7.1 | ||||
Spark | ||||||
Sybase IQ | Â | Â | 12.7 | |||
Tableau TDE | ||||||
Tableau TDSX | ||||||
Tableau Hyper | 7.4 | Hadoop cluster native system library requirements:
| ||||
Teradata | Â | 12 & 13 | Teradata database needs to be configured to support the appropriate character set. | |||
Teradata Aster | 5.0 | |||||
Text Files | Supports default compression types. | |||||
Vertica | 5, 6.0, 6.1 | |||||
XML | Â | Supports default compression types. |
Supported File Protocols
Protocol | Input | Output |
---|---|---|
File | ||
HDFS | ||
SSH (SCP and SFTP) | ||
S3 |
Supported file compression codecs
Codec | Input | Output | Default compression | Notes |
---|---|---|---|---|
.gz | ||||
.bz2 | ||||
.lzo | Additional native libraries are required. | |||
Snappy | ||||
.zip |
| |||
.Z |
* LZO: tested with Fedora lzo.i386 v2.02-3.fc8 and http://github.com/kevinweil/hadoop-lzo (state from 2010-Jun-20)
Supported File Systems
Customer | File system | Special Hadoop configuration |
---|---|---|
appistry | storage:/ | fs.storage.impl=org.apache.hadoop.fs.appistry.FabricStorageFileSystem |