Supported Data Sources
Supported Data Types
Format / System | Importing and exporting | Product tested version | Datameer version availability | Notes | ||
|---|---|---|---|---|---|---|
| Import | Export | Import | Export | |||
| Amazon Redshift | 8.2, 8.4 | Native Amazon Redshift JDBC 4.1 driver or a PostgreSQL jdbc driver can be used. | ||||
| Apache Avro | Supports default compression types. | |||||
| Apache Web Server Logs | ||||||
| Azure Blob Storage | (For HDP2.0, 2.2 and CDH+4 users) Contact Datameer services for info. | |||||
| Cassandra | 0.6.5 | Public available plugin, see https://github.com/zznate/datameer-cassandra-plugin | ||||
| Cobol Copybooks | ||||||
| CSV/TSV, etc. | Supports default compression types. | |||||
| DB2 | 8.1 (since 1.3.9), 9.7 Express-C | |||||
| Excel Workbooks | 2007 and after | |||||
| Fixed Width Text | ||||||
| Google BigQuery |
| 7.5 | ||||
| Google Spreadsheets |
| |||||
| Greenplum | HD 1.1, HD 2.1, 4.1.1.1 | |||||
| HBase | 0.98.x, 0.96.x, 0.94.x, 0.92.x, 0.90.x | In order to satisfy the classloader requirements, hbase-protocol.jar must be included in Hadoop's classpath and the root Datameer classpath (/etc/custom-jars) for version 0.96.1 to 0.98.0
| ||||
| Hive JDBC | ||||||
| Hive Metastore | 0.13, 0.12, 0.11, 0.10, 0.9, 0.8, 0.7 | Export file format: org.apache.hadoop.hive.ql.io.HiveIgnoreKeyTextOutputFormat | ||||
| HiveServer2 | 0.13, 0.14, 1.1.0.x | |||||
| HSQL-DB | ||||||
| HTML | ||||||
| JSON | ||||||
| Log4j Log File | ||||||
| MBOX (email archive files) | ||||||
| MS IIS Web Server Logs | ||||||
| MSSQL | SQL Express 2005 and 2008 | |||||
| MySQL | 5.5, 5.6, 5.7 | |||||
| Netezza | 6.0.6 | |||||
| Oracle | 10g XE, 11g XE | |||||
| OpenStack Swift | ||||||
| ORC (Optimized Row Columnar) | 7.1 |
| ||||
| Parquet | Parquet 2.1 |
| ||||
| PostgreSQL | 9.0.x | |||||
| PowerBI | ||||||
| RCFile (Record Columnar File) | 7.1 | Export supported in conjunction with Hive only for existing partitioned Hive tables. | ||||
| Sequence Files with Metadata | Supports default compression types. | |||||
| Snowflake | 7.1 | 7.1 | ||||
| Spark | ||||||
| Sybase IQ | 12.7 | |||||
| Tableau TDE | ||||||
| Tableau TDSX | ||||||
| Tableau Hyper | 7.4 | Hadoop cluster native system library requirements:
| ||||
| Teradata | 12 & 13 | Teradata database needs to be configured to support the appropriate character set. | ||||
| Teradata Aster | 5.0 | |||||
| Text Files | Supports default compression types. | |||||
Vertica |
|
| 5, 6.0, 6.1 | |||
XML |
|
| Supports default compression types. | |||
Supported File Protocols
Protocol | Input | Output |
|---|---|---|
File |
|
|
HDFS |
|
|
SSH (SCP and SFTP) |
|
|
S3 |
|
|
Supported file compression codecs
Codec | Input | Output | Default compression | Notes |
|---|---|---|---|---|
.gz |
|
| ||
.bz2 |
|
| ||
.lzo |
|
| Additional native libraries are required. | |
| Snappy |
| |||
.zip |
|
| ||
| .Z |
* LZO: tested with Fedora lzo.i386 v2.02-3.fc8 and http://github.com/kevinweil/hadoop-lzo (state from 2010-Jun-20)
Supported File Systems
Customer | File system | Special Hadoop configuration |
|---|---|---|
appistry | storage:/ | fs.storage.impl=org.apache.hadoop.fs.appistry.FabricStorageFileSystem |