Skip to end of metadata
Go to start of metadata

You are viewing an old version of this page. View the current version.

Compare with Current View Page History

« Previous Version 16 Next »

INFO

Datameer Spotlight gives organizations fast access and deep visibility into all of their enterprise data assets - whether in the cloud or on-premises - via a single unified self-service platform. With Datameer Spotlight business teams can discover, access, collaborate and analyze more data for faster, more trusted cloud analytics while eliminating complex data movement and maintaining strong governance.

Datameer X Prerequisites

To connect to Datameer Spotlight, the following prerequisites must be fulfilled:

  • your Datameer X instance must be from version 7.4 or higher
  • your Datameer X cluster must be run in grid mode 
  • your Hadoop cluster/ HDFS must be configured to allow access to the data within the HDFS
  • the plug-in 'Datameer DWH API' (including two extensions) must be installed
    NOTE: Please contact the Datameer Service to receive the plug-in.
  • a job must be run at least once in Datameer X and also be kept
  • any data you wish to access through Spotlight must be in the 'Parquet' format (this has been the default format since Datameer 6.3)
  • your user must at least have the role 'ANALYST' to access the data:
    • read-only access to the Datameer folders and files

Datameer X Running on Hadoop/ HDFS

The following applies for running Datameer X on Hadoop/ HDFS:

  • the Datameer X instance must be reachable:
    • the REST API endpoint URL
    • user name
    • password
  • the Hadoop/ HDFS instance must be reachable:
    • the Hadoop NameNodes
    • the Hadoop DataNodes
  • this includes being able to resolve hostnames
  • all ports need to be accessible (Note that the ports might have been changed by your in-house administrators for security reasons. For specific Hadoop vendors other ports might be valid. Please view also the Hadoop vendor documentation): 
    • Hadoop NameNode client port: normally 8020, 9000 or 54310
    • Hadoop DataNodes: normally 50010 and 50020

Datameer X Running on Amazon EMR/ S3

The following applies for running Datameer X von EMR/ S3:

  • Spotlight must be able to read the data files at the S3 folder/ file locations, provided by the Datameer DWH API:
    • access key/ secret must be known in the appropriate account that can read from Datameer X's internal storage bucket
    • have to be set as connection options when setting up the Datameer X connection
  • No labels