Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

Scheduling Jobs with the Cluster

...

Info
titleINFO

Set here how jobs are scheduled for the Hadoop cluster to help optimize jobs and priorities.

No

...

Impersonation

If not using impersonation, you can set the scheduling of jobs for specific cluster queues at either a global or per job level.

Global

...

Level

  1. Open the "Admin" tab.
  2. Select Hadoop "Cluster Configuration" from the side menu.
  3. Add the following property in the "Custom Properties space" section.

    teztitle
    Panel
    titleTez
    Queue Property

    das.job.queue.name=<cluster queue name> 

    Panel

    MapReduce

     mapreduce.job.quename=<cluster queue name>

...


Job Level

  1. Navigate through the job wizard Job Wizard when setting up or configuring a job.
  2. Add the properties listed above in the "Custom Properties space" tab.
    Example:
     

Impersonation

Info
titleINFO

Datameer X users that are running impersonation don't need to set any scheduling properties in Datameer. Jobs coming from Datameer X already are labeled and all configuration for the queues are made on the Hadoop cluster itself. 


Finding the Optimal Split Size/Split Count

The optimal split size and count for a Hadoop job is calculated by Hadoop from the values for max/min split size and max/min split count.

...