...

Optimized previews (smart samples) can only be generated while the workbook is running, and they are generated for the data source sheets and kept sheets. Optimized preview generation can be disabled by setting the property das.sampling.lookahead.maxdepth to 0, which can be useful for time-critical workbooks.

The initial sample for a data link isn't representative. Currently, the sample is just the first k records of the data source. Once the data link is used in a workbook, an optimized preview is generated for it. By default, Datameer reads 5 GB (5368709120 bytes) of the file and then selects 5000 rows as sample data.
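To illustrate why a sample made of the first k records may not be representative, here is a minimal Python sketch. It is not Datameer's implementation; the helper `first_k_sample` and the example records are hypothetical and only show the general effect on a sorted data source.

```python
from itertools import islice


def first_k_sample(records, k):
    """Return the first k records of an iterable (hypothetical helper)."""
    return list(islice(records, k))


# Hypothetical data source sorted by year: the early rows all share one value.
records = [{"year": 2019}] * 8 + [{"year": 2020}] * 8

sample = first_k_sample(records, 5)

# Distinct values seen in the sample versus in the full source.
years_in_sample = {r["year"] for r in sample}
years_in_source = {r["year"] for r in records}

print(years_in_sample)
print(years_in_source)
```

In this sketch the sample only ever sees the first year, even though the source contains two, which is the kind of skew an optimized preview is meant to reduce.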

The amount of data that Datameer reads for a data link can be adjusted. If a cluster is busy, the amount can be decreased to improve performance by adding or editing the following custom property:

The value of the property is specified in bytes, e.g., 1073741824 = 1 GB.


das.splitting.datalink.max-sample-size=

This property can be added at a server level under the Administration tab -> Hadoop Cluster -> Configuration -> Custom Properties or at a job level through the data link wizard under Schedule -> Advanced -> Custom Properties.
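Because the property value is given in bytes, a small helper can reduce conversion mistakes. The following Python sketch is only a convenience for producing the property line; the function name `max_sample_size_property` is hypothetical, and it assumes 1 GB means 1024^3 bytes, which matches the 1073741824 and 5368709120 figures above.

```python
GIB = 1024 ** 3  # 1073741824 bytes, the "1GB" figure used on this page


def max_sample_size_property(gigabytes):
    """Build the custom property line for a sample size given in GB.

    Hypothetical helper: it only formats the text, it does not talk to Datameer.
    """
    size_bytes = int(gigabytes * GIB)
    if size_bytes <= 0:
        raise ValueError("sample size must be positive")
    return f"das.splitting.datalink.max-sample-size={size_bytes}"


line_1gb = max_sample_size_property(1)
line_5gb = max_sample_size_property(5)  # same size as the documented default

print(line_1gb)
print(line_5gb)
```

The resulting line can then be pasted into the Custom Properties field at the server level or the job level, as described above.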

Sampling from Partitions

...