GROUP_JACCARD_DIST

GROUP_JACCARD_DIST

This function has been removed as of Datameer v6.3

Syntax

GROUP_JACCARD_DIST(<string>)

Description

Computes the Jaccard distance of all values in a group. The Jaccard distance between two sets is the ratio of the size of their intersection to the size of their union. For example, the Jaccard distance between two species is 1-(number of regions where both species are present)/(number of regions where at least one species is present).

This is an aggregate function.

Example

Given the following data:

Groups

 

Groups

 

group1

a

group1

b

group1

c

group1

c

group1

e

group1

g

group1

g

group1

g

group2

a

group2

c

group2

d

group2

h

group2

h

group2

h

group2

m

group2

n

First create a group using GROUPBY(#RawData!Groups)

Groups

Groups

group1

group2

Then use the GROUP_JACCARD_DIST(#RawData!Letters), and the result is the Jaccard distance of the group created from the column in relation to the GROUPBY() column.

Groups

JACCARD_DIST

Groups

JACCARD_DIST

group1

0.4

group2

0.166666666666666666