NGRAM
Syntax
NGRAM(<string, list>;<number for length>;<number for length>;<number for length>;...)
Description
Returns every subsequence of N consecutive characters in a given text, including spaces and punctuation. For subsequences of N words instead, refer to NGRAM_OLD. Datameer tries to match the entire record without automatically adding ^ or $ to the regular expression. If the record spans multiple lines, this must be accounted for with (?s) or line terminators.
Example
Data (string) |
---|
Datameer is the best. |
NGRAM(#RawData!StringColumn;3)
NGRAM() returns (shown in quotes so that spaces are visible) |
---|
"Dat" |
"ata" |
"tam" |
"ame" |
"mee" |
"eer" |
"er " |
"r i" |
" is" |
"is " |
"s t" |
" th" |
"the" |
"he " |
"e b" |
" be" |
"bes" |
"est" |
"st." |
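
The result above can be reproduced outside Datameer with a sliding window over the characters of the string. The following Python sketch is only an illustration of that idea, not Datameer's implementation; the function name char_ngrams is hypothetical, and it assumes a single length N and that spaces and punctuation count as characters, as in the example.

```python
def char_ngrams(text, n):
    """Return every contiguous run of n characters in text, left to right.

    Hypothetical helper for illustration; not part of Datameer.
    """
    if n <= 0:
        raise ValueError("n must be a positive integer")
    # Slide a window of width n across the text. There are
    # len(text) - n + 1 window positions; if the text is shorter
    # than n, the range is empty and no n-grams are returned.
    return [text[i:i + n] for i in range(len(text) - n + 1)]


trigrams = char_ngrams("Datameer is the best.", 3)
print(len(trigrams))   # 19
print(trigrams[:3])    # ['Dat', 'ata', 'tam']
```

The input has 21 characters, so a window of 3 yields 21 - 3 + 1 = 19 values, which matches the number of rows in the table.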