Feynman Liang created SPARK-8457:
------------------------------------
Summary: Documentation for N-Gram feature transformer
Key: SPARK-8457
URL: https://issues.apache.org/jira/browse/SPARK-8457
Project: Spark
Issue Type: Documentation
Components: ML
Reporter: Feynman Liang
Priority: Trivial
Documentation for using n-gram feature transformer needs to be written. In
particular, we need to describe the input (ArrayType(StringType)) and output
(ArrayType(ArrayType(StringType)) formats, nullability constraints (input is
nullable, output is not) and behaviors (null values are ignored in generating
n-grams), as well as edge cases (e.g. n-gram length > input sequence length
will yield a single output n-gram equal to the input sequence).
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]