Feynman Liang created SPARK-8457:
------------------------------------

             Summary: Documentation for N-Gram feature transformer
                 Key: SPARK-8457
                 URL: https://issues.apache.org/jira/browse/SPARK-8457
             Project: Spark
          Issue Type: Documentation
          Components: ML
            Reporter: Feynman Liang
            Priority: Trivial


Documentation for using n-gram feature transformer needs to be written. In 
particular, we need to describe the input (ArrayType(StringType)) and output 
(ArrayType(ArrayType(StringType)) formats, nullability constraints (input is 
nullable, output is not) and behaviors (null values are ignored in generating 
n-grams), as well as edge cases (e.g. n-gram length > input sequence length 
will  yield a single output n-gram equal to the input sequence).



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to