[
https://issues.apache.org/jira/browse/FLINK-14152?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Konstantin Knauf updated FLINK-14152:
-------------------------------------
Labels: pull-request-available (was: pull-request-available stale-major)
Removed "stale-critical|major|minor" label in line with
https://issues.apache.org/jira/browse/FLINK-22429.
> Add class for DocCountVectorizerMapper.
> ------------------------------------------
>
> Key: FLINK-14152
> URL: https://issues.apache.org/jira/browse/FLINK-14152
> Project: Flink
> Issue Type: Sub-task
> Components: Library / Machine Learning
> Reporter: Xu Yang
> Priority: Major
> Labels: pull-request-available
> Time Spent: 10m
> Remaining Estimate: 0h
>
> DocCountVectorizerModelMapper is a transformer to converts a document
> to a sparse vector based on the document frequency, word count or
> inverse document frequency of each word in the document.
> * Add DocCountVectorizerModelMapper for the operation of the
> DocCountVectorizerModelMapper.
> * Add DocCountVectorizerModelDataConverter to serialize and deserialize
> model.
> * Add DocCountVectorizerPredictParams for the params of
> DocCountVectorizerModelMapper.
> * Add DocCountVectorizerModelMapperTest for the test example.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)