[ https://issues.apache.org/jira/browse/FLINK-5785?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15901830#comment-15901830 ]
Stavros Kontopoulos edited comment on FLINK-5785 at 3/8/17 8:25 PM:
--------------------------------------------------------------------
[~beera] Let me know if you would like any help.
was (Author: skonto):
[~beera] If you do that, please follow the approach I use here for raising exceptions:
https://github.com/skonto/flink/blob/6736a66ae1bd2c0efbaa29cf170cabd18b281a8a/flink-libraries/flink-ml/src/main/scala/org/apache/flink/ml/preprocessing/Normalizer.scala#L127
I will finish that PR for unit scaling ASAP.
> Add an Imputer for preparing data
> ---------------------------------
>
> Key: FLINK-5785
> URL: https://issues.apache.org/jira/browse/FLINK-5785
> Project: Flink
> Issue Type: New Feature
> Components: Machine Learning Library
> Reporter: Stavros Kontopoulos
> Assignee: Stavros Kontopoulos
>
> We need to add an Imputer as described in [1].
> "The Imputer class provides basic strategies for imputing missing values,
> either using the mean, the median or the most frequent value of the row or
> column in which the missing values are located. This class also allows for
> different missing values encodings."
> References
> 1. http://scikit-learn.org/stable/modules/preprocessing.html#preprocessing
> 2. http://scikit-learn.org/stable/auto_examples/missing_values.html#sphx-glr-auto-examples-missing-values-py
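
For illustration only, the column-wise strategies quoted in the description above (mean, median, most frequent value, plus a configurable missing-value encoding) could be sketched roughly as follows. This is plain Python rather than FlinkML code; the names impute_columns and _is_missing are hypothetical and do not come from scikit-learn or Flink, and raising ValueError for an unknown strategy or an all-missing column is an assumption, not behaviour taken from the issue.

```python
import math
from collections import Counter
from statistics import mean, median


def _is_missing(value, missing):
    """Return True if value matches the missing-value encoding (NaN-aware)."""
    if isinstance(missing, float) and math.isnan(missing):
        return isinstance(value, float) and math.isnan(value)
    return value == missing


def impute_columns(rows, strategy="mean", missing=float("nan")):
    """Replace missing entries with a per-column statistic (hypothetical helper)."""
    strategies = {
        "mean": mean,
        "median": median,
        # Ties are broken by first occurrence in the column.
        "most_frequent": lambda xs: Counter(xs).most_common(1)[0][0],
    }
    if strategy not in strategies:
        raise ValueError("unknown strategy: %r" % (strategy,))

    # Compute one fill value per column from the observed entries only.
    fill = []
    for index, column in enumerate(zip(*rows)):
        observed = [v for v in column if not _is_missing(v, missing)]
        if not observed:
            raise ValueError("column %d has no observed values" % index)
        fill.append(strategies[strategy](observed))

    # Build a new table; the input rows are left unmodified.
    return [
        [fill[j] if _is_missing(v, missing) else v for j, v in enumerate(row)]
        for row in rows
    ]
```

Row-wise imputation, which the quoted text also mentions, would apply the same statistic over each row instead of each column.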