[
https://issues.apache.org/jira/browse/FLINK-2102?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14563243#comment-14563243
]
ASF GitHub Bot commented on FLINK-2102:
---------------------------------------
GitHub user thvasilo opened a pull request:
https://github.com/apache/flink/pull/744
[FLINK-2102] [ml] Add predict operation for LabeledVector
Add a predict function for labeled data to SVM and MLR (multiple linear regression).
For each example in the input DataSet[LabeledVector], these functions return
a pair (Double, Double) holding (truth, prediction).
This output can then be used to evaluate the performance of the Predictors.
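
As a rough illustration of how (truth, prediction) pairs can feed an evaluation step, here is a minimal sketch in plain Scala. It uses local collections rather than a Flink DataSet, and every name in it (the LabeledVector case class, predictOne, predictLabeled, meanSquaredError, the toy weights) is a hypothetical stand-in chosen for this example, not the Flink ML API or the code in this pull request.

```scala
// Hypothetical stand-in for illustration only; NOT the Flink ML LabeledVector class.
final case class LabeledVector(label: Double, features: Vector[Double])

object PredictLabeledSketch {
  // Toy linear scorer standing in for a trained predictor (assumption):
  // the prediction is the dot product of weights and features.
  def predictOne(weights: Vector[Double])(features: Vector[Double]): Double =
    weights.zip(features).map { case (w, x) => w * x }.sum

  // For each labeled example, emit the pair (truth, prediction).
  def predictLabeled(
      weights: Vector[Double],
      data: Seq[LabeledVector]): Seq[(Double, Double)] =
    data.map(lv => (lv.label, predictOne(weights)(lv.features)))

  // Mean squared error computed directly from the (truth, prediction) pairs.
  def meanSquaredError(pairs: Seq[(Double, Double)]): Double =
    if (pairs.isEmpty) 0.0
    else pairs.map { case (t, p) => (t - p) * (t - p) }.sum / pairs.size

  def main(args: Array[String]): Unit = {
    val weights = Vector(2.0, -1.0) // made-up model parameters
    val test = Seq(
      LabeledVector(1.0, Vector(1.0, 1.0)),
      LabeledVector(3.0, Vector(2.0, 0.0))
    )
    val pairs = predictLabeled(weights, test)
    println(pairs)
    println(meanSquaredError(pairs))
  }
}
```

Keeping the true label next to each prediction means the evaluation needs no separate join between the test labels and the predicted values.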
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/thvasilo/flink predict-labeled
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/flink/pull/744.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #744
----
commit a57e464e704fae21b6ebe3c0afed3e21fb15b951
Author: Theodore Vasiloudis <[email protected]>
Date: 2015-05-28T16:51:17Z
Add predict function for labeled data for SVM and MLR.
These functions return for each example in the input DataSet[LabeledVector]
a pair (truth, prediction)
----
> Add predict operation for LabeledVector
> ---------------------------------------
>
> Key: FLINK-2102
> URL: https://issues.apache.org/jira/browse/FLINK-2102
> Project: Flink
> Issue Type: Improvement
> Components: Machine Learning Library
> Reporter: Theodore Vasiloudis
> Assignee: Theodore Vasiloudis
> Priority: Minor
> Labels: ML
> Fix For: 0.9
>
>
> Currently we can only call predict on DataSet[V <: Vector].
> Often, though, we have a DataSet[LabeledVector] that we split into a
> training set and a test set.
> We should be able to make predictions on the test DataSet[LabeledVector]
> without having to transform it into a DataSet[Vector].