[ 
https://issues.apache.org/jira/browse/FLINK-2102?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14563243#comment-14563243
 ] 

ASF GitHub Bot commented on FLINK-2102:
---------------------------------------

GitHub user thvasilo opened a pull request:

    https://github.com/apache/flink/pull/744

    [FLINK-2102] [ml] Add predict operation for LabeledVector

    Add predict function for labeled data for SVM and MLR.
    
    These functions return for each example in the input DataSet[LabeledVector] 
a pair (Double, Double) - (truth, prediction)
    
    This output can then be used to evaluate the performance of the Predictors

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/thvasilo/flink predict-labeled

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/flink/pull/744.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #744
    
----
commit a57e464e704fae21b6ebe3c0afed3e21fb15b951
Author: Theodore Vasiloudis <[email protected]>
Date:   2015-05-28T16:51:17Z

    Add predict function for labeled data for SVM and MLR.
    
    These functions return for each example in the input DataSet[LabeledVector] 
a pair (truth, prediction)

----


> Add predict operation for LabeledVector
> ---------------------------------------
>
>                 Key: FLINK-2102
>                 URL: https://issues.apache.org/jira/browse/FLINK-2102
>             Project: Flink
>          Issue Type: Improvement
>          Components: Machine Learning Library
>            Reporter: Theodore Vasiloudis
>            Assignee: Theodore Vasiloudis
>            Priority: Minor
>              Labels: ML
>             Fix For: 0.9
>
>
> Currently we can only call predict on DataSet[V <: Vector].
> A lot of times though we have a DataSet[LabeledVector] that we split into a 
> train and test set.
> We should be able to make predictions on the test DataSet[LabeledVector] 
> without having to transform it into a DataSet[Vector]



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to