[ 
https://issues.apache.org/jira/browse/MADLIB-1223?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16427567#comment-16427567
 ] 

ASF GitHub Bot commented on MADLIB-1223:
----------------------------------------

GitHub user njayaram2 opened a pull request:

    https://github.com/apache/madlib/pull/255

     MLP: Remove source table dependency for predicting regression models

    JIRA: MADLIB-1223
    
    We were getting the type of the dependent variable from the
    input table to know if it was an array or not. With this commit, we now get
    this information from the model summary table.
    We also had to add a column in the minibatch preprocessor summary table
    called `dependent_vartype` in order to get the dependent type in the mlp
    code for minibatching.
    
    Co-authored-by: Nikhil Kak <n...@pivotal.io>

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/madlib/madlib 
bugfix/mlp/remove-training-data-dependency

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/madlib/pull/255.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #255
    
----
commit a17a54a4bf4ce0f9b256ccaf32a8a2f61b2c4f18
Author: Nandish Jayaram <njayaram@...>
Date:   2018-04-03T21:32:03Z

    MLP: Remove source table dependency for predicting regression models
    
    JIRA: MADLIB-1223
    
    We were getting the type of the dependent variable from the
    input table to know if it was an array or not. With this commit, we now get
    this information from the model summary table.
    We also had to add a column in the minibatch preprocessor summary table
    called `dependent_vartype` in order to get the dependent type in the mlp
    code for minibatching.
    
    Co-authored-by: Nikhil Kak <n...@pivotal.io>

commit 16a06924dcee1ba273c5e4a49f81ee48cda778cd
Author: Nandish Jayaram <njayaram@...>
Date:   2018-04-04T21:42:38Z

    MLP: Fix bug in array dep var for regression
    
    MLP training for regression fails if the dependent var type is an array.
    This is because the variable dependent_varname was not updated to
    reflect the new column created in the standardized table. This commit
    fixes that issue.
    
    Co-authored-by: Nikhil Kak <n...@pivotal.io>

----


> MLP regression predict fails if input table does not exist
> ----------------------------------------------------------
>
>                 Key: MADLIB-1223
>                 URL: https://issues.apache.org/jira/browse/MADLIB-1223
>             Project: Apache MADlib
>          Issue Type: Bug
>          Components: Module: Neural Networks
>            Reporter: Nikhil
>            Priority: Major
>             Fix For: v1.14
>
>
> If a model is trained with mlp regression and then the input table is 
> dropped, mlp predict fails for that model.
> Ideally the predict function should not depend on the existence of the 
> training data. 
> The predict code for regression only needs to know if the dependent varname 
> type is an array or not. This information can be potentially stored in the 
> model's summary table.
> We also need to make sure that the predict function is backwards compatible. 



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to