[ 
https://issues.apache.org/jira/browse/MAHOUT-918?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13164561#comment-13164561
 ] 

Ted Dunning commented on MAHOUT-918:
------------------------------------

Can you post this as a Review Board review?  There are lots of comments to be 
made.

At a high level, I note the following issues:

1) I don't see a design document.  You cite a few articles but you don't say 
what you are really doing.

2) Is map-reduce an appropriate approach here for model averaging?

3) How do you plan to deal with randomization of data order?

4) There are a number of style issues:

   a) you have loops that look like this:
{code}
          for (...) {
             if (something) {
                ... stuff ...
                continue;
             }
             ... other stuff ...
             break;
          }
{code}
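This is slightly perverse and is akin to using goto statements: the loop only 
keeps iterating by way of the continue, because the body otherwise ends in an 
unconditional break.  Much better is this: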
{code}
          for (...) {
             if (something) {
                ... stuff ...
             } else {
                ... other stuff ...
                break;
             }
          }
{code}
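
To make the point concrete, here is a minimal sketch of the two loop shapes 
side by side.  The class name, method names and sample data are made up for 
illustration and are not taken from the patch; the only claim is that both 
methods compute the same result, and that the second makes the exit condition 
visible in the structure of the loop.

```java
import java.util.Arrays;
import java.util.List;

// Hypothetical example class; nothing here comes from MAHOUT-918.patch.
class LoopStyle {

    // Flagged form: "continue" inside the if, unconditional "break" at the end.
    // Returns {sum of the leading positive values, index where the loop stopped or -1}.
    static int[] withContinue(List<Integer> values) {
        int sum = 0;
        int stopIndex = -1;
        for (int i = 0; i < values.size(); i++) {
            int v = values.get(i);
            if (v > 0) {
                sum += v;        // "stuff"
                continue;
            }
            stopIndex = i;       // "other stuff", reached only when v <= 0
            break;
        }
        return new int[] {sum, stopIndex};
    }

    // Preferred form: the same logic written with if/else.
    static int[] withElse(List<Integer> values) {
        int sum = 0;
        int stopIndex = -1;
        for (int i = 0; i < values.size(); i++) {
            int v = values.get(i);
            if (v > 0) {
                sum += v;        // "stuff"
            } else {
                stopIndex = i;   // "other stuff"
                break;
            }
        }
        return new int[] {sum, stopIndex};
    }

    public static void main(String[] args) {
        List<Integer> values = Arrays.asList(3, 4, -1, 5);
        System.out.println(Arrays.toString(withContinue(values)));
        System.out.println(Arrays.toString(withElse(values)));
    }
}
```

Both calls in main print the same pair, so the rewrite is purely a 
readability change and does not alter behavior.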

                
> Implement SGD based classifiers using MapReduce
> -----------------------------------------------
>
>                 Key: MAHOUT-918
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-918
>             Project: Mahout
>          Issue Type: New Feature
>          Components: Classification
>    Affects Versions: 0.6
>            Reporter: issei yoshida
>         Attachments: MAHOUT-918.patch
>
>
> Implement SGD based classifiers (Logistic Regression, Adaptive Logistic 
> regression and Passive-Aggressive) using MapReduce.
> They are implemented using Iterative Parameter Mixtures algorithm which is 
> referred to in the following papers.
> http://research.google.com/pubs/pub36948.html
> http://aclweb.org/anthology-new/N/N10/N10-1069.pdf
> http://books.nips.cc/papers/files/nips22/NIPS2009_0345.pdf


        
