[ 
https://issues.apache.org/jira/browse/MAHOUT-228?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ted Dunning updated MAHOUT-228:
-------------------------------

    Attachment: MAHOUT-228.patch

Now has a working and almost useful version of TrainLogistic command line.  

This command line will solve a simple example case that I am working out for 
the Mahout in Action book:

java -cp<mumble> org.apache.mahout.classifier.sgd.TrainLogistic \
        --passes 100 --rate 50 --lambda 0.001 \
        --input donut.csv --features 21 --output foo \
        --target color --categories 2 \
        --predictors  x y xx xy yy a b c --types n n

I still need to 

- output the model
- change prints into log statements
- build the book-end TestLogistic function
- integrate into the mahout command line driver framework and 
- build a DumpResourceData program.

Otherwise, this is beginning to coalesce.

> Need sequential logistic regression implementation using SGD techniques
> -----------------------------------------------------------------------
>
>                 Key: MAHOUT-228
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-228
>             Project: Mahout
>          Issue Type: New Feature
>          Components: Classification
>            Reporter: Ted Dunning
>             Fix For: 0.4
>
>         Attachments: logP.csv, MAHOUT-228-3.patch, MAHOUT-228.patch, 
> MAHOUT-228.patch, MAHOUT-228.patch, r.csv, sgd-derivation.pdf, 
> sgd-derivation.tex, sgd.csv
>
>
> Stochastic gradient descent (SGD) is often fast enough for highly scalable 
> learning (see Vowpal Wabbit, http://hunch.net/~vw/).
> I often need to have a logistic regression in Java as well, so that is a 
> reasonable place to start.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to