Libsvm has a few I can check in
On May 22, 2011 4:36 PM, "Lance Norskog" <goks...@gmail.com> wrote:
>
> What is a good regression test for this? Not a unit test, but
> something that demonstrates the algorithms in action at the amount of
> data where they become useful?
>
> Preferably from a small dataset.
>
> On 5/21/11, Ted Dunning <ted.dunn...@gmail.com> wrote:
> > On Sat, May 21, 2011 at 4:25 PM, Hector Yee <hector....@gmail.com>
wrote:
> >
> >> Sure, or I can wait till you submit patches before working on the next
> >> one?
> >>
> >
> > I think that submit == commit.
> >
> > But in any case, don't wait for anything. Find ways forward. We are in
the
> > middle of a release cycle right now so nothing new is going to be
committed
> > for a little while (another week, possibly).
> >
> >
> >> How would the github repo work? I just clone the apache git version and
> >> check it in there?
> >>
> >
> > Yes. Exactly. And if you want me to help rebasing to track trunk, give
me
> > a committer bit. That won't be very necessary, of course, while trunk
is
> > frozen.
> >
> > Then periodically, you can use [git diff --no-prefix trunk] to dump a
patch
> > that can be added to the JIRA. That will allow non-git users to track
> > progress as well.
> >
> >
> >
> >>
> >> On Sun, May 22, 2011 at 3:41 AM, Ted Dunning <ted.dunn...@gmail.com>
> >> wrote:
> >>
> >> > Hector,
> >> >
> >> > You are working on a variety of things here that have
interdependencies.
> >> >
> >> > What would you think about a github repo where you can keep track of
> >> > them
> >> > with multiple branches and we can all avoid problems with patches not
> >> > applying.
> >> >
> >> > If you like, I can help out keeping your branches up to date relative
to
> >> > trunk.
> >> >
> >> > On Sat, May 21, 2011 at 1:54 AM, Hector Yee (JIRA) <j...@apache.org>
> >> > wrote:
> >> >
> >> > >
> >> > > [
> >> > >
> >> >
> >>
https://issues.apache.org/jira/browse/MAHOUT-703?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13037289#comment-13037289
> >> > ]
> >> > >
> >> > > Hector Yee commented on MAHOUT-703:
> >> > > -----------------------------------
> >> > >
> >> > > Note: This patch requires 702 for the OnlineBaseTest.
> >> > >
> >> > > > Implement Gradient machine
> >> > > > --------------------------
> >> > > >
> >> > > > Key: MAHOUT-703
> >> > > > URL:
> >> https://issues.apache.org/jira/browse/MAHOUT-703
> >> > > > Project: Mahout
> >> > > > Issue Type: New Feature
> >> > > > Components: Classification
> >> > > > Affects Versions: 0.6
> >> > > > Reporter: Hector Yee
> >> > > > Priority: Minor
> >> > > > Labels: features
> >> > > > Fix For: 0.6
> >> > > >
> >> > > > Attachments: MAHOUT-703.patch
> >> > > >
> >> > > > Original Estimate: 72h
> >> > > > Remaining Estimate: 72h
> >> > > >
> >> > > > Implement a gradient machine (aka 'neural network) that can be
used
> >> for
> >> > > classification or auto-encoding.
> >> > > > It will just have an input layer, identity, sigmoid or tanh
hidden
> >> > layer
> >> > > and an output layer.
> >> > > > Training done by stochastic gradient descent (possibly mini-batch
> >> > later).
> >> > > > Sparsity will be optionally enforced by tweaking the bias in the
> >> hidden
> >> > > unit.
> >> > > > For now it will go in classifier/sgd and the auto-encoder will
wrap
> >> it
> >> > in
> >> > > the filter unit later on.
> >> > >
> >> > > --
> >> > > This message is automatically generated by JIRA.
> >> > > For more information on JIRA, see:
> >> > http://www.atlassian.com/software/jira
> >> > >
> >> >
> >>
> >>
> >>
> >> --
> >> Yee Yang Li Hector
> >> http://hectorgon.blogspot.com/ (tech + travel)
> >> http://hectorgon.com (book reviews)
> >>
> >
>
>
> --
> Lance Norskog
> goks...@gmail.com