[
https://issues.apache.org/jira/browse/MAHOUT-716?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13233818#comment-13233818
]
Isabel Drost commented on MAHOUT-716:
-------------------------------------
After not much activity - took a brief look at the patch. Some comments (to be
taken with a grain of salt after I didn't have the cycles to follow the project
as closely as I would have liked in the past months):
You mentioned a forked git repo on github - is it still online?
So far looks like a rather isolated change. Would it make sense to integrate it
with existing classification APIs e.g.
org.apache.mahout.classifier.AbstractVectorClassifier?
Also some more documentation and a usage example for the un-initiated would be
great: In addition to links to one or two publications the implementation is
based on it's always great to have some information on the strengths and
weaknesses of the implemented solution (yes, I know we are doing pretty badly
along these lines with other bits and pieces we have - still would be nice to
have).
> Implement Boosting
> ------------------
>
> Key: MAHOUT-716
> URL: https://issues.apache.org/jira/browse/MAHOUT-716
> Project: Mahout
> Issue Type: New Feature
> Components: Classification
> Affects Versions: 0.5
> Reporter: Hector Yee
> Assignee: Ted Dunning
> Priority: Minor
> Labels: features
> Attachments: MAHOUT-716.patch, MAHOUT-716.patch
>
> Original Estimate: 72h
> Remaining Estimate: 72h
>
> Implement boosting (grad boost variant) with l1-regularization and induction.
> The gradient part is scalable and parallel and the induction part allows
> stochastic hypothesis generation for speed.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators:
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira