[
https://issues.apache.org/jira/browse/MAHOUT-145?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Deneche A. Hakim updated MAHOUT-145:
------------------------------------
Attachment: partial_August_13.patch
*Preparing the code for GSoC deadline*
* DONE: move rf.RFUtils.storeWritable() to rf.ref.tools.Describe, becomes
private
* DONE: rename rf.mapred.partial.InterResults.loadForest/storeForest to
load/store
* DONE: delete rf.mapred.partial.Step0Job and the corresponding tests
* DONE: delete rf.ref.examples.DataSplit
* DONE: DefaultTreeBuilder uses OptIgSplit by default
** DONE: remove unnecessary calls to DefaultTreeBuilder.setIgSplit()
> PartialData mapreduce Random Forests
> ------------------------------------
>
> Key: MAHOUT-145
> URL: https://issues.apache.org/jira/browse/MAHOUT-145
> Project: Mahout
> Issue Type: New Feature
> Components: Classification
> Reporter: Deneche A. Hakim
> Priority: Minor
> Attachments: partial_August_10.patch, partial_August_13.patch,
> partial_August_2.patch, partial_August_9.patch
>
>
> This implementation is based on a suggestion by Ted:
> "modify the original algorithm to build multiple trees for different portions
> of the data. That loses some of the solidity of the original method, but
> could actually do better if the splits exposed non-stationary behavior."
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.