[jira] Updated: (MAHOUT-145) PartialData mapreduce Random Forests

Deneche A. Hakim (JIRA) Wed, 19 Aug 2009 10:37:39 -0700

     [ 
https://issues.apache.org/jira/browse/MAHOUT-145?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]


Deneche A. Hakim updated MAHOUT-145:
------------------------------------

    Attachment: partial_August_19.patch

*Preparation for mahout 0.2*

* moving to Hadoop 0.20.0 API: 
 ** org.apache.mahout.df.mapred.* contains the code compatible with Hadoop 
0.19.1
 ** org.apache.mahout.df.mapreduce.* will contain the code that uses Hadoop 
0.20.0 API 
 ** the in-mem implementation has been converted to 0.20.0 and is working
 ** the partial implementation still need a looot of work to do, but should be 
better (or more likely with better bugs)

> PartialData mapreduce Random Forests
> ------------------------------------
>
>                 Key: MAHOUT-145
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-145
>             Project: Mahout
>          Issue Type: New Feature
>          Components: Classification
>            Reporter: Deneche A. Hakim
>            Priority: Minor
>         Attachments: partial_August_10.patch, partial_August_13.patch, 
> partial_August_15.patch, partial_August_17.patch, partial_August_19.patch, 
> partial_August_2.patch, partial_August_9.patch
>
>
> This implementation is based on a suggestion by Ted:
> "modify the original algorithm to build multiple trees for different portions 
> of the data. That loses some of the solidity of the original method, but 
> could actually do better if the splits exposed non-stationary behavior."

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Updated: (MAHOUT-145) PartialData mapreduce Random Forests

Reply via email to