[ 
https://issues.apache.org/jira/browse/MAHOUT-145?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12751842#action_12751842
 ] 

Deneche A. Hakim edited comment on MAHOUT-145 at 9/6/09 2:52 AM:
-----------------------------------------------------------------

bq.    *  TODO: test the code on a Hadoop 0.20.0 cluster (EC2)

Looks like I'll have to wait until Hadoop 0.20.1 to be able to test on 
EC2. After creating my own AMI (with a lot of pain, being a noob), I ran 
into the following bug: 
[HADOOP-5921|http://issues.apache.org/jira/browse/HADOOP-5921]



> PartialData mapreduce Random Forests
> ------------------------------------
>
>                 Key: MAHOUT-145
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-145
>             Project: Mahout
>          Issue Type: New Feature
>          Components: Classification
>            Reporter: Deneche A. Hakim
>            Priority: Minor
>         Attachments: partial_August_10.patch, partial_August_13.patch, 
> partial_August_15.patch, partial_August_17.patch, partial_August_19.patch, 
> partial_August_2.patch, partial_August_24.patch, partial_August_27.patch, 
> partial_August_31.patch, partial_August_9.patch
>
>
> This implementation is based on a suggestion by Ted:
> "modify the original algorithm to build multiple trees for different portions 
> of the data. That loses some of the solidity of the original method, but 
> could actually do better if the splits exposed non-stationary behavior."

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
