[ 
https://issues.apache.org/jira/browse/MAHOUT-60?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12612163#action_12612163
 ] 

Steven Handerson commented on MAHOUT-60:
----------------------------------------

I'm still a newbie at this and am just running your code, but I noticed that
I get high utilization (30-40%) on the early map-reduce tasks, which then
drops to 3% on the fourth map-reduce pass.

Any idea why this would be?  Obviously I'm still playing with the machine's
Hadoop installation parameters (this is an 8-core, 1-chip Sun box).


> Complementary Naive Bayes
> -------------------------
>
>                 Key: MAHOUT-60
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-60
>             Project: Mahout
>          Issue Type: Sub-task
>          Components: Classification
>            Reporter: Robin Anil
>            Assignee: Grant Ingersoll
>            Priority: Minor
>             Fix For: 0.1
>
>         Attachments: MAHOUT-60.patch, MAHOUT-60.patch, MAHOUT-60.patch, 
> twcnb.jpg
>
>
> The focus is to implement an improved text classifier based on this paper 
> http://people.csail.mit.edu/jrennie/papers/icml03-nb.pdf.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
