[ 
https://issues.apache.org/jira/browse/MAHOUT-4?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12576890#action_12576890
 ] 

Ankur commented on MAHOUT-4:
----------------------------

Thanks for your comment. A few of my replies below:-
> Maybe you might ..
Will make these changes in the next patch update.

> ... - how many cluster numbers do you expect ...?
Well typically I would expect a user:cluster ratio of 1000:1. So for 1 million 
users, 1000 clusters would be created.

In main method, a sample user-story matrix is provided which can be changed to 
experiment. However if required I can write a small unit test case to produce 
randomnly generated user-story matrix but am not sure if that will help better.

> I know EM as ...
I like the idea of general EM framework. Will definitely try to change the code 
so that it reflect EM more generically as suggested.



> Simple prototype for Expectation Maximization (EM)
> --------------------------------------------------
>
>                 Key: MAHOUT-4
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-4
>             Project: Mahout
>          Issue Type: New Feature
>            Reporter: Ankur
>         Attachments: Mahout_EM.patch
>
>
> Create a simple prototype implementing Expectation Maximization - EM that 
> demonstrates the algorithm functionality given a set of (user, click-url) 
> data.
> The prototype should be functionally complete and should serve as a basis for 
> the Map-Reduce version of the EM algorithm.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to