[ 
https://issues.apache.org/jira/browse/MAHOUT-30?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12659260#action_12659260
 ] 

Ted Dunning commented on MAHOUT-30:
-----------------------------------


Regarding the question of whether something should be called a Distribution or 
a Sampler, the mathematical terminology is that a distribution is something you 
can sample so the the Distribution terminology would be most compatible that 
way.  The fact that only one method is currently defined is likely a temporary 
thing ... other methods could well be required for later efforts.


On Fri, Dec 26, 2008 at 10:53 AM, Jeff Eastman (JIRA) <[email protected]> wrote:

    ... Some of those were terms Ted introduced from my original port of his R
    example. I'm not hung up but perhaps we should include him in the
    discussion?


> dirichlet process implementation
> --------------------------------
>
>                 Key: MAHOUT-30
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-30
>             Project: Mahout
>          Issue Type: New Feature
>          Components: Clustering
>            Reporter: Isabel Drost
>            Assignee: Jeff Eastman
>         Attachments: jeastman.vcf, MAHOUT-30.patch, MAHOUT-30b.patch, 
> MAHOUT-30c.patch
>
>
> Copied over from original issue:
> > Further extension can also be made by assuming an infinite mixture model. 
> > The implementation is only slightly more difficult and the result is a 
> > (nearly)
> > non-parametric clustering algorithm.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to