[
https://issues.apache.org/jira/browse/MAHOUT-30?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12659260#action_12659260
]
Ted Dunning commented on MAHOUT-30:
-----------------------------------
Regarding the question of whether something should be called a Distribution or
a Sampler, the mathematical terminology is that a distribution is something you
can sample so the the Distribution terminology would be most compatible that
way. The fact that only one method is currently defined is likely a temporary
thing ... other methods could well be required for later efforts.
On Fri, Dec 26, 2008 at 10:53 AM, Jeff Eastman (JIRA) <[email protected]> wrote:
... Some of those were terms Ted introduced from my original port of his R
example. I'm not hung up but perhaps we should include him in the
discussion?
> dirichlet process implementation
> --------------------------------
>
> Key: MAHOUT-30
> URL: https://issues.apache.org/jira/browse/MAHOUT-30
> Project: Mahout
> Issue Type: New Feature
> Components: Clustering
> Reporter: Isabel Drost
> Assignee: Jeff Eastman
> Attachments: jeastman.vcf, MAHOUT-30.patch, MAHOUT-30b.patch,
> MAHOUT-30c.patch
>
>
> Copied over from original issue:
> > Further extension can also be made by assuming an infinite mixture model.
> > The implementation is only slightly more difficult and the result is a
> > (nearly)
> > non-parametric clustering algorithm.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.