Can you be more specific about which code you are asking about? The ball k-means implementation provides a capability somewhat like this, but perhaps in a more clearly defined way.
On Tue, Dec 3, 2013 at 9:34 AM, Prabhakar Srinivasan < [email protected]> wrote: > Hello! > Can someone point me to some explanatory documentation for Outlier > Detection & Removal in Clustering in Mahout. I am unable to understand the > internal mechanism of outlier detection just by reading the Javadoc: > clusterClassificationThreshold Is a clustering strictness / outlier removal > parameter. Its value should be between 0 and 1. Vectors having pdf below > this value will not be clustered. > > What does the pdf represent? > > Thanks > Prabhakar >
