I understand that part. What I'm unclear on is if there is any ranking or ordering of the points in each cluster before they are limited. In other words, are the points in each cluster random ordered? Or ordered alphabetically by the document id or filename? Or ordered by some calculation as to how they contributed mathematically to the formation of the cluster?

Thanks,

Terry



On 3/18/14, 9:41 PM, Suneel Marthi wrote:
Its the max. no. of points to include from each cluster in the clusterdump. If 
not specified all points would be included.





On Tuesday, March 18, 2014 11:25 PM, Terry Blankers <[email protected]> wrote:
Hi all,

Can someone please answer a quick question about the --samplePoints
parameter in the clusterdump utility? I understand it specifies the
number of points returned per cluster. But are the points per cluster
ordered or ranked in any way before this truncation occurs?

Thanks,

Terry

Reply via email to