I understand that part. What I'm unclear on is if there is any ranking
or ordering of the points in each cluster before they are limited. In
other words, are the points in each cluster random ordered? Or ordered
alphabetically by the document id or filename? Or ordered by some
calculation as to how they contributed mathematically to the formation
of the cluster?
Thanks,
Terry
On 3/18/14, 9:41 PM, Suneel Marthi wrote:
Its the max. no. of points to include from each cluster in the clusterdump. If
not specified all points would be included.
On Tuesday, March 18, 2014 11:25 PM, Terry Blankers <[email protected]> wrote:
Hi all,
Can someone please answer a quick question about the --samplePoints
parameter in the clusterdump utility? I understand it specifies the
number of points returned per cluster. But are the points per cluster
ordered or ranked in any way before this truncation occurs?
Thanks,
Terry