We should probably have an option to down-sample large clusters to make the PDF computation faster.
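
For illustration only, here is a rough sketch of what such an option might do. It is plain Python rather than anything from the actual code base, and the names `downsample` and `max_points` are made up for this example. It uses reservoir sampling (Algorithm R), so a reducer could stream through a cluster's points once, keep a uniform sample of bounded size, and still know how many points it saw in total so that counts can be rescaled afterwards.

```python
import random


def downsample(points, max_points, seed=None):
    """Return (sample, n_seen) for a stream of points.

    Hypothetical helper: keeps a uniform random sample of at most
    max_points items using reservoir sampling (Algorithm R), in one
    pass and O(max_points) memory. n_seen is the total number of
    points observed, so callers can rescale by n_seen / len(sample).
    """
    rng = random.Random(seed)
    reservoir = []
    n_seen = 0
    for i, point in enumerate(points):
        n_seen = i + 1
        if i < max_points:
            # Fill the reservoir with the first max_points items.
            reservoir.append(point)
        else:
            # Replace a random slot with probability max_points / (i + 1).
            j = rng.randint(0, i)  # inclusive on both ends
            if j < max_points:
                reservoir[j] = point
    return reservoir, n_seen


if __name__ == "__main__":
    # A "large cluster" of 10,000 points reduced to 100 observations.
    sample, n_seen = downsample(range(10000), max_points=100, seed=1)
    weight = n_seen / len(sample)  # each kept point stands in for this many
    print(len(sample), n_seen, weight)
```

Small clusters pass through unchanged, since the reservoir never fills. Whether rescaling by `n_seen / len(sample)` is enough depends on how the cluster statistics are accumulated, so treat that part as an assumption.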

On Thu, Feb 24, 2011 at 3:09 PM, Jeff Eastman <[email protected]> wrote:

> Again, if most of your points are being assigned to a single cluster that
> reducer will be bogged down observing them all.
