I think I was the one who didn't follow. I thought you meant the optimizations to use sparse techniques for the distance computation.
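
To make the sparse idea concrete, here is a minimal sketch of what I had in mind. It is Python rather than Mahout code, every name in it is hypothetical, and it only illustrates the general trick, not the MAHOUT-121 patch. It expands ||x - c||^2 as ||x||^2 - 2 x.c + ||c||^2, so the centroid's squared length can be computed once and reused, and the dot product only visits the point's nonzero entries.

```python
# Hypothetical illustration: a sparse point is a dict {index: value},
# a centroid is a plain dense list of floats.

def squared_norm(sparse_vec):
    """Squared length of a sparse vector; touches only nonzero entries."""
    return sum(v * v for v in sparse_vec.values())


def sparse_dot(sparse_vec, dense_vec):
    """Dot product that iterates only over the sparse vector's nonzeros."""
    return sum(v * dense_vec[i] for i, v in sparse_vec.items())


def squared_distance(point, point_norm2, centroid, centroid_norm2):
    """||x - c||^2 = ||x||^2 - 2 x.c + ||c||^2, with both norms precomputed."""
    return point_norm2 - 2.0 * sparse_dot(point, centroid) + centroid_norm2


point = {0: 1.0, 3: 2.0}            # sparse: 2 nonzeros out of 4 dimensions
centroid = [0.5, 0.0, 1.0, 1.0]     # dense centroid

# Computed once per centroid per iteration, then reused for every point.
centroid_norm2 = sum(c * c for c in centroid)

d2 = squared_distance(point, squared_norm(point), centroid, centroid_norm2)

# Brute-force check over every dimension, for comparison only.
brute = sum((point.get(i, 0.0) - c) ** 2 for i, c in enumerate(centroid))
```

With the cached centroid length, the per-point cost depends on the number of nonzeros in the point rather than on the full dimensionality, at least as long as the centroid lookup by index is cheap.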
Another candidate for the problem is that the centroids may be filling in and becoming dense.

On Mon, Jul 27, 2009 at 9:41 AM, Grant Ingersoll <[email protected]> wrote:

> That explains why Jeff didn't see the slow down with dense vectors.
>>
>
> Not following. The distance calc stuff is irrespective of the type of
> Vector. I was referring to the centroid length square (I think you called
> it the triangle inequality) stuff that Shashikant added on MAHOUT-121. We
> use it for testing convergence, but not for other distance calculations. I
> haven't looked to see if it is applicable yet, but it seems like it should
> be.

--
Ted Dunning, CTO
DeepDyve
