Sorry about the attachment. see it here. http://yfrog.com/4epicture1pfp
On Fri, Feb 19, 2010 at 1:25 AM, Robin Anil <robin.a...@gmail.com> wrote: > I was trying out SeqAccessSparseVector on Canopy Clustering using Manhattan > distance. I found performance to be really bad. So I profiled it with > Yourkit(Thanks a lot for providing us free license) > > Since i was trying out manhattan distance, there were a lot of A-B which > created a lot of clone operation 5% of the total time > there were also so many A+B for adding a point to the canopy to average. > this was also creating a lot of clone operations. 90% of the total time > > So we definitely needs to improve that.. > > For a small hack. I made the cluster centers RandomAccess Vector. Things > are fast again. I dont know whether to commit or not. But something to look > into in 0.4? > > Robin > > >