IndexOutOfBoundsException in RandomSeedGenerator.buildRandom()

Shannon Quinn Fri, 18 Jun 2010 13:00:23 -0700

Hi all,

Thanks once more for everyone's help so far, it's been extremelyfruitful. I'm about 98% of the way finished with my first sprint, butunfortunately there is a single error on my second-to-last line of code.

Right after performing an eigen-decomposition using theDistributedLanczosSolver, I feed the outputs directly into the KMeansutility, RandomSeedGenerator, in order to create random clustercentroids for a given K. Unfortunately, during that buildRandom() methodcall, I hit an index out of bounds exception, and it seems to be anoff-by-1 problem (for k=3, the arrays generated are only of length 2).

More detail to be found here:http://spectrallyclustered.wordpress.com/2010/06/18/sprint-1-so-very-close/

I think part of the problem is due to a lack of understanding of theLanczosSolver process. I do know that the eigenvectors are returned asrows in a matrix, in which case the data points I need to feed to KMeansare the columns. How does the desiredRank parameter fit in when it'sreturning a row matrix? The rule of thumb I'm using is that # ofclusters = # of eigenvectors, is there any way to enforce this heuristicexplicitly?

Any insights here would be greatly appreciated; I've posted a patch withmy latest code on JIRA. Thanks so much!


Regards,
Shannon

IndexOutOfBoundsException in RandomSeedGenerator.buildRandom()

Reply via email to