>From a GSOC angle, it needn't be done, its upto your mentor to decide. I am interested more in getting this completed and pushed out so that people can really use it. If you can spare time after GSOC and still hang around the community and help in getting this polished, it will be great.
To create your pairwise similarity(0-1 1 means dissimilar) matrix(it can be the other way around as well), see the DistanceMeasure implementations. Creating the pairwise matrix is non trivial from a scalability stand point. A complete spectral clustering package should take an input set of documents, create the matrix and run clustering and output the clusters. To get an idea of your work till now, what are the blocks missing from this ideal package scenario? Robin
