Pe 19.01.2012 05:18, Vikas Pandya a scris:
Hello,
I have successfully created vectors from reading my existing Solr Index. Then
created sequenceFile and mahout clusters from it. As I understand that
currently solr and mahout clustering aren't integrated, what's the best way to
represent mahout clusters to the user? Mine is a search application which
renders results by querying solr index. Now I need to incorporate Mahout
created clusters in the result. While Solr-Mahout integration isn't there yet,
what's the best alternative way to represent this info?
Thanks,
The only thing I can think of is to render the results yourself by
reading the clusters. It depends very much on what information are you
trying to extract and present to the user. I can think of about two
things that you can find out:
- similar documents to the ones provided by a Solr search (by getting
the cluster to which they belong and getting the documents).
- documents that have top terms that match the search query
You can find out good examples on how to do this by looking at the
ClusterDumper utility that reads and dumps clusters.
Hope this helps,
--
Ioan Eugen Stan
http://ieugen.blogspot.com