>From the example below, solr search results should be clustered in some >following way list all the items which have matching RiskLevels e.g.
Cluster 1: Title RiskLevel1 RiskLevel2 RiskLevel3 abc High Medium Low xyz High Medium High def Low Medium High Cluster 2: Title RiskLevel1 RiskLevel2 RiskLevel3 omn Low Medium Low yui Low Medium High bnm Medium Medium High Though I have a feeling I don't need to use Mahout clustering for this, I am still trying to hook in mahout for this since we have more clustering requirements in the pipeline to cluster based on other features (attributes of objects). Any thoughts? ________________________________ From: Vikas Pandya <[email protected]> To: Frank Scholten <[email protected]>; "[email protected]" <[email protected]> Sent: Thursday, January 19, 2012 11:05 AM Subject: Re: How to present mahout cluster in combination with Solr results Hi Frank, Thanks for the link. That was useful. It's still bit unclear on how he built his index. are we saying, we index clusterId,clusterSize and clusterLable in the same index (where other data is indexed)? So one index will have two sets of Solr documents in it? one containing cluster info? My requirement again; I have bunch of db columns which are being indexed. e.g. Title, RiskLevel1, RiskLevel2,RiskLevel3 etc Title1 High Medium Low Current requirement is to cluster documents based on their riskLevels and NOT the title. Thanks, ________________________________ From: Frank Scholten <[email protected]> To: [email protected]; Vikas Pandya <[email protected]> Sent: Thursday, January 19, 2012 4:24 AM Subject: Re: How to present mahout cluster in combination with Solr results Hi Vikas, I suggest indexing the cluster label, cluster size and cluster-document mappings so you can use that information to build a tag cloud of your data. Checkout this presentation http://java.dzone.com/videos/configuring-mahout-clustering Cheers, Frank On Thu, Jan 19, 2012 at 4:18 AM, Vikas Pandya <[email protected]> wrote: > Hello, > > I have successfully created vectors from reading my existing Solr Index. Then > created sequenceFile and mahout clusters from it. As I understand that > currently solr and mahout clustering aren't integrated, what's the best way > to represent mahout clusters to the user? Mine is a search application which > renders results by querying solr index. Now I need to incorporate Mahout > created clusters in the result. While Solr-Mahout integration isn't there > yet, what's the best alternative way to represent this info? > > Thanks,
