On Fri, Jan 20, 2012 at 4:01 PM, Vikas Pandya <[email protected]> wrote:
> From the example below, solr search results should be clustered in some
> following way
> list all the items which have matching RiskLevels e.g.
>
>
> Cluster 1:
> Title          RiskLevel1          RiskLevel2         RiskLevel3
> abc            High                     Medium             Low
> xyz            High                      Medium            High
> def            Low                        Medium           High
>
> Cluster 2:
> Title          RiskLevel1          RiskLevel2         RiskLevel3
> omn            Low                     Medium             Low
> yui            Low                      Medium            High
> bnm            Medium             Medium           High
>
> Though I have a feeling I don't need to use Mahout clustering for this, I am
> still trying to hook in mahout for this since we have more clustering
> requirements in the pipeline to cluster based on other features (attributes
> of objects).
>

You only have 27 unique risklevel combinations. You could just sort by
or more risklevels to get a sense of the data.

If you have more attributes then you could indeed look into clustering,

Cheers,

Frank

> Any thoughts?
>
> ________________________________
> From: Vikas Pandya <[email protected]>
> To: Frank Scholten <[email protected]>; "[email protected]"
> <[email protected]>
> Sent: Thursday, January 19, 2012 11:05 AM
>
> Subject: Re: How to present mahout cluster in combination with Solr results
>
> Hi Frank,
>
> Thanks for the link. That was useful. It's still bit unclear on how he built
> his index. are we saying, we index  clusterId,clusterSize and clusterLable
> in the same index (where other data is indexed)? So one index will have two
> sets of Solr documents in it?  one containing cluster info?
>
> My requirement again; I have bunch of db columns which are being indexed.
> e.g.
> Title,             RiskLevel1, RiskLevel2,RiskLevel3 etc
> Title1        High             Medium      Low
>
> Current requirement is to cluster documents based on their riskLevels and
> NOT the title.
>
> Thanks,
>
>
> ________________________________
> From: Frank Scholten <[email protected]>
> To: [email protected]; Vikas Pandya <[email protected]>
> Sent: Thursday, January 19, 2012 4:24 AM
> Subject: Re: How to present mahout cluster in combination with Solr results
>
> Hi Vikas,
>
> I suggest indexing the cluster label, cluster size and
> cluster-document mappings so you can use that information to build a
> tag cloud of your data. Checkout this presentation
> http://java.dzone.com/videos/configuring-mahout-clustering
>
> Cheers,
>
> Frank
>
> On Thu, Jan 19, 2012 at 4:18 AM, Vikas Pandya <[email protected]> wrote:
>> Hello,
>>
>> I have successfully created vectors from reading my existing Solr Index.
>> Then created sequenceFile and mahout clusters from it. As I understand that
>> currently solr and mahout clustering aren't integrated, what's the best way
>> to represent mahout clusters to the user? Mine is a search application which
>> renders results by querying solr index. Now I need to incorporate Mahout
>> created clusters in the result. While Solr-Mahout integration isn't there
>> yet, what's the best alternative way to represent this info?
>>
>> Thanks,
>

Reply via email to