If I run mahout clustering on lucene vectors, how would I go about getting that 
cluster information back into lucene, in order to use the cluster identifiers 
in field collapsing?

I know I can re-index with the new cluster info, but is there any way to put 
cluster info into an existing index (which also may be non-optimized and quite 
large)?  One way maybe to have a custom field collapsing component that can 
read mahout cluster output.  Any thoughts?

Bob


Reply via email to