If I run mahout clustering on lucene vectors, how would I go about getting that cluster information back into lucene, in order to use the cluster identifiers in field collapsing?
I know I can re-index with the new cluster info, but is there any way to put cluster info into an existing index (which also may be non-optimized and quite large)? One way maybe to have a custom field collapsing component that can read mahout cluster output. Any thoughts? Bob
