Now that we have some code in place for clustering, I think it would be cool to put together some examples/demos of real world problems. Things like clustering text (perhaps we can use the wikipedia download or the reuters download that Lucene contrib/benchmark uses) or clustering other pieces of data.

We could setup a demo area of code and use Lucene's analysis code to create document vectors.

Ideas and/or thoughts or volunteers?

Cheers,
Grant

Reply via email to