Jonathan Ariel skrev:
Smart idea, but it won't help me. I have almost 50 categories and eventually I would like to "filter" not just on category but maybe also on language, etc. Karl: what do you mean by measure the distance between the term vectors and cluster them in real time?
I mean exactly what I say, that if your subsets are small enough you could evalute the cosine coefficient and group documents accordingly.
2 million documents is however way to much data to do that in real time. I would probably create one index for each "filter" you want to use. karl --------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]