Hunter Peress <[EMAIL PROTECTED]> writes:

> is it efficient and feasible to use lucene to do full text
> comparisions. eg :  take an entire text thats reasonably large ( eg
> more than 10 words) and find the result set within the lucene search
> index that  is statistically similar with all the text.

Yes, if the query is a BooleanQuery, then all the clauses are
essentially weighted with tf.idf weighting.  There are of course more
sophisticated term weightings, some of which can be handled be
extending the Similarity class.

Ian



---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to