Hunter Peress <[EMAIL PROTECTED]> writes: > is it efficient and feasible to use lucene to do full text > comparisions. eg : take an entire text thats reasonably large ( eg > more than 10 words) and find the result set within the lucene search > index that is statistically similar with all the text.
Yes, if the query is a BooleanQuery, then all the clauses are essentially weighted with tf.idf weighting. There are of course more sophisticated term weightings, some of which can be handled be extending the Similarity class. Ian --------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]