As to point <2>, the only way I was able to deal with this was by using a TopDocs, which does have a max score. But in that case, I don't believe you can limit the number of hits examined.
I've just got to ask... Why do you (jafarim) want to fiddle with the threshold? How is this going to benefit the user over and above just getting the first N < 100 docs from a Hits object? They're sorted already in relevancy order. Yonik's point that scores aren't comparable across queries is well taken and should give you pause. A clear statement of what you are trying to accomplish from the user's perspective will allow folks to give you much more useful responses..... Erick On 4/22/07, Yonik Seeley <[EMAIL PROTECTED]> wrote:
On 4/22/07, jafarim <[EMAIL PROTECTED]> wrote: > > Be aware that > > score thresholds don't work well in general since scores aren't really > > comparable from one query to another. > > > What is I normalize the scores in such a manner that they become between 0 > and 1? Two issues with that: 1) You never *gain* information by normalizing in this manner. If non-normalized scores aren't directly comparable, then neither will normalized scores be. 2) To normalize by dividing by the max score, you need to know the max score. As hits are being collected in the HitCollector, the max score is not yet known. -Yonik --------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]