Can anyone answer this question?  I see in the Hits class that there's a
boolean totalIsExact attribute, but this becomes false only when
deduplication (per site) occurs during the search.  And I see that
underneath Nutch, Lucene will obtain the documents for only the top
hits.

But does Nutch/Lucene return exact match counts for every query?  How
does this scale to very large indexes?
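
To make the distinction concrete, here is a minimal sketch of what I mean by "exact count, but only the top hits retained". It is my own illustration, not Nutch or Lucene code: the class TopKCounter, its collect method, and the convention that a score <= 0 means "no match" are all hypothetical. It visits every matching document once to keep an exact total, but holds only the k best in a small heap. Whether Nutch/Lucene actually does something like this is exactly what I am asking.

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.List;
import java.util.PriorityQueue;

// Hypothetical illustration only; none of these names come from Nutch or Lucene.
public class TopKCounter {

    /** A matching document id together with its score. */
    public static final class ScoredDoc {
        public final int doc;
        public final float score;
        public ScoredDoc(int doc, float score) { this.doc = doc; this.score = score; }
    }

    /** Exact number of matches plus only the k best-scoring documents. */
    public static final class Result {
        public final int totalHits;
        public final List<ScoredDoc> top;
        Result(int totalHits, List<ScoredDoc> top) { this.totalHits = totalHits; this.top = top; }
    }

    /**
     * Visits every document once. scores[i] is the score of document i;
     * a score <= 0 is assumed to mean "does not match the query".
     */
    public static Result collect(float[] scores, int k) {
        // Min-heap on score: the weakest retained hit sits at the head.
        PriorityQueue<ScoredDoc> heap =
            new PriorityQueue<>(Comparator.comparingDouble((ScoredDoc d) -> d.score));
        int totalHits = 0;
        for (int doc = 0; doc < scores.length; doc++) {
            if (scores[doc] <= 0f) continue;  // not a match
            totalHits++;                       // exact count: one step per match
            if (k <= 0) continue;              // nothing to retain
            if (heap.size() < k) {
                heap.add(new ScoredDoc(doc, scores[doc]));
            } else if (scores[doc] > heap.peek().score) {
                heap.poll();                   // evict the weakest retained hit
                heap.add(new ScoredDoc(doc, scores[doc]));
            }
        }
        // Return the retained hits best-first.
        List<ScoredDoc> top = new ArrayList<>(heap);
        top.sort(Comparator.comparingDouble((ScoredDoc d) -> d.score).reversed());
        return new Result(totalHits, top);
    }
}
```

In this sketch memory stays bounded by k, but the loop still touches every match, so the exact total costs time proportional to the number of matching documents. An estimated count would need some way to stop early, which is the part I do not see in Hits.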

Thanks!
DaveG


-----Original Message-----
From: Gaulin, Mark [mailto:[EMAIL PROTECTED] 
Sent: Wednesday, October 05, 2005 2:04 PM
To: nutch-dev@lucene.apache.org
Subject: Q about "exact" match counts?

Hi,
I was wondering whether Nutch returns exact match counts or estimated match
counts, and whether this behavior is configurable.  The reason I ask is that
on a very large index I'd rather get the top 10 results right and get
them quickly than spend time getting the total number of matching
documents exactly right.  (I assume that's why Google reports totals as
"1 to 10 of about 1,000,000".)  Is this something that Nutch/Lucene can
do?
Thanks
        Mark
