Can anyone answer this question? I see in the Hits class that there's a boolean totalIsExact attribute, but it becomes false only when per-site deduplication occurs during the search. I also see that, underneath Nutch, Lucene fetches the documents for only the top hits.
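
To make the distinction between "top hits" and "total matches" concrete, here is a toy sketch in Java. It is not Lucene or Nutch code: `TopHitsSketch`, `collect`, and `Result` are hypothetical names, and it assumes a score greater than zero marks a match. It keeps only the k best hits in a min-heap while still counting every match, so the exact total costs one increment per matching document and only the top k would ever need to be loaded.

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.List;
import java.util.PriorityQueue;

public class TopHitsSketch {
    /** Hypothetical result holder: exact match count plus the k best doc ids. */
    public static final class Result {
        public final int totalMatches;
        public final List<Integer> topDocs;

        Result(int totalMatches, List<Integer> topDocs) {
            this.totalMatches = totalMatches;
            this.topDocs = topDocs;
        }
    }

    /**
     * Assumption for this sketch: scores[doc] > 0 means doc matched the query.
     * Every match is counted, but at most k doc ids are retained.
     */
    public static Result collect(float[] scores, int k) {
        // Min-heap ordered by score, so the weakest retained hit is at the head.
        PriorityQueue<Integer> heap =
            new PriorityQueue<>(Comparator.comparingDouble((Integer d) -> scores[d]));
        int total = 0;
        for (int doc = 0; doc < scores.length; doc++) {
            if (scores[doc] <= 0f) {
                continue; // not a match
            }
            total++; // exact count: one increment per matching doc
            if (k <= 0) {
                continue; // caller asked for the count only
            }
            if (heap.size() < k) {
                heap.add(doc);
            } else if (scores[doc] > scores[heap.peek()]) {
                heap.poll(); // evict the weakest of the current top k
                heap.add(doc);
            }
        }
        List<Integer> top = new ArrayList<>(heap);
        top.sort((a, b) -> Float.compare(scores[b], scores[a])); // best first
        return new Result(total, top);
    }

    public static void main(String[] args) {
        float[] scores = {0.2f, 0f, 0.9f, 0.5f, 0f, 0.7f};
        Result r = collect(scores, 2);
        System.out.println(r.topDocs + " of " + r.totalMatches);
    }
}
```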
But does Nutch/Lucene return exact match counts for every query? How does this scale to very large indexes?

Thanks!
DaveG

-----Original Message-----
From: Gaulin, Mark [mailto:[EMAIL PROTECTED]
Sent: Wednesday, October 05, 2005 2:04 PM
To: nutch-dev@lucene.apache.org
Subject: Q about "exact" match counts?

Hi

I was wondering if nutch returns exact match counts or estimated match counts, or if this behavior was configurable. The reason I ask is that on a very large index I'd rather get the top 10 results right and get them quickly and not spend time getting the total number of documents that match exactly right. (I assume that's why Google reports totals as "1 to 10 of about 1,000,000")

Is this something that nutch/lucene can do?

Thanks
Mark