Tim Patton wrote:
Thanks, that's exactly what I was thinking.  Do you have any recommendations
on maximum index size (obviously we'd be testing ourselves, but it's good to
get an idea)?

Searches tend to get too slow somewhere between 10M and 100M pages.

Using a sorted index (IndexSorter & searcher.max.hits) can improve the situation dramatically but may not be appropriate for all applications. For a discussion of this feature, see:

http://www.mail-archive.com/[email protected]/msg06423.html

and

http://www.mail-archive.com/nutch-dev%40lucene.apache.org/msg01950.html
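
To illustrate why a sorted index combined with a hit limit can help, here is a rough sketch in Python. It is not Nutch code: the function name, the boost values, and the document ids are all made up for illustration, and it assumes the index is ordered by a static per-document score so that the best documents come first. Under that assumption, stopping after a fixed number of hits still returns the highest-scored matches.

```python
# Illustrative sketch only: not Nutch code. All names and values are hypothetical.

def search(postings, max_hits=-1):
    """Collect matching doc ids in index order, stopping after max_hits.

    postings: iterable of doc ids that match the query, in index order.
    max_hits: stop after this many hits; a non-positive value means no limit.
    """
    hits = []
    for doc_id in postings:
        hits.append(doc_id)
        if 0 < max_hits <= len(hits):
            break  # early termination: the rest of the postings are never read
    return hits


# Static per-document scores (made-up values), keyed by doc id.
boost = {0: 0.2, 1: 0.9, 2: 0.5, 3: 0.7, 4: 0.1}

# Unsorted index: matches come back in crawl order.
unsorted_matches = [0, 1, 2, 3, 4]

# Sorted index: the same matches, ordered so the highest boost comes first.
sorted_matches = sorted(unsorted_matches, key=lambda d: boost[d], reverse=True)

top_unsorted = search(unsorted_matches, max_hits=2)  # whatever was crawled first
top_sorted = search(sorted_matches, max_hits=2)      # the two highest-boost docs
```

The trade-off is that only the first max_hits matches are ever considered, so a document with a low static score but a very strong match for the query is never seen. That is one reason the approach may not suit every application.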

Doug

