Thanks for the reply! I guess I don't mind using topN as long as I can be assured that I will get ALL of the urls crawled eventually. Do you know if that is a true statement?
-- View this message in context: http://lucene.472066.n3.nabble.com/Large-Shared-Drive-Crawl-tp3781917p3783663.html Sent from the Nutch - User mailing list archive at Nabble.com.