Hi, I have a list of URLS I'm crawling, and I want to crawl the frequently. I crawl the list initially using ./nutch crawl urls -dir testcrawl -depth 1 and it works fine.
My question is, what is the best way to update directory testcrawl? >From what it seems, I can't directly write to testcrawl during the crawl. I have to crawl in a separate directory and then merge them? Is this correct? Thanks for the help, George ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nutch-general mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/nutch-general
