Hi,

I have a list of URLs I'm crawling, and I want to crawl them
frequently.  I crawl the list initially using  ./nutch crawl urls -dir
testcrawl -depth 1 and it works fine.

My question is, what is the best way to update the testcrawl directory?
From what I can tell, I can't write directly to testcrawl during the
crawl.  It seems I have to crawl into a separate directory and then
merge the two.  Is this correct?

Thanks for the help,
George


_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general
