On Aug 5, 2009, at 6:57 PM, Paul Tomblin wrote:

Hi Paul

> I want to do some specific crawling where I crawl one site with one
> set of urls to accept/reject, then reset to crawl another site with
> another set of urls to accept/reject, etc.

I'm not sure I understand what you mean, but if you want to crawl specific URLs and exclude others, you can use the Black/White URL Filter:
http://issues.apache.org/jira/browse/NUTCH-249

Apply the patch "bw.patch" and read the comment on how to use it.
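
To make the idea of one accept/reject set per site concrete, here is a rough sketch in Python. It is not the code from the patch and not Nutch's API. The rule format ('+' to accept, '-' to reject, first match wins) is only loosely modeled on Nutch's regex URL filter file, and the names SITE_RULES, compile_rules and accept, as well as the example hosts, are made up for illustration.

```python
import re
from urllib.parse import urlparse

# Hypothetical per-site rule sets (assumption, not taken from the patch).
# '+' means accept, '-' means reject; the first matching rule decides.
SITE_RULES = {
    "example.com": [
        r"-\.(gif|jpg|png|css|js)$",
        r"+^http://example\.com/docs/",
    ],
    "example.org": [
        r"-/private/",
        r"+^http://example\.org/",
    ],
}


def compile_rules(lines):
    """Turn '+regex' / '-regex' lines into (accept_flag, compiled_pattern) pairs."""
    rules = []
    for line in lines:
        sign, pattern = line[:1], line[1:]
        if sign not in ("+", "-"):
            raise ValueError("rule must start with '+' or '-': %r" % line)
        rules.append((sign == "+", re.compile(pattern)))
    return rules


COMPILED = {host: compile_rules(lines) for host, lines in SITE_RULES.items()}


def accept(url):
    """Return True if the URL passes the rule set of its own host."""
    host = urlparse(url).hostname
    for allowed, pattern in COMPILED.get(host, []):
        if pattern.search(url):
            return allowed
    # Unknown host, or no rule matched: reject by default.
    return False


if __name__ == "__main__":
    for u in ("http://example.com/docs/intro.html",
              "http://example.com/docs/logo.png",
              "http://example.org/private/notes.html"):
        print(u, accept(u))
```

Because the rules are keyed by host, nothing has to be reset between sites; each URL is checked only against the set that belongs to its own site.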


marko
