Re: Crawling after a period of time

k-team Fri, 27 May 2005 01:52:29 -0700

hey Jack,

> In my project, I really re-crawl the website everytime, and add one
> url dedup listener to the crawl job. I mean when nutch finishes the
> crawl web site, url dedup follows.


ok, that's fine.

I have another doubt: nutch crawls at different depths every time you
order to it to crawl.
how you notice when it restarts from the beginning? maybe I have to
supply as argument old segments?

thanks.

ciao,
Marco

Re: Crawling after a period of time

Reply via email to