hey Jack,

> In my project, I re-crawl the website every time, and add a URL
> dedup listener to the crawl job. I mean, when Nutch finishes
> crawling the website, URL dedup follows.

ok, that's fine.

I have another question: Nutch crawls to a different depth every time
you tell it to crawl.
How do you notice when it restarts from the beginning? Maybe I have to
supply the old segments as an argument?

thanks.

ciao,
Marco
