hey Jack,

> In my project, I re-crawl the website every time, and add a URL
> dedup listener to the crawl job. I mean that when Nutch finishes
> crawling the website, URL dedup follows.

ok, that's fine.

I have another question: Nutch crawls to a different depth each time
you tell it to crawl.
How do you notice when it restarts from the beginning? Maybe I have to
supply the old segments as an argument?

thanks.

ciao,
Marco


_______________________________________________
Nutch-general mailing list
https://lists.sourceforge.net/lists/listinfo/nutch-general
