hey Jack, > In my project, I really re-crawl the website everytime, and add one > url dedup listener to the crawl job. I mean when nutch finishes the > crawl web site, url dedup follows.
ok, that's fine. I have another doubt: nutch crawls at different depths every time you order to it to crawl. how you notice when it restarts from the beginning? maybe I have to supply as argument old segments? thanks. ciao, Marco
