How does the depth option work in the 0.8 recrawl script at http://wiki.apache.org/nutch/IntranetRecrawl? I just want to re-index all of the pages currently in the db, without indexing any new pages that those pages might link to. Should I use 0 for this? It seems the fetcher never runs when I use 0, and with anything above zero it starts indexing at a greater depth than what is currently in my crawl db, which is further than I want.

-Chris Stephens
