Same question here. When I (re)run the script, I still see URLs that are supposed to be filtered out, and I'm not sure how to fix it...
On Friday, December 16, 2011, Christopher Gross <[email protected]> wrote:
> http://wiki.apache.org/nutch/Crawl
> This script no longer works. See:
>
> echo "----- Index (Step 5 of $steps) -----"
> $NUTCH_HOME/bin/nutch index crawl/NEWindexes crawl/crawldb crawl/linkdb \
>   crawl/segments/*
>
> The "index" call doesn't exist... so what does this line get replaced
> with? Is there an updated runbot.sh script? Has anyone created a new
> one that will work? I've done some changes on it, but I just don't
> know what to do for this part.
>
> Thanks!
>
> -- Chris

