Same question here, while (re)running the script, I see URLs that are
supposed to be filtered out. Not sure how to make it right...

On Friday, December 16, 2011, Christopher Gross <[email protected]> wrote:
> http://wiki.apache.org/nutch/Crawl
> This script no longer works.  See:echo "----- Index (Step 5 of $steps)
> -----"$NUTCH_HOME/bin/nutch index crawl/NEWindexes crawl/crawldb
> crawl/linkdb \   crawl/segments/*
> The "index" call doesn't exist....so what does this line get
> replacedwith?  Is there an updated runbot.sh script?  Has anyone
> created a newone that will work?  I've done some changes on it, but I
> just don'tknow what to do for this part.
> Thanks!
>
> -- Chris
>

Reply via email to