What I really would like is a way to pass in the
location of the config files (e.g. nutch-default.xml,
regex-urlfilter.txt, etc.) as an argument to the nutch
script, so that I can have multiple config files (each
for a different site I wish to crawl).
Create a file mynutch.sh:
NUTCH_CONF_DIR=$1; export NUTCH_CONF_DIR
shift;
echo using NUTCH_CONF_DIR = $NUTCH_CONF_DIR
echo bin/nutch $@
bin/nutch $@
-------------------------------------------------------
This SF.Net email is sponsored by the 'Do More With Dual!' webinar happening
July 14 at 8am PDT/11am EDT. We invite you to explore the latest in dual
core and dual graphics technology at this free one hour event hosted by HP,
AMD, and NVIDIA. To register visit http://www.hp.com/go/dualwebinar
_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general