Thanks, that's helpful.  But if I understand what it's
saying, although it shows how to point to a different
config directory, you're limited to one alternate
configuration since you have to set the location in
the NUTCH_CONF_DIR environment variable.  

What I really would like is a way to pass in the
location of the config files (e.g. nutch-default.xml,
regex-urlfilter.txt, etc.) as an argument to the nutch
script, so that I can have multiple config files (each
for a different site I wish to crawl).

I found a somewhat clumsy way to accomplish this, by
modfiying the nutch script so that it prepends the
current directory to the CLASSPATH and running a copy
of the script from the directory that has my config
files.  This way the script first looks in the current
directory and it picks up my site-specific config
files.

By the way, sorry for the extra post - haven't used a
mailing list in awhile.

--- Juho Mäkinen <[EMAIL PROTECTED]> wrote:

> Take a look into Nutch Wiki FAQ here:
> http://wiki.apache.org/nutch/FAQ
> And find the Q/A for "How can I force fetcher to use
> custom nutch-config?"
> 
>  - Juho Mäkinen, http://www.juhonkoti.net
> 
> On 7/8/05, Raymond Creel <[EMAIL PROTECTED]>
> wrote:
> > I'm just getting started with Nutch.  Does someone
> > know how I may be able to get the nutch
> command-line
> > script to load different
> > nutch-default.xml/nutch-site.xml files than what
> is in
> > the nutch/conf directory?  I want to be able to
> run
> > nutch at different sites with different startup
> > configurations.
> > 
> > Thanks,
> > Raymond Creel
> > 
> > 
> > 
> > __________________________________
> > Do you Yahoo!?
> > Read only the mail you want - Yahoo! Mail
> SpamGuard.
> > http://promotions.yahoo.com/new_mail
> >
> 


__________________________________________________
Do You Yahoo!?
Tired of spam?  Yahoo! Mail has the best spam protection around 
http://mail.yahoo.com 


-------------------------------------------------------
This SF.Net email is sponsored by the 'Do More With Dual!' webinar happening
July 14 at 8am PDT/11am EDT. We invite you to explore the latest in dual
core and dual graphics technology at this free one hour event hosted by HP, 
AMD, and NVIDIA.  To register visit http://www.hp.com/go/dualwebinar
_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general

Reply via email to