Thanks, that's helpful. But if I understand it correctly, although the FAQ entry shows how to point to a different config directory, you're limited to one alternate configuration at a time, since you have to set the location in the NUTCH_CONF_DIR environment variable.
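If the nutch script reads NUTCH_CONF_DIR every time it starts, as the FAQ entry suggests, then setting the variable per invocation might get around that limit. Below is a rough sketch of a wrapper along those lines. The function name run_with_conf and the paths are made up for illustration and are not part of Nutch, and I haven't checked how the script itself handles the variable.

```shell
#!/bin/sh
# Hypothetical helper (not part of Nutch): run one command with a
# site-specific config directory exported as NUTCH_CONF_DIR.
# Assumption: the nutch script honors NUTCH_CONF_DIR when it starts.
run_with_conf() {
    conf_dir=$1
    shift
    # Refuse to run if the config directory does not exist.
    if [ ! -d "$conf_dir" ]; then
        echo "config directory not found: $conf_dir" >&2
        return 1
    fi
    # env sets the variable for this one command only, so the
    # calling shell's environment is left unchanged.
    env NUTCH_CONF_DIR="$conf_dir" "$@"
}

# Example calls (paths and arguments are placeholders):
#   run_with_conf /path/to/siteA/conf bin/nutch <command> <args>
#   run_with_conf /path/to/siteB/conf bin/nutch <command> <args>
```

Each call gets its own config directory, so several sites could be crawled from the same install without editing the script.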
What I really would like is a way to pass in the location of the config files (e.g. nutch-default.xml, regex-urlfilter.txt, etc.) as an argument to the nutch script, so that I can keep multiple sets of config files (one for each site I wish to crawl). I found a somewhat clumsy way to accomplish this: I modified the nutch script so that it prepends the current directory to the CLASSPATH, and I run a copy of the script from the directory that holds my config files. This way the script looks in the current directory first and picks up my site-specific config files.

By the way, sorry for the extra post - I haven't used a mailing list in a while.

--- Juho Mäkinen <[EMAIL PROTECTED]> wrote:

> Take a look into Nutch Wiki FAQ here:
> http://wiki.apache.org/nutch/FAQ
> And find the Q/A for "How can I force fetcher to use
> custom nutch-config?"
>
> - Juho Mäkinen, http://www.juhonkoti.net
>
> On 7/8/05, Raymond Creel <[EMAIL PROTECTED]> wrote:
> > I'm just getting started with Nutch. Does someone
> > know how I may be able to get the nutch command-line
> > script to load different
> > nutch-default.xml/nutch-site.xml files than what is in
> > the nutch/conf directory? I want to be able to run
> > nutch at different sites with different startup
> > configurations.
> >
> > Thanks,
> > Raymond Creel

_______________________________________________
Nutch-general mailing list
https://lists.sourceforge.net/lists/listinfo/nutch-general
