Nutch will pick up the first nutch-site.xml file in the classpath. This
includes anything in the nutch.job file. My guess would be that your
classes directory is not the first in the classpath, that there is
another nutch-site.xml file earlier, possibly in the job file, and it is
picking that up.
Dennis
James Harvey wrote:
Hi,
I'm trying to integrate nutch into a groovy/grails application which runs on
the jetty servlet container. I can't seem to override the nutch-site.xml to
point to my crawl directory. I am putting it in WEB-INF/classes/ directory,
but it doesn't seem to pick it up. Any ideas why? Is this the correct
place to put the nutch-site.xml file to override it for a java web
application. Any help would be much appreciated.
Thanks,
-James