Nutch will pick up the first nutch-site.xml file in the classpath. This includes anything in the nutch.job file. My guess would be that your classes directory is not the first in the classpath, that there is another nutch-site.xml file earlier, possibly in the job file, and it is picking that up.

Dennis

James Harvey wrote:
Hi,

I'm trying to integrate nutch into a groovy/grails application which runs on
the jetty servlet container.  I can't seem to override the nutch-site.xml to
point to my crawl directory.  I am putting it in WEB-INF/classes/ directory,
but it doesn't seem to pick it up.  Any ideas why?  Is this the correct
place to put the nutch-site.xml file to override it for a java web
application.  Any help would be much appreciated.

Thanks,
-James

Reply via email to