Hi there.. I am assuming that you have succesfully configured nutch and are able to crawl websites.
Before i suggest you any solution , let me know the following; 1) Have you deployed nutch-XX.war on tomcat ? ( XX-means nutch version no.) 2)After deployment , you have to configure nutch-site.xml inside WEB-INF/classes folder, to tell tomcat , there to look for crawled data. If you have done this let me know. On Fri, Dec 12, 2008 at 6:42 PM, Peter W. <[email protected]>wrote: > Hello, > > I'm new to nutch and have successfully configured the fetching application > but had some questions about its tomcat search component: > > a. should indexes be stored under the webapps dir? > b. can these segments be read with a Luke type application? > c. are the pages being stored as html? if so how do you filter out tags > with an analyzer? > d. is it possible to only check for http status code 200's > e. how do you customize the search results templates? > > Thanks, > > Peter >
