Re: nutch questions

elangovan anbalahan Fri, 12 Dec 2008 18:30:43 -0800

Hi there..

I am assuming that you have succesfully configured nutch and are able to
crawl websites.

Before i suggest you any solution , let me know the following;
1) Have you deployed nutch-XX.war on tomcat ? ( XX-means nutch version no.)
2)After deployment , you have to configure nutch-site.xml inside
WEB-INF/classes folder, to tell tomcat , there to look for crawled data.

If you have done this let me know.

On Fri, Dec 12, 2008 at 6:42 PM, Peter W. <[email protected]>wrote:

> Hello,
>
> I'm new to nutch and have successfully configured the fetching application
> but had some questions about its tomcat search component:
>
> a. should indexes be stored under the webapps dir?
> b. can these segments be read with a Luke type application?
> c. are the pages being stored as html? if so how do you filter out tags
> with an analyzer?
> d. is it possible to only check for http status code 200's
> e. how do you customize the search results templates?
>
> Thanks,
>
> Peter
>

Re: nutch questions

Reply via email to