Thanks for the response.

I downloaded Luke and found that the index folder is all setup and working
correctly.
As shown in the logs below, Tomcat can see the crawl folder and the
necessary subfolders.
However, I'm not sure where to configure the bean to make sure that it runs
the correct query parser.
In Luke it defaults to org.apache.lucene.analysis.KeywordAnalyzer.
In the JSPs, I have no idea what the query parser is.

Please help.

Thanks again.

-----Original Message-----
From: Renaud Richardet [mailto:[EMAIL PROTECTED]
Sent: Wednesday, August 02, 2006 8:31 AM
To: [email protected]
Subject: Re: Nutch Install problems


Hello Fred,

You can inspect a Lucene index with Luke (http://www.getopt.org/luke/),
or from the commandline (see
https://issues.apache.org/jira/browse/NUTCH-330 to install the patch).

We also experienced some issues with Hadoop on WinXP/Nutch 0.8, you
might want to look at https://issues.apache.org/jira/browse/NUTCH-266

HTH,
Renaud


Fred Tyre wrote:
> I have installed nutch 0.8 on a Windows XP machine.
>
> I hacked my way through the crawling/indexing and feel like I am right on
> the verge of getting this working.
>
> However, I cannot get any search results via the web page.
>
> Is there any way I can verify the indexes that were generated on the
command
> line?
>
> When I run "bin/nutch readlinkdb ..." nutch generates a text file with
4,454
> lines of links and anchor text.
>
> So up to that point it should be working (hopefully).
>
> Please help.
>
> Thanks.
>
> Here is the Tomcat Log...
>
> 2006-08-01 17:47:31,656 INFO  NutchBean - creating new bean
> 2006-08-01 17:47:31,671 INFO  NutchBean - opening indexes in
>
file:/C:/Program%20Files/Apache%20Software%20Foundation/Tomcat%205.5/webapps
> /nutch-0.8/WEB-INF/classes/crawl/indexes
> 2006-08-01 17:47:31,765 INFO  Configuration - found resource
> common-terms.utf8 at
>
file:/C:/Program%20Files/Apache%20Software%20Foundation/Tomcat%205.5/webapps
> /nutch-0.8/WEB-INF/classes/common-terms.utf8
> 2006-08-01 17:47:31,796 INFO  NutchBean - opening segments in
>
file:/C:/Program%20Files/Apache%20Software%20Foundation/Tomcat%205.5/webapps
> /nutch-0.8/WEB-INF/classes/crawl/segments
> 2006-08-01 17:47:31,828 INFO  SummarizerFactory - Using the first
summarizer
> extension found: Basic Summarizer
> 2006-08-01 17:47:31,828 INFO  NutchBean - opening linkdb in
>
file:/C:/Program%20Files/Apache%20Software%20Foundation/Tomcat%205.5/webapps
> /nutch-0.8/WEB-INF/classes/crawl/linkdb
> 2006-08-01 17:47:31,843 INFO  NutchBean - query request from 127.0.0.1
> 2006-08-01 17:47:31,859 INFO  NutchBean - query: forums
> 2006-08-01 17:47:31,859 INFO  NutchBean - lang: en
> 2006-08-01 17:47:31,890 INFO  NutchBean - searching for 20 raw hits
> 2006-08-01 17:47:31,984 INFO  NutchBean - total hits: 0
> 2006-08-01 17:50:30,343 INFO  NutchBean - query request from 127.0.0.1
> 2006-08-01 17:50:30,343 INFO  NutchBean - query: the
> 2006-08-01 17:50:30,343 INFO  NutchBean - lang: en
> 2006-08-01 17:50:30,343 INFO  NutchBean - searching for 20 raw hits
> 2006-08-01 17:50:30,343 INFO  NutchBean - total hits: 0
> 2006-08-01 17:50:35,390 INFO  NutchBean - query request from 127.0.0.1
> 2006-08-01 17:50:35,390 INFO  NutchBean - query: publishing
> 2006-08-01 17:50:35,390 INFO  NutchBean - lang: en
> 2006-08-01 17:50:35,390 INFO  NutchBean - searching for 20 raw hits
> 2006-08-01 17:50:35,390 INFO  NutchBean - total hits: 0
>
> Sincerely,
> Fred
>
>
>> <><><><><><><><><><><><><><><><><><
>>
>    Fred Tyre
>    Information Services
>    Heartland Communications, Inc.
>    515-574-2147
>    [EMAIL PROTECTED]
>
>> <><><><><><><><><><><><><><><><><><
>>
>
>
>
>
>

--
Renaud Richardet
COO America
Wyona Inc.  -   Open Source Content Management   -   Apache Lenya
office +1 857 776-3195                     mobile +1 617 230 9112
renaud.richardet <at> wyona.com              http://www.wyona.com


Reply via email to