Thanks for the response. I downloaded Luke and found that the index folder is set up and working correctly. As shown in the logs below, Tomcat can see the crawl folder and the necessary subfolders. However, I'm not sure where to configure the bean so that it uses the correct query parser. Luke defaults to org.apache.lucene.analysis.KeywordAnalyzer, but I have no idea which query parser the JSPs use.
Please help. Thanks again.

-----Original Message-----
From: Renaud Richardet [mailto:[EMAIL PROTECTED]
Sent: Wednesday, August 02, 2006 8:31 AM
To: [email protected]
Subject: Re: Nutch Install problems

Hello Fred,

You can inspect a Lucene index with Luke (http://www.getopt.org/luke/), or from the commandline (see https://issues.apache.org/jira/browse/NUTCH-330 to install the patch).

We also experienced some issues with Hadoop on WinXP/Nutch 0.8, you might want to look at https://issues.apache.org/jira/browse/NUTCH-266

HTH,
Renaud

Fred Tyre wrote:
> I have installed nutch 0.8 on a Windows XP machine.
>
> I hacked my way through the crawling/indexing and feel like I am right on the verge of getting this working.
>
> However, I cannot get any search results via the web page.
>
> Is there any way I can verify the indexes that were generated on the command line?
>
> When I run "bin/nutch readlinkdb ..." nutch generates a text file with 4,454 lines of links and anchor text.
>
> So up to that point it should be working (hopefully).
>
> Please help.
>
> Thanks.
>
> Here is the Tomcat Log...
>
> 2006-08-01 17:47:31,656 INFO NutchBean - creating new bean
> 2006-08-01 17:47:31,671 INFO NutchBean - opening indexes in file:/C:/Program%20Files/Apache%20Software%20Foundation/Tomcat%205.5/webapps/nutch-0.8/WEB-INF/classes/crawl/indexes
> 2006-08-01 17:47:31,765 INFO Configuration - found resource common-terms.utf8 at file:/C:/Program%20Files/Apache%20Software%20Foundation/Tomcat%205.5/webapps/nutch-0.8/WEB-INF/classes/common-terms.utf8
> 2006-08-01 17:47:31,796 INFO NutchBean - opening segments in file:/C:/Program%20Files/Apache%20Software%20Foundation/Tomcat%205.5/webapps/nutch-0.8/WEB-INF/classes/crawl/segments
> 2006-08-01 17:47:31,828 INFO SummarizerFactory - Using the first summarizer extension found: Basic Summarizer
> 2006-08-01 17:47:31,828 INFO NutchBean - opening linkdb in file:/C:/Program%20Files/Apache%20Software%20Foundation/Tomcat%205.5/webapps/nutch-0.8/WEB-INF/classes/crawl/linkdb
> 2006-08-01 17:47:31,843 INFO NutchBean - query request from 127.0.0.1
> 2006-08-01 17:47:31,859 INFO NutchBean - query: forums
> 2006-08-01 17:47:31,859 INFO NutchBean - lang: en
> 2006-08-01 17:47:31,890 INFO NutchBean - searching for 20 raw hits
> 2006-08-01 17:47:31,984 INFO NutchBean - total hits: 0
> 2006-08-01 17:50:30,343 INFO NutchBean - query request from 127.0.0.1
> 2006-08-01 17:50:30,343 INFO NutchBean - query: the
> 2006-08-01 17:50:30,343 INFO NutchBean - lang: en
> 2006-08-01 17:50:30,343 INFO NutchBean - searching for 20 raw hits
> 2006-08-01 17:50:30,343 INFO NutchBean - total hits: 0
> 2006-08-01 17:50:35,390 INFO NutchBean - query request from 127.0.0.1
> 2006-08-01 17:50:35,390 INFO NutchBean - query: publishing
> 2006-08-01 17:50:35,390 INFO NutchBean - lang: en
> 2006-08-01 17:50:35,390 INFO NutchBean - searching for 20 raw hits
> 2006-08-01 17:50:35,390 INFO NutchBean - total hits: 0
>
> Sincerely,
> Fred
>
> >> <><><><><><><><><><><><><><><><><>< >>
> Fred Tyre
> Information Services
> Heartland Communications, Inc.
> 515-574-2147
> [EMAIL PROTECTED]
> >> <><><><><><><><><><><><><><><><><>< >>
>

--
Renaud Richardet
COO America
Wyona Inc. - Open Source Content Management - Apache Lenya
office +1 857 776-3195
mobile +1 617 230 9112
renaud.richardet <at> wyona.com
http://www.wyona.com
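
For a longer Tomcat log than the excerpt quoted above, it may help to pair each query with its hit count instead of reading the file by eye. The following is only a minimal Python sketch, not part of Nutch: the helper name `summarize` is hypothetical, the sample lines are copied from the log above, and it assumes every "query:" line is eventually followed by its own "total hits:" line. It shows what was searched and what came back, not why the hit count is zero.

```python
import re

# Sample lines copied from the Tomcat log quoted in this thread.
LOG = """\
2006-08-01 17:47:31,843 INFO NutchBean - query request from 127.0.0.1
2006-08-01 17:47:31,859 INFO NutchBean - query: forums
2006-08-01 17:47:31,984 INFO NutchBean - total hits: 0
2006-08-01 17:50:30,343 INFO NutchBean - query: the
2006-08-01 17:50:30,343 INFO NutchBean - total hits: 0
2006-08-01 17:50:35,390 INFO NutchBean - query: publishing
2006-08-01 17:50:35,390 INFO NutchBean - total hits: 0
"""

# Matches "query: <text>" and "total hits: <n>" but not "query request from ...".
LINE = re.compile(r"NutchBean - (query|total hits): (.*)$")


def summarize(log_text):
    """Hypothetical helper: pair each 'query:' line with the next 'total hits:' line."""
    results = []
    current = None  # query text still waiting for its hit count
    for line in log_text.splitlines():
        match = LINE.search(line)
        if not match:
            continue
        key, value = match.group(1), match.group(2).strip()
        if key == "query":
            current = value
        elif current is not None:
            results.append((current, int(value)))
            current = None
    return results


if __name__ == "__main__":
    for query, hits in summarize(LOG):
        print(f"{query!r}: {hits} hits")
```

Run against the excerpt, every query ("forums", "the", "publishing") is paired with 0 hits, which matches the log as posted.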
