Hello Fred,
You can inspect a Lucene index with Luke (http://www.getopt.org/luke/),
or from the commandline (see
https://issues.apache.org/jira/browse/NUTCH-330 to install the patch).
We also experienced some issues with Hadoop on WinXP/Nutch 0.8, you
might want to look at https://issues.apache.org/jira/browse/NUTCH-266
HTH,
Renaud
Fred Tyre wrote:
I have installed nutch 0.8 on a Windows XP machine.
I hacked my way through the crawling/indexing and feel like I am right on
the verge of getting this working.
However, I cannot get any search results via the web page.
Is there any way I can verify the indexes that were generated on the command
line?
When I run "bin/nutch readlinkdb ..." nutch generates a text file with 4,454
lines of links and anchor text.
So up to that point it should be working (hopefully).
Please help.
Thanks.
Here is the Tomcat Log...
2006-08-01 17:47:31,656 INFO NutchBean - creating new bean
2006-08-01 17:47:31,671 INFO NutchBean - opening indexes in
file:/C:/Program%20Files/Apache%20Software%20Foundation/Tomcat%205.5/webapps
/nutch-0.8/WEB-INF/classes/crawl/indexes
2006-08-01 17:47:31,765 INFO Configuration - found resource
common-terms.utf8 at
file:/C:/Program%20Files/Apache%20Software%20Foundation/Tomcat%205.5/webapps
/nutch-0.8/WEB-INF/classes/common-terms.utf8
2006-08-01 17:47:31,796 INFO NutchBean - opening segments in
file:/C:/Program%20Files/Apache%20Software%20Foundation/Tomcat%205.5/webapps
/nutch-0.8/WEB-INF/classes/crawl/segments
2006-08-01 17:47:31,828 INFO SummarizerFactory - Using the first summarizer
extension found: Basic Summarizer
2006-08-01 17:47:31,828 INFO NutchBean - opening linkdb in
file:/C:/Program%20Files/Apache%20Software%20Foundation/Tomcat%205.5/webapps
/nutch-0.8/WEB-INF/classes/crawl/linkdb
2006-08-01 17:47:31,843 INFO NutchBean - query request from 127.0.0.1
2006-08-01 17:47:31,859 INFO NutchBean - query: forums
2006-08-01 17:47:31,859 INFO NutchBean - lang: en
2006-08-01 17:47:31,890 INFO NutchBean - searching for 20 raw hits
2006-08-01 17:47:31,984 INFO NutchBean - total hits: 0
2006-08-01 17:50:30,343 INFO NutchBean - query request from 127.0.0.1
2006-08-01 17:50:30,343 INFO NutchBean - query: the
2006-08-01 17:50:30,343 INFO NutchBean - lang: en
2006-08-01 17:50:30,343 INFO NutchBean - searching for 20 raw hits
2006-08-01 17:50:30,343 INFO NutchBean - total hits: 0
2006-08-01 17:50:35,390 INFO NutchBean - query request from 127.0.0.1
2006-08-01 17:50:35,390 INFO NutchBean - query: publishing
2006-08-01 17:50:35,390 INFO NutchBean - lang: en
2006-08-01 17:50:35,390 INFO NutchBean - searching for 20 raw hits
2006-08-01 17:50:35,390 INFO NutchBean - total hits: 0
Sincerely,
Fred
<><><><><><><><><><><><><><><><><><
Fred Tyre
Information Services
Heartland Communications, Inc.
515-574-2147
[EMAIL PROTECTED]
<><><><><><><><><><><><><><><><><><
--
Renaud Richardet
COO America
Wyona Inc. - Open Source Content Management - Apache Lenya
office +1 857 776-3195 mobile +1 617 230 9112
renaud.richardet <at> wyona.com http://www.wyona.com