I have installed nutch 0.8 on a Windows XP machine.

I hacked my way through the crawling/indexing and feel like I am right on
the verge of getting this working.

However, I cannot get any search results via the web page.

Is there any way I can verify the indexes that were generated on the command
line?

When I run "bin/nutch readlinkdb ..." nutch generates a text file with 4,454
lines of links and anchor text.

So up to that point it should be working (hopefully).

Please help.

Thanks.

Here is the Tomcat Log...

2006-08-01 17:47:31,656 INFO  NutchBean - creating new bean
2006-08-01 17:47:31,671 INFO  NutchBean - opening indexes in
file:/C:/Program%20Files/Apache%20Software%20Foundation/Tomcat%205.5/webapps
/nutch-0.8/WEB-INF/classes/crawl/indexes
2006-08-01 17:47:31,765 INFO  Configuration - found resource
common-terms.utf8 at
file:/C:/Program%20Files/Apache%20Software%20Foundation/Tomcat%205.5/webapps
/nutch-0.8/WEB-INF/classes/common-terms.utf8
2006-08-01 17:47:31,796 INFO  NutchBean - opening segments in
file:/C:/Program%20Files/Apache%20Software%20Foundation/Tomcat%205.5/webapps
/nutch-0.8/WEB-INF/classes/crawl/segments
2006-08-01 17:47:31,828 INFO  SummarizerFactory - Using the first summarizer
extension found: Basic Summarizer
2006-08-01 17:47:31,828 INFO  NutchBean - opening linkdb in
file:/C:/Program%20Files/Apache%20Software%20Foundation/Tomcat%205.5/webapps
/nutch-0.8/WEB-INF/classes/crawl/linkdb
2006-08-01 17:47:31,843 INFO  NutchBean - query request from 127.0.0.1
2006-08-01 17:47:31,859 INFO  NutchBean - query: forums
2006-08-01 17:47:31,859 INFO  NutchBean - lang: en
2006-08-01 17:47:31,890 INFO  NutchBean - searching for 20 raw hits
2006-08-01 17:47:31,984 INFO  NutchBean - total hits: 0
2006-08-01 17:50:30,343 INFO  NutchBean - query request from 127.0.0.1
2006-08-01 17:50:30,343 INFO  NutchBean - query: the
2006-08-01 17:50:30,343 INFO  NutchBean - lang: en
2006-08-01 17:50:30,343 INFO  NutchBean - searching for 20 raw hits
2006-08-01 17:50:30,343 INFO  NutchBean - total hits: 0
2006-08-01 17:50:35,390 INFO  NutchBean - query request from 127.0.0.1
2006-08-01 17:50:35,390 INFO  NutchBean - query: publishing
2006-08-01 17:50:35,390 INFO  NutchBean - lang: en
2006-08-01 17:50:35,390 INFO  NutchBean - searching for 20 raw hits
2006-08-01 17:50:35,390 INFO  NutchBean - total hits: 0

Sincerely,
Fred

><><><><><><><><><><><><><><><><><><
   Fred Tyre
   Information Services
   Heartland Communications, Inc.
   515-574-2147
   [EMAIL PROTECTED]
><><><><><><><><><><><><><><><><><><




-------------------------------------------------------------------------
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT & business topics through brief surveys -- and earn cash
http://www.techsay.com/default.php?page=join.php&p=sourceforge&CID=DEVDEV
_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general

Reply via email to