When you run a search from Tomcat, what is written to your logs? Do you see something like the lines below, but pointing to a different path (your correct path)?

NutchBean - opening segments in /usr/local/nutch/build/nutch-0.9-dev/crawl/segments
NutchBean - opening linkdb in /usr/local/nutch/build/nutch-0.9-dev/crawl/linkdb
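
If it helps, here is a minimal sketch (plain Python, not part of Nutch) for pulling those paths out of a Tomcat log so you can compare them with the crawl directory the command-line searcher uses. The pattern is based only on the two sample lines above; the function name opened_paths is hypothetical, and your real log lines may carry extra prefixes such as timestamps.

```python
import re

# Assumed pattern, derived from the two sample log lines only:
#   NutchBean - opening <what> in <path>
OPENING = re.compile(r"NutchBean - opening (.+?) in (\S+)")


def opened_paths(log_lines):
    """Return a dict mapping what was opened (e.g. 'segments') to its path."""
    paths = {}
    for line in log_lines:
        match = OPENING.search(line)  # search() tolerates leading timestamps
        if match:
            paths[match.group(1)] = match.group(2)
    return paths


# Sample input: the two lines quoted in this message.
sample = [
    "NutchBean - opening segments in /usr/local/nutch/build/nutch-0.9-dev/crawl/segments",
    "NutchBean - opening linkdb in /usr/local/nutch/build/nutch-0.9-dev/crawl/linkdb",
]

for what, path in opened_paths(sample).items():
    print(what, "->", path)
```

To run it against a real log, pass an open file object instead of the sample list, for example opened_paths(open(path_to_your_tomcat_log)). If the paths it prints are not the crawl directory you built, the web application is reading from the wrong place.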
Luke (http://www.getopt.org/luke/) will do exactly what you want in terms of finding out about documents indexed in your database, and much more. For your search query times, try using what was suggested here: http://www.mail-archive.com/[email protected]/msg05392.html.

----- Original Message ----
From: Justin Hartman <[EMAIL PROTECTED]>
To: [email protected]
Sent: Friday, December 29, 2006 7:52:11 AM
Subject: Searching via http & statistical data

Hi guys

I have my nutch system working pretty reasonably, I think, and I am quite happy with the way it is fetching, crawling and indexing. I do have a problem, however, in that I cannot figure out how to make the http searches pull data from the index.

Running the searcher command[1] brings up a list of search results; however, when I run the same search from the http side[2] it generates zero results.

I've gone through the nutch tutorials[3+4] as well as tried to implement the faq question[5] that addresses this very issue, but I still get no results.

This current server is running CentOS with Plesk 8.1/Tomcat 5 and Java 1.4.2. Because Plesk does very odd things, I've had to change some of the config values in my tomcat5.conf file, but this change was just re-writing the access path to nutch.

I'm honestly fresh out of ideas and problem-solving and now need to resort to some help from the experts!

I'd also like to ask if there is any way to view any or all of the following information:
1. Documents indexed in the database
2. Search query times

Any help on the above two questions is appreciated.

[1] bin/nutch org.apache.nutch.searcher.NutchBean apache
[2] http://localhost:9080/search.jsp?lang=en&query=apache
[3] http://wiki.apache.org/nutch/NutchTutorial
[4] http://lucene.apache.org/nutch/tutorial8.html
[5] http://wiki.apache.org/nutch/FAQ#head-0c5dd359a76f9ac5ed54f9d81d79130e4c9c3302

--
Regards
Justin Hartman
PGP Key ID: 102CC123
