Okay, I think I have done a good crawl, and when I run the stats command I get this:

[EMAIL PROTECTED] nutch-nightly]# bin/nutch readdb crawl/crawldb -stats
060130 175317 CrawlDb statistics start: crawl/crawldb
060130 175317 parsing file:/nutch_binaries/nutch-nightly/conf/nutch-default.xml
060130 175318 parsing file:/nutch_binaries/nutch-nightly/conf/mapred-default.xml
060130 175318 parsing file:/nutch_binaries/nutch-nightly/conf/nutch-site.xml
060130 175318 parsing file:/nutch_binaries/nutch-nightly/conf/nutch-default.xml
060130 175318 parsing file:/nutch_binaries/nutch-nightly/conf/nutch-site.xml
060130 175318 Running job: job_y60768
060130 175318 parsing file:/nutch_binaries/nutch-nightly/conf/nutch-default.xml
060130 175319 parsing file:/nutch_binaries/nutch-nightly/conf/mapred-default.xml
060130 175319 parsing /tmp/nutch/mapred/local/localRunner/job_y60768.xml
060130 175319 parsing file:/nutch_binaries/nutch-nightly/conf/nutch-site.xml
060130 175319 map 0%
060130 175320 crawl/crawldb/current/part-00000/data:0+1958
060130 175320 crawl/crawldb/current/part-00000/data:0+1958
060130 175320 reduce > reduce
060130 175320 reduce 100%
060130 175320 Job complete: job_y60768
060130 175320 Statistics for CrawlDb: crawl/crawldb
060130 175320 TOTAL urls: 20
060130 175320 avg score: 1.339
060130 175320 max score: 1.555
060130 175320 min score: 1.0
060130 175320 retry 0: 20
060130 175320 status 1 (DB_unfetched): 1
060130 175320 status 2 (DB_fetched): 18
060130 175320 status 3 (DB_gone): 1
060130 175320 CrawlDb statistics: done

Now I start Tomcat from the crawl dir and do a search, and I get 0 hits. What's up with this? Can someone help?

Andy

_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general
