Okay, I think I have done a good crawl. When I run the stats command, I
get this:

[EMAIL PROTECTED] nutch-nightly]# bin/nutch readdb crawl/crawldb -stats
060130 175317 CrawlDb statistics start: crawl/crawldb
060130 175317 parsing file:/nutch_binaries/nutch-nightly/conf/nutch-default.xml
060130 175318 parsing file:/nutch_binaries/nutch-nightly/conf/mapred-default.xml
060130 175318 parsing file:/nutch_binaries/nutch-nightly/conf/nutch-site.xml
060130 175318 parsing file:/nutch_binaries/nutch-nightly/conf/nutch-default.xml
060130 175318 parsing file:/nutch_binaries/nutch-nightly/conf/nutch-site.xml
060130 175318 Running job: job_y60768
060130 175318 parsing file:/nutch_binaries/nutch-nightly/conf/nutch-default.xml
060130 175319 parsing file:/nutch_binaries/nutch-nightly/conf/mapred-default.xml
060130 175319 parsing /tmp/nutch/mapred/local/localRunner/job_y60768.xml
060130 175319 parsing file:/nutch_binaries/nutch-nightly/conf/nutch-site.xml
060130 175319  map 0%
060130 175320 crawl/crawldb/current/part-00000/data:0+1958
060130 175320 crawl/crawldb/current/part-00000/data:0+1958
060130 175320 reduce > reduce
060130 175320  reduce 100%
060130 175320 Job complete: job_y60768
060130 175320 Statistics for CrawlDb: crawl/crawldb
060130 175320 TOTAL urls:       20
060130 175320 avg score:        1.339
060130 175320 max score:        1.555
060130 175320 min score:        1.0
060130 175320 retry 0:  20
060130 175320 status 1 (DB_unfetched):  1
060130 175320 status 2 (DB_fetched):    18
060130 175320 status 3 (DB_gone):       1
060130 175320 CrawlDb statistics: done

Now I start Tomcat from the crawl dir and run a search, but I get
"0" hits. What's going on?
Can someone help?

Andy


_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general
