Hi Stefan, >Os Version?
Fedora Redhat 9 >Latest nutch cvs version? .4 Nutch from download section >Do you had update the db? I followed the instructions from the tutorial page. I did a subset of the DMOZ file. Roughly 1300 pages. >Do you had indexed you fetch result? Yes...it sat and seemed to crawl all 1300 >How big is the folder size of your index? 9.4 megs .>.. you already fetch so do you... ? > s1=`ls -d segments/2* | tail -1` >bin/nutch updatedb db $s1 >bin/nutch analyze db 5 > bin/nutch index $s1 (!!) >bin/nutch dedup segments dedup.tmp I did everything but the analyze db 5, but then I did the index and deletion of the duplicates. > Can you please post the log messages from catalina? Jun 27, 2004 6:25:20 PM org.apache.coyote.http11.Http11Protocol init INFO: Initializing Coyote HTTP/1.1 on port 8080 Starting service Tomcat-Standalone Apache Tomcat/4.1.30 Jun 27, 2004 6:25:21 PM org.apache.struts.util.PropertyMessageResources <init> INFO: Initializing, config='org.apache.struts.util.LocalStrings', returnNull=tru e Jun 27, 2004 6:25:21 PM org.apache.struts.util.PropertyMessageResources <init> INFO: Initializing, config='org.apache.struts.action.ActionResources', returnNul l=true Jun 27, 2004 6:25:22 PM org.apache.struts.util.PropertyMessageResources <init> INFO: Initializing, config='org.apache.webapp.admin.ApplicationResources', retur nNull=true Jun 27, 2004 6:25:23 PM org.apache.coyote.http11.Http11Protocol start INFO: Starting Coyote HTTP/1.1 on port 8080 Jun 27, 2004 6:25:23 PM org.apache.jk.common.ChannelSocket init INFO: JK2: ajp13 listening on /0.0.0.0:8009 Jun 27, 2004 6:25:23 PM org.apache.jk.server.JkMain start INFO: Jk running ID=0 time=1/48 config=/usr/local/tomcat/conf/jk2.properties 040627 182541 11 creating new bean 040627 182541 11 loading file:/usr/local/tomcat/webapps/ROOT/WEB-INF/classes/nut ch-default.xml 040627 182541 11 opening segment indexes in /nutch/segments 040627 182541 11 query request from 129.21.61.201 040627 182541 11 query: http 040627 182541 11 total hits: 0 040627 182541 11 done displaying hits [EMAIL PROTECTED] logs]# more foo Jun 27, 2004 6:25:20 PM org.apache.coyote.http11.Http11Protocol init INFO: Initializing Coyote HTTP/1.1 on port 8080 Starting service Tomcat-Standalone Apache Tomcat/4.1.30 Jun 27, 2004 6:25:21 PM org.apache.struts.util.PropertyMessageResources <init> INFO: Initializing, config='org.apache.struts.util.LocalStrings', returnNull=true Jun 27, 2004 6:25:21 PM org.apache.struts.util.PropertyMessageResources <init> INFO: Initializing, config='org.apache.struts.action.ActionResources', returnNull=true Jun 27, 2004 6:25:22 PM org.apache.struts.util.PropertyMessageResources <init> INFO: Initializing, config='org.apache.webapp.admin.ApplicationResources', returnNull=true Jun 27, 2004 6:25:23 PM org.apache.coyote.http11.Http11Protocol start INFO: Starting Coyote HTTP/1.1 on port 8080 Jun 27, 2004 6:25:23 PM org.apache.jk.common.ChannelSocket init INFO: JK2: ajp13 listening on /0.0.0.0:8009 Jun 27, 2004 6:25:23 PM org.apache.jk.server.JkMain start INFO: Jk running ID=0 time=1/48 config=/usr/local/tomcat/conf/jk2.properties 040627 182541 11 creating new bean 040627 182541 11 loading file:/usr/local/tomcat/webapps/ROOT/WEB-INF/classes/nutch-default.xml 040627 182541 11 opening segment indexes in /nutch/segments 040627 182541 11 query request from 129.21.61.201 040627 182541 11 query: http 040627 182541 11 total hits: 0 040627 182541 11 done displaying hits Thanks for the help. Jason ------------------------------------------------------- This SF.Net email sponsored by Black Hat Briefings & Training. Attend Black Hat Briefings & Training, Las Vegas July 24-29 - digital self defense, top technical experts, no vendor pitches, unmatched networking opportunities. Visit www.blackhat.com _______________________________________________ Nutch-general mailing list [EMAIL PROTECTED] https://lists.sourceforge.net/lists/listinfo/nutch-general
