Hi Stefan,

 >Os Version?

Fedora Redhat 9

 >Latest nutch cvs version?

.4 Nutch from download section

 >Do you had update the db?

I followed the instructions from the tutorial page.  I did a subset of the
DMOZ file.  Roughly 1300 pages.

 >Do you had indexed you fetch result?

Yes...it sat and seemed to crawl all 1300

 >How big is the folder size of your index?

9.4 megs

 .>.. you already fetch so do you... ?
> s1=`ls -d segments/2* | tail -1`
 >bin/nutch updatedb db $s1
 >bin/nutch analyze db 5
> bin/nutch index $s1 (!!)
 >bin/nutch dedup segments dedup.tmp

I did everything but the analyze db 5, but then I did the index and deletion
of the duplicates.

> Can you please post the log messages from catalina?

 Jun 27, 2004 6:25:20 PM org.apache.coyote.http11.Http11Protocol init
INFO: Initializing Coyote HTTP/1.1 on port 8080
Starting service Tomcat-Standalone
Apache Tomcat/4.1.30
Jun 27, 2004 6:25:21 PM org.apache.struts.util.PropertyMessageResources
<init>
INFO: Initializing, config='org.apache.struts.util.LocalStrings',
returnNull=tru
e
Jun 27, 2004 6:25:21 PM org.apache.struts.util.PropertyMessageResources
<init>
INFO: Initializing, config='org.apache.struts.action.ActionResources',
returnNul
l=true
Jun 27, 2004 6:25:22 PM org.apache.struts.util.PropertyMessageResources
<init>
INFO: Initializing,
config='org.apache.webapp.admin.ApplicationResources', retur
nNull=true
Jun 27, 2004 6:25:23 PM org.apache.coyote.http11.Http11Protocol start
INFO: Starting Coyote HTTP/1.1 on port 8080
Jun 27, 2004 6:25:23 PM org.apache.jk.common.ChannelSocket init
INFO: JK2: ajp13 listening on /0.0.0.0:8009
Jun 27, 2004 6:25:23 PM org.apache.jk.server.JkMain start
INFO: Jk running ID=0 time=1/48
config=/usr/local/tomcat/conf/jk2.properties
040627 182541 11 creating new bean
040627 182541 11 loading
file:/usr/local/tomcat/webapps/ROOT/WEB-INF/classes/nut
ch-default.xml
040627 182541 11 opening segment indexes in /nutch/segments
040627 182541 11 query request from 129.21.61.201
040627 182541 11 query: http
040627 182541 11 total hits: 0
040627 182541 11 done displaying hits
[EMAIL PROTECTED] logs]# more foo
Jun 27, 2004 6:25:20 PM org.apache.coyote.http11.Http11Protocol init
INFO: Initializing Coyote HTTP/1.1 on port 8080
Starting service Tomcat-Standalone
Apache Tomcat/4.1.30
Jun 27, 2004 6:25:21 PM org.apache.struts.util.PropertyMessageResources
<init>
INFO: Initializing, config='org.apache.struts.util.LocalStrings',
returnNull=true
Jun 27, 2004 6:25:21 PM org.apache.struts.util.PropertyMessageResources
<init>
INFO: Initializing, config='org.apache.struts.action.ActionResources',
returnNull=true
Jun 27, 2004 6:25:22 PM org.apache.struts.util.PropertyMessageResources
<init>
INFO: Initializing,
config='org.apache.webapp.admin.ApplicationResources', returnNull=true
Jun 27, 2004 6:25:23 PM org.apache.coyote.http11.Http11Protocol start
INFO: Starting Coyote HTTP/1.1 on port 8080
Jun 27, 2004 6:25:23 PM org.apache.jk.common.ChannelSocket init
INFO: JK2: ajp13 listening on /0.0.0.0:8009
Jun 27, 2004 6:25:23 PM org.apache.jk.server.JkMain start
INFO: Jk running ID=0 time=1/48
config=/usr/local/tomcat/conf/jk2.properties
040627 182541 11 creating new bean
040627 182541 11 loading
file:/usr/local/tomcat/webapps/ROOT/WEB-INF/classes/nutch-default.xml
040627 182541 11 opening segment indexes in /nutch/segments
040627 182541 11 query request from 129.21.61.201
040627 182541 11 query: http
040627 182541 11 total hits: 0
040627 182541 11 done displaying hits

Thanks for the help.

Jason



-------------------------------------------------------
This SF.Net email sponsored by Black Hat Briefings & Training.
Attend Black Hat Briefings & Training, Las Vegas July 24-29 - 
digital self defense, top technical experts, no vendor pitches, 
unmatched networking opportunities. Visit www.blackhat.com
_______________________________________________
Nutch-general mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/nutch-general

Reply via email to