Hi there,
 
I found some weird behavior in Nutch when I use depth = 10.
 
 For example,
 
 "
 bin/nutch crawl url1 -dir crawl2 -depth 10 >& crawl2.log
 "
 
 I ran a crawl of one single web site with depth 10.
 It seems it can't finish.
 
 There are only 7 subdirectories under segments/; if the
 crawl had finished successfully, there should be 10.
 
 I also checked the log: it ends while fetching a website
 and doesn't go any further.
 
 Is there a depth limit in Nutch?
 
 Thanks,
 
 Michael
 
 
 
                



                


_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general
