Hi there,
I found some weird behavior in Nutch when I use depth =
10.
For example,
"
bin/nutch crawl url1 -dir crawl2 -depth 10 >& crawl2.log
"
I crawled one single web site with depth 10, and it
seems the crawl can't finish.
There are only 7 subdirectories under segments/; if the crawl
had finished successfully, there should be 10.
I also checked the log: it ends in the middle of fetching a
website and never goes on.
Is there a depth limitation in Nutch?
Thanks,
Michael