Hi Doug,

Did you ever resolve your 0.8 vs 0.7 crawling performance question? I'm
running into a similar problem.

We wound up dramatically increasing the number of threads, which seemed to help solve the bandwidth utilization problem. With Nutch 0.7 we were running about 200 threads per crawler, and with Nutch 0.8 it's more like 2000+ threads...though you have to reduce the thread stack size in this type of configuration.

-- Ken
--
Ken Krugler
Krugle, Inc.
+1 530-210-6378
"Find Code, Find Answers"

Reply via email to