Thanks for your reply, Julien.
For the record, there are 126M URLs and they all come from unique hosts;
no host is repeated. I'm partitioning by host (in the first round it will
have no effect, but it will in later rounds), with 10 URLs per host.
Could that be the problem?
In a moment I will run some tests again with jstack to find out whether
the problem is in partitioning or in selection.

Thanks for everything.


