Thanks for your reply, Julien. For the record, there are 126M URLs and they are all from unique hosts; no host is repeated here. I'm partitioning by host, with 10 URLs per host (in the first round it's not going to have any effect, but it will in the next rounds). Could that be the problem? In a moment I will run some tests again and try jstack to find out whether the time is going to partitioning or selection.
Thanks for everything.

