Was there a change in behavior between releases or are you just tuning?
On Aug 10, 2006, at 10:08 AM, Kalbande, Manish wrote:
Hi,
I have a cluster of 21 nodes + 1 name node.
To perform "generate" on crawlDB of size 1 Billion urls with 700
million
unfetched, it took more than 12 hours (most of the time was taken
by Map
tasks), while same thing takes close to 1 hour using hadoop 0.4.
I have not changed any configuration, just added the additional
properties which was added in 0.5.
Are there any (new) properties which I can tweak?
Thanks
Manish