Cagdas Gerede wrote:
We will have 5 million files, each with 20 blocks of 2MB. With the minimum replication factor of 3, that comes to 300 million block replicas, storing 600TB. At ~10TB/node, this means a 60-node system. Do you think these numbers are suitable for Hadoop DFS?
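For reference, a quick sketch of the arithmetic above (the class name is hypothetical; the constants are just the numbers from the message):

    public class CapacityEstimate {
        public static void main(String[] args) {
            long files = 5000000L;       // 5 million files
            long blocksPerFile = 20L;    // 20 blocks per file
            long blockSizeMB = 2L;       // 2MB blocks
            long replication = 3L;       // minimum replication factor

            long uniqueBlocks = files * blocksPerFile;        // 100 million
            long blockReplicas = uniqueBlocks * replication;  // 300 million
            long totalTB = blockReplicas * blockSizeMB / 1000000L; // ~600TB (decimal TB)
            long nodes = totalTB / 10L;  // at ~10TB per node -> 60 nodes

            System.out.println(uniqueBlocks + " blocks, " + blockReplicas
                + " replicas, " + totalTB + "TB, ~" + nodes + " nodes");
        }
    }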
Why are you using such small blocks? A larger block size would decrease the strain on Hadoop (fewer blocks for the namenode to track), but perhaps you have reasons?
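If a larger block size is an option, it can be set client-wide via the dfs.block.size property (in bytes) or per file at creation time. A minimal sketch, assuming a standard Hadoop client; the path, class name, and 64MB value are placeholders:

    import org.apache.hadoop.conf.Configuration;
    import org.apache.hadoop.fs.FSDataOutputStream;
    import org.apache.hadoop.fs.FileSystem;
    import org.apache.hadoop.fs.Path;

    public class BlockSizeExample {
        public static void main(String[] args) throws Exception {
            Configuration conf = new Configuration();
            // Raise the default block size for files created by this client.
            conf.setLong("dfs.block.size", 64L * 1024 * 1024);
            FileSystem fs = FileSystem.get(conf);

            // The block size can also be given per file at creation time:
            // create(path, overwrite, bufferSize, replication, blockSize).
            FSDataOutputStream out = fs.create(new Path("/tmp/example"),
                true, 4096, (short) 3, 64L * 1024 * 1024);
            out.close();
        }
    }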
Doug