On Sun, Jan 16, 2011 at 8:39 AM, Mag Gam <magaw...@gmail.com> wrote:

> I am curious now...
>
> If you have a cluster the size of 10, what should the heartbeat be set
> as? What about 100, 1000?
>

The heartbeat interval auto-tunes based on the size of the cluster. I've never
seen anyone tune these settings (yet).


>
>
> I too am interested in tuning documentation.  For example, how much
> memory should we allocate to JVM? How much memory for namenode? etc...
>
>
My rough guide (with a decent fudge factor built in) is 1GB of RAM on the NN
per million files. This is only a rule of thumb, since the amount of RAM
taken by a file depends on a lot of factors (number of blocks, length of
filename, etc.), but it should give you a decent idea.

Konstantin Shvachko did some better analysis on this a few months back:

http://www.usenix.org/publications/login/2010-04/openpdfs/shvachko.pdf

His number is 0.6GB per 1M files, but I'm not sure whether that includes a
"safety factor".
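
To make the two rules of thumb concrete, here is a minimal sketch that turns
a file count into a rough NN heap range. The function and constant names are
made up for illustration (they are not Hadoop APIs), and the only inputs are
the 1GB and 0.6GB per million files figures above, so treat the result as a
starting point and not as a measurement.

```python
# Rule-of-thumb figures from this thread, in GB of NameNode heap per
# million files. Both are rough estimates, not measured values.
GB_PER_MILLION_FILES_ROUGH = 1.0  # rough guide with a fudge factor built in
GB_PER_MILLION_FILES_PAPER = 0.6  # figure quoted from the Shvachko article


def estimate_nn_heap_gb(num_files, gb_per_million_files):
    """Return an estimated NameNode heap size in GB for num_files files.

    Hypothetical helper for illustration only; it scales linearly with the
    file count and ignores block counts and filename lengths.
    """
    if num_files < 0:
        raise ValueError("num_files must be non-negative")
    return num_files / 1_000_000 * gb_per_million_files


if __name__ == "__main__":
    # Print the lower (paper) and upper (rough guide) estimates side by side.
    for files in (10_000_000, 50_000_000, 100_000_000):
        low = estimate_nn_heap_gb(files, GB_PER_MILLION_FILES_PAPER)
        high = estimate_nn_heap_gb(files, GB_PER_MILLION_FILES_ROUGH)
        print(f"{files:>11,d} files: roughly {low:.0f} GB to {high:.0f} GB")
```

If your files are large (many blocks each) or have long names, lean toward
the higher figure.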

-Todd
-- 
Todd Lipcon
Software Engineer, Cloudera
