What are reasonable hardware specifications for a Hadoop node?
Can we document this somewhere (maybe in the wiki as HowToConfigureHardware?)

Obviously this will be a moving target, but some guidance about how much
CPU vs. memory vs. disk space is typical would be helpful.

As one datapoint, we are running some boxes that are 4 core, 64-bit @
2GHz machines with 4GB of memory with [I think] 2 x 750GB disks.  I
think if I could I'd put 4 x 750GB disks in this box.  I believe this
configuration is basically the same as what came up in Yahoo!'s recent
sort benchmark.

Other datapoints anyone?

And what about, say on the namenode?  People talk about it being a
memory bottleneck, but ours is underutilized.

Should we start a wiki page about this?

   -John Heidemann

Reply via email to