Allen Wittenauer wrote:


On 8/12/08 12:07 PM, "lohit" <[EMAIL PROTECTED]> wrote:
 - why RAID5?
- If running RAID 5, why is this necessary?
Not absolutely necessary.

    I'd be afraid of the write penalty of RAID5 vs, say, RAID10 or even just
plain RAID1.
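
    To make that write penalty concrete, the usual back-of-the-envelope I/O counting for a small random write goes like this (textbook numbers sketched as a tiny Java program, not measurements):

    // Rough illustration of the small-write penalty: standard I/O counts only.
    public class RaidWritePenalty {
        public static void main(String[] args) {
            // RAID5 small write: read old data block + read old parity,
            // then write new data block + new parity = 4 disk I/Os.
            int raid5 = 2 + 2;
            // RAID1/RAID10 small write: just write both mirrors = 2 disk I/Os.
            int raid10 = 2;
            System.out.println("RAID5  I/Os per small write: " + raid5);
            System.out.println("RAID10 I/Os per small write: " + raid10);
        }
    }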

    For the record, I don't think we have any production systems, except
maybe one, that use any sort of RAID on the name node.

    I'm sure Steve will pop up at some point and explain his reasoning on
this one. ;)


sorry, I've been away.

-The main thing is that the namenode is the single point of failure for the cluster; it's the one to spend the money on, to have hooked up to your pager when it goes offline, and to nurture like it matters.

-Whatever you can do to keep those disks alive matters, and that usually means some RAID backup of the namenode data. Then, though, you have to worry about the RAID controller itself, which is not always above failing either, as a search for the phrase "raid controller failure" will show; there's a sketch of a simpler belt-and-braces alternative after this list.

-Don't bother with duplication on the worker machines, because they are expendable and storage hurts your capital and power budgets.
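
The namenode can also be pointed at more than one storage directory (dfs.name.dir takes a comma-separated list and keeps a copy of the image and edits in each), so even without RAID you can put a second copy on a separate device. Here's a rough sketch of that mirroring idea in plain Java; the paths and file name are made up for illustration:

    import java.io.File;
    import java.io.FileOutputStream;
    import java.io.IOException;

    // Sketch of "keep a second copy of the metadata on a separate device".
    public class MirroredMetadataWrite {
        static void writeCopy(File dir, byte[] data) throws IOException {
            dir.mkdirs();
            try (FileOutputStream out = new FileOutputStream(new File(dir, "fsimage"))) {
                out.write(data);
                out.getFD().sync();   // don't trust the copy until it has hit the disk
            }
        }

        public static void main(String[] args) throws IOException {
            byte[] image = "namespace image bytes".getBytes();
            // two directories on physically separate devices (hypothetical mounts)
            writeCopy(new File("/mnt/disk1/name"), image);
            writeCopy(new File("/mnt/disk2/name"), image);
        }
    }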

One story I've heard of is tracking every disk's history, even as it gets moved around, and giving it a different workload depending on its expected chance of failure. New disks may have more capacity, but during that bedding-in period they're not to be trusted, so for the first couple of months they're used for transient cache data, not important persistent stuff. Once you're happy with them, use them for important data; later on, when their peers from the same batch start failing, it's time to view them as unsafe and downgrade them again. [This is all anecdotal; there's no paper on the topic, though there are some looking at MTBF issues in datacentres.]
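
A toy sketch of that policy, just to pin the idea down; the age threshold, failure-rate cutoff, and names are all invented rather than taken from any real system:

    import java.time.Duration;
    import java.time.Instant;

    // Toy model of the disk lifecycle described above; thresholds are invented.
    public class DiskTrustPolicy {
        enum Role { TRANSIENT_CACHE, IMPORTANT_DATA }

        static Role roleFor(Instant installed, double batchFailureRate, Instant now) {
            Duration age = Duration.between(installed, now);
            if (age.compareTo(Duration.ofDays(60)) < 0) {
                return Role.TRANSIENT_CACHE;   // still bedding in: transient cache data only
            }
            if (batchFailureRate > 0.05) {
                return Role.TRANSIENT_CACHE;   // peers from the same batch are dying: downgrade
            }
            return Role.IMPORTANT_DATA;        // the disk has earned some trust
        }
    }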

-steve



