Re: Can we replace namenode machine with some other machine ?

Steve Loughran Thu, 22 Sep 2011 03:29:30 -0700

On 22/09/11 05:42, praveenesh kumar wrote:

Hi all,


Can we replace our namenode machine later with some other machine. ?
Actually I got a new  server machine in my cluster and now I want to make
this machine as my new namenode and jobtracker node ?
Also Does Namenode/JobTracker machine's configuration needs to be better
than datanodes/tasktracker's ??

1. I'd give it lots of RAM - holding data about many files, avoidingswapping, etc.

2. I'd make sure the disks are RAID5, with some NFS-mounted FS that thesecondary namenode can talk to. avoids risk of loss of the index, which,if it happens, renders your filesystem worthless. If I was reallyparanoid I'd have twin raid controllers with separate connections todisk arrays in separate racks, as [Jiang2008] shows that interconnectproblems on disk arrays can be higher than HDD failures.

3. if your central switches are at 10 GbE, consider getting a 10GbE NICand hooking it up directly -this stops the network being the bottleneck,though it does mean the server can have a lot more packets hitting it,so putting more load on it.


4. Leave space for a second CPU and time for GC tuning.

JT's are less important; they need RAM but use HDFS for storage. If yourcluster is small, NN and JT can be run locally. If you do this, set upDNS to have two hostnames to point to same network address. Then if youever split them off, everyone whose bookmark says http://jobtrackerwon't notice

Either way: the NN and the JT are the machines whose availability youcare about. The rest is just a source of statistics you can look at later.


-Steve

[Jiang2008] "Are disks the dominant contributor for storage failures?: Acomprehensive study of storage subsystem failure characteristics". ACMTransactions on Storage.

Re: Can we replace namenode machine with some other machine ?

Reply via email to