Re: HDFS safemode recovery take more than an hour

Allen Wittenauer Fri, 11 Jun 2010 18:14:40 -0700

On Jun 11, 2010, at 6:04 PM, Bhupesh Bansal wrote:

> How you doing? Heard finally moving away from Solaris and moving to linux :)
> Hope things are going well for you !


HP apparently doesn't want us to eval their hardware (at least, by their non 
response), so at this rate we aren't. :( Maybe they are afraid I'll make it 
break. ;)   [I'll likely stick to Solaris on the NN and JT due to much more 
sane large page support.  That really needs to get fixed in the Linux kernel.]

> I think I found the source of my problems, The issue is in Amazon EC2 when I
> start my cluster (1 namenode, 16 datanodes) datanodes are not able to talk
> to namenode at all (I tried telnet from datanode to namenode) and it gets
> fixed progressively and magically in about 30-40 mins when all of them to be
> able to talk and hence the safemode taking 40 mins.

Oh, weird.  I have no practical experience with EC2, so can't really offer any 
guidance.  Tom or someone else might be able to tho.

Re: HDFS safemode recovery take more than an hour

Reply via email to