On Jun 11, 2010, at 6:04 PM, Bhupesh Bansal wrote: > How you doing? Heard finally moving away from Solaris and moving to linux :) > Hope things are going well for you !
HP apparently doesn't want us to eval their hardware (at least, by their non response), so at this rate we aren't. :( Maybe they are afraid I'll make it break. ;) [I'll likely stick to Solaris on the NN and JT due to much more sane large page support. That really needs to get fixed in the Linux kernel.] > I think I found the source of my problems, The issue is in Amazon EC2 when I > start my cluster (1 namenode, 16 datanodes) datanodes are not able to talk > to namenode at all (I tried telnet from datanode to namenode) and it gets > fixed progressively and magically in about 30-40 mins when all of them to be > able to talk and hence the safemode taking 40 mins. Oh, weird. I have no practical experience with EC2, so can't really offer any guidance. Tom or someone else might be able to tho.
