Hi, I am curious to know what the biggest problems are with an extremely large Hadoop cluster. The one I can think of right now is the memory cost of keeping the HDFS metadata in the NameNode. One solution I can imagine is to store the metadata in some other storage implementation, such as a database, although that would have a performance cost. Are there any other solutions, or any other problems with extremely large Hadoop clusters?
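
To make the memory concern concrete, here is a rough back-of-the-envelope sketch. It assumes the commonly quoted rule of thumb that each file, directory, and block takes about 150 bytes of NameNode heap. That figure and the function name are my assumptions for illustration, not measurements from a real cluster.

```python
# Rough NameNode heap estimate.
# BYTES_PER_OBJECT is an assumed rule of thumb, not a measured value; real
# usage depends on the Hadoop version, path lengths, and replication settings.
BYTES_PER_OBJECT = 150


def estimate_namenode_heap_bytes(num_files, blocks_per_file=1, num_dirs=0):
    """Return an approximate heap size in bytes for the given namespace.

    Each file, each block, and each directory counts as one namespace object.
    """
    num_objects = num_files + num_files * blocks_per_file + num_dirs
    return num_objects * BYTES_PER_OBJECT


if __name__ == "__main__":
    for files in (10_000_000, 100_000_000, 1_000_000_000):
        gib = estimate_namenode_heap_bytes(files) / 2**30
        print(f"{files:>13,} files -> ~{gib:.1f} GiB of heap")
```

Under this assumption, 100 million single-block files already need roughly 28 GiB of heap on a single machine, which is why I think the metadata becomes the limiting factor.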
--
Best Regards,
Jeff Zhang