Hi,

We're running a Hadoop cluster on 4 nodes. Our primary purpose is to provide a distributed storage solution for internal applications here at TellyTopia Inc.

Our cluster consists of non-identical nodes (one with 1 TB, two with 3 TB, and one with 60 GB). While copying data to HDFS, we noticed that the node with 60 GB of storage ran out of disk space, and even the balancer couldn't rebalance because the cluster had stopped.

My questions are:

1. Is Hadoop suitable for clusters with non-identical nodes?
2. Is there any way to balance the nodes automatically?
3. Why does the Hadoop cluster stop when one node runs out of disk space?

Any further input is appreciated!

Cheers,
Deepak
TellyTopia Inc.