Hi all, We have a hbase cluster with 10 region servers running HBase 0.90.0 + CDH3. We're now importing big data into HBase.
During the process, 2 servers crashed, but after restaring them, they're no longer assigned with any region, while regions on other servers keep splitting when more data inserted. >From the master log, we can see the periodical messages like: 2011-02-18 16:09:35,067 DEBUG org.apache.hadoop.hbase.master.HMaster: Not running balancer because processing dead regionserver(s): [zcl.local,60020,1297996817352, qics.local,60020,1297919358488, Docete.local,60020,1297919410096, liym.local,60020,1297919445796, zcl.local,60020,1297919367472] zcl.local and qics.local are the machines we have restared, other 2 machine have kept running without restarting and are actually still serving regions. >From the shell status: 10 servers, 5 dead, 10.1000 average Load Why are there dead servers? And how to clear them so we could start balancer? Thanks, Yi
