Hi all,

We have a hbase cluster with 10 region servers running HBase 0.90.0 + CDH3.
We're now importing big data into HBase.

During the process, 2 servers crashed, but after restaring them, they're no
longer assigned with any region, while regions on other servers keep
splitting when more data inserted.

>From the master log, we can see the periodical messages like:

2011-02-18 16:09:35,067 DEBUG org.apache.hadoop.hbase.master.HMaster: Not
running balancer because processing dead regionserver(s):
[zcl.local,60020,1297996817352, qics.local,60020,1297919358488,
Docete.local,60020,1297919410096, liym.local,60020,1297919445796,
zcl.local,60020,1297919367472]

zcl.local and qics.local are the machines we have restared, other 2 machine
have kept running without restarting and are actually still serving regions.

>From the shell status:
10 servers, 5 dead, 10.1000 average Load

Why are there dead servers? And how to clear them so we could start
balancer?

Thanks,
Yi

Reply via email to