This is a known bug and is fixed in 90.2. Cloudera's CDH3B4 is 90.1. They are aware of this issue and if it impacts you, you may want to ask them to raise its priority for Todd L. to back port the fix. (Now Todd is going to hate me for making him do more work :-) )
-Mike > Date: Fri, 18 Mar 2011 13:46:30 +0200 > Subject: Re: Table distribution > From: [email protected] > To: [email protected] > > Hi, > > I have come across the exact same issue a while ago. I am no expert but it > seems balancer is not reassigning newly split regions, in order to reduce > client disconnect problems, selecting other regions on the RS instead. I > have seen a JIRA about this behaviour but i can't remember the id. Anyways, > this is not a real solution but i overcame it by disabling unused tables. > Still, i too am curious about some expert opinion. > > -- > erdem > > On Fri, Mar 18, 2011 at 1:08 PM, Iulia Zidaru <[email protected]> wrote: > > > > > > > Hi all, > > We are using ASF HBase 0.90 with cloudera distribution for HDFS(cdh3b3)). > > We have a cluster with 6 machines 1135 regions on each machine. > > We have many tables, each of them having regions on many nodes. We've > > created a new table and started to load it. The other tables are not used > > anymore. The problem is that all regions of the new table are on the same > > machine(about 150 regions) and it is really loaded! The entire cluster is > > still well distributed (the same number of regions on each machine), but it > > seems that only old tables are redistributed. Also, the table's data is > > distributed in HDFS across the entire cluster. > > Do you have any idea what is wrong with it? > > Thank you, > > Iulia > > > > > > > > > > > > > > -- > > Iulia Zidaru > > Java Developer > > > > 1&1 Internet AG - Bucharest/Romania - Web Components Romania > > 18 Mircea Eliade St > > Sect 1, Bucharest > > RO Bucharest, 012015 > > [email protected] > > 0040 31 223 9153 > > > > > > > > > > -- > erdem agaoglu
