Dear all,

We have a small test cluster with 5 nodes, 1 master and 4 datanodes. The nodes 
are installed with Ubuntu desktop 10.10, hadoop version 'Hadoop 0.20.2-CDH3B4' 
and hbase version 0.90.1-CDH3B4. The hbase database is well balanced and 
contains one table (TAB_1) containing 270.000.000 data records. The table 
consists of 84 regions each with 1 up to 3 storefiles and 100Mbyte -> 216 Mbyte 
of size for the regions. The rowkey is a monotonic raising timestamp, wich I 
know is bad for parallelization but we are only testing some map features so 
far.

When I create TAB_1 it distributes very good over the 4 region servers, so that 
each server contains 20 - 22 regions after creation. When I create a second 
table (TAB_2) with the same rowkey and the same data this table does not 
distribute over the servers, but is only stored on one of the regionserver 
(R1). The other nodes (R2, R3, R4) are not used for storage. The cluster still 
remains balanced but I can see drifting regions from TAB_1 away from R1 which 
used for storing TAB_2. After a while there are no regions of TAB_1 left on R1 
and now the load balancer starts moving regions of TAB_2 to R2 .. R4. The 
active region that is written into remains on R1.

How can this behavious be explained. I normally would expect that TAB_2 will 
distribute over all 4 regionservers when creating and would not be stored on 
one of the servers and have the load balancer in the background shift the data.

Is this a normal hbase behaviour or is there some missconfiguration in my 
cluster?

Thanks in advance

Christian

---------------8<--------------------------------

Siemens AG
Corporate Technology
Corporate Research and Technologies
CT T DE IT3
Otto-Hahn-Ring 6
81739 München, Deutschland
Tel.: +49 (89) 636-42722
Fax: +49 (89) 636-41423
mailto:[email protected]

Siemens Aktiengesellschaft: Vorsitzender des Aufsichtsrats: Gerhard Cromme; 
Vorstand: Peter Löscher, Vorsitzender; Wolfgang Dehen, Brigitte Ederer, Joe 
Kaeser, Barbara Kux, Hermann Requardt, Siegfried Russwurm, Peter Y. Solmssen; 
Sitz der Gesellschaft: Berlin und München, Deutschland; Registergericht: Berlin 
Charlottenburg, HRB 12300, München, HRB 6684; WEEE-Reg.-Nr. DE 23691322



Reply via email to