Hello,

the hbase book (http://hbase.apache.org/book.html) suggests to increase
hbase.hregion.max.filesize to a large value. (> 1G) Then there are many
suggestions on mailing list to keep the dfs.block.size set at 64M. What is
the relationship between the two values? How does hbase prevent lots of
network traffic if there are up to 18 dfs blocks per Region.

Right now we operate a cluster with 50 nodes, dfs.block.size=64M and
hbase.hregion.max.filesize=134M. We have one large table that has over 50000
regions. Thats seems way to many. So according to the hbase book we ought to
be able to increase the hbase.hregion.max.filesize=1G and benefit from much
fewer splits. Our cluster is very heavy on writing, that means we currently
are splitting all the time.

Are there any suggestions how to proceed with the configuration of these two
config values.

Thanks for your help.
Matthias

p.s. we are currently in the process of upgrading to clouderas cdh3u0
release.

Reply via email to