Hello, the HBase book (http://hbase.apache.org/book.html) suggests increasing hbase.hregion.max.filesize to a large value (> 1G). At the same time, there are many suggestions on the mailing list to keep dfs.block.size set at 64M. What is the relationship between the two values? How does HBase avoid generating lots of network traffic if there are up to 18 DFS blocks per region?
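To make the question concrete, here is a small sketch of the arithmetic I have in mind. It assumes a store file grows all the way to hbase.hregion.max.filesize and that HDFS cuts it into fixed-size blocks of dfs.block.size; the helper name blocks_per_file is only for illustration and is not part of HBase or Hadoop.

```python
MB = 1024 * 1024
GB = 1024 * MB


def blocks_per_file(file_size_bytes: int, block_size_bytes: int) -> int:
    """Number of HDFS blocks needed to hold one file of the given size.

    Assumption: the file is split into fixed-size blocks and only the
    last block may be partially filled.
    """
    if block_size_bytes <= 0:
        raise ValueError("block size must be positive")
    if file_size_bytes <= 0:
        return 0
    # Integer ceiling division, avoids floating-point rounding.
    return (file_size_bytes + block_size_bytes - 1) // block_size_bytes


if __name__ == "__main__":
    # A store file of exactly 1G with 64M blocks.
    print(blocks_per_file(1 * GB, 64 * MB))
    # A file slightly above 1G (18 x 64M = 1152M), matching "up to 18 blocks".
    print(blocks_per_file(1152 * MB, 64 * MB))
```

Under these assumptions a file of exactly 1G spans 16 blocks, and the 18 blocks I mention would correspond to a file of about 1152M, i.e. a region that has grown somewhat past 1G.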
Right now we operate a cluster with 50 nodes, dfs.block.size=64M and hbase.hregion.max.filesize=134M. We have one large table that has over 50000 regions. That seems like far too many. So, according to the HBase book, we ought to be able to increase hbase.hregion.max.filesize to 1G and benefit from far fewer splits. Our cluster is very write-heavy, which means we are currently splitting all the time. Are there any suggestions on how to proceed with the configuration of these two values?

Thanks for your help.

Matthias

p.s. We are currently in the process of upgrading to Cloudera's CDH3u0 release.
