Hi,
I have a question on how the splits work on hbase.
I have one master which also acts as a region server, along with other 3
region servers.
I have set the following parameters on all the region servers
<property>
<name>hbase.hregion.max.filesize</name>
<value>1048576</value>
<description>
Maximum HStoreFile size. If any one of a column families' HStoreFiles
has
grown to exceed this value, the hosting HRegion is split in two.
Default: 256M.
</description>
</property>
<property>
<name>hbase.hregion.memstore.flush.size</name>
<value>6291456</value>
<description>
Memstore will be flushed to disk if size of the memstore
exceeds this number of bytes. Value is checked by a thread that runs
every hbase.server.thread.wakefrequency.
</description>
</property>
i have 2 tables where I am loading data and I am expecting 1 M chunk of
files to be created. However if I check the size on DFS, it is creating
around 500K of files:
/hbase/cpu_util_30secs/1693967354/data/2196215602953537657 - 576.7 KB
/hbase/cpu_util_30secs/366365858/data/4815597063578386524 - 640.89 KB
All the table regions that it has created are of this size. However I was
expecting them to be 1M.
Is there some other parameter that I must be tweaking?
Regards,
Mayuresh