[ 
https://issues.apache.org/jira/browse/HBASE-911?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andrew Kyle Purtell closed HBASE-911.
-------------------------------------

> Minimize filesystem footprint
> -----------------------------
>
>                 Key: HBASE-911
>                 URL: https://issues.apache.org/jira/browse/HBASE-911
>             Project: HBase
>          Issue Type: Improvement
>            Reporter: Michael Stack
>            Priority: Major
>
> This issue is about looking into how much space in filesystem hbases uses.  
> Daniel Ploeg suggests that hbase is profligate in its use of space in hdfs.   
> Given that block sizes by default are 64MB, and that every time hbase writes 
> a store file that its accompanied by an index file and a very small metadata 
> file, thats 3*64MB even if the file is empty (TODO: Prove this).  The 
> situation is aggrevated by the fact that hbase does a flush of whatever is in 
> memory every 30 minutes to minimize loss in the absence of appends; this 
> latter action makes for lots of small files.
> The solution to the above is implement append so optional flush is not 
> necessary and a file format that aggregates info, index and data all in the 
> one file.   Short-term, we should set block size on the info/metadata file 
> down to 4k or some such small size and look into doing likewise for the 
> mapfile index.



--
This message was sent by Atlassian Jira
(v8.20.7#820007)

Reply via email to