[ 
https://issues.apache.org/jira/browse/HBASE-11644?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jingcheng Du updated HBASE-11644:
---------------------------------

    Attachment: HBASE-11644.diff

This patch adds two tools. The first is a file cleaner that deletes MOB 
files whose TTL has expired. The second is a sweep tool that merges small 
MOB files into bigger ones.
The cleaner is run periodically by the HMaster; the sweep tool is triggered 
by users.
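
As a rough illustration of the two operations described above, here is a 
minimal Java sketch. It is not the code in the attached patch and it uses no 
HBase classes: MobFile, expired, mergeBatches, and both size parameters are 
hypothetical names made up for this example, and it assumes that a file's 
creation time is the reference point for its TTL.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Illustrative only: every type and method here is hypothetical, not an HBase API.
final class MobMaintenanceSketch {

    /** Minimal stand-in for a MOB file: a name, a creation time, and a size. */
    static final class MobFile {
        final String name;
        final long createdMillis;
        final long sizeBytes;

        MobFile(String name, long createdMillis, long sizeBytes) {
            this.name = name;
            this.createdMillis = createdMillis;
            this.sizeBytes = sizeBytes;
        }
    }

    /** Cleaner side: select the files whose age is strictly greater than the TTL. */
    static List<MobFile> expired(List<MobFile> files, long nowMillis, long ttlMillis) {
        List<MobFile> result = new ArrayList<>();
        for (MobFile f : files) {
            if (nowMillis - f.createdMillis > ttlMillis) {
                result.add(f);
            }
        }
        return result;
    }

    /**
     * Sweep side: group files smaller than smallThresholdBytes into batches.
     * A batch is closed once its total size reaches targetBytes; each batch
     * would then be rewritten as one bigger file.
     */
    static List<List<MobFile>> mergeBatches(List<MobFile> files,
                                            long smallThresholdBytes,
                                            long targetBytes) {
        if (targetBytes < smallThresholdBytes) {
            // Guarantees that no single small file can fill a batch on its own.
            throw new IllegalArgumentException("targetBytes must be >= smallThresholdBytes");
        }
        List<List<MobFile>> batches = new ArrayList<>();
        List<MobFile> current = new ArrayList<>();
        long currentSize = 0;
        for (MobFile f : files) {
            if (f.sizeBytes >= smallThresholdBytes) {
                continue; // already big enough, leave it alone
            }
            current.add(f);
            currentSize += f.sizeBytes;
            if (currentSize >= targetBytes) {
                batches.add(current);
                current = new ArrayList<>();
                currentSize = 0;
            }
        }
        if (current.size() > 1) {
            batches.add(current); // a single leftover file gains nothing from merging
        }
        return batches;
    }

    public static void main(String[] args) {
        List<MobFile> files = Arrays.asList(
            new MobFile("a", 0L, 10L),
            new MobFile("b", 500L, 20L),
            new MobFile("c", 900L, 200L),
            new MobFile("d", 950L, 30L));
        System.out.println("expired files: " + expired(files, 1000L, 400L).size());
        System.out.println("merge batches: " + mergeBatches(files, 100L, 100L).size());
    }
}
```

In this sketch the cleaner only needs file metadata, which is why it is cheap 
enough to run periodically, while the sweep has to rewrite data and is 
therefore left as an explicitly triggered operation.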

> External MOB compaction tools
> -----------------------------
>
>                 Key: HBASE-11644
>                 URL: https://issues.apache.org/jira/browse/HBASE-11644
>             Project: HBase
>          Issue Type: Sub-task
>          Components: Compaction, master
>            Reporter: Jingcheng Du
>            Assignee: Jingcheng Du
>         Attachments: HBASE-11644.diff
>
>
> From the design doc, MOB files are not involved in the normal HBase 
> compaction process. This means deleted MOBs still take up space and 
> MOB files that accrue over time are never merged. Currently, MOBs 
> depend on two external tools:
> 1) A TTL cleaner that removes MOBs that have passed their TTL or exceeded 
> minVersions.
> 2) A 'sweep tool' cleaner that removes MOBs whose references have been 
> deleted and merges small files into larger ones.
> Today the tools are triggered by admins. The longer-term goal is to 
> integrate them into HBase so that MOBs are cleaned by default. The tools 
> will be preserved, however, so that advanced admins can disable automatic 
> cleanups and manually trigger these compaction-like operations. #1 would 
> likely be a chore in the master, while #2 requires some design work to 
> integrate into HBase.



--
This message was sent by Atlassian JIRA
(v6.2#6252)
