huaxiang sun created HBASE-17172:
------------------------------------
Summary: Optimize major mob compaction with _del files
Key: HBASE-17172
URL: https://issues.apache.org/jira/browse/HBASE-17172
Project: HBase
Issue Type: Improvement
Components: mob
Affects Versions: 2.0.0
Reporter: huaxiang sun
Assignee: huaxiang sun
Today, when there is a _del file in mobdir, with major mob compaction, every
mob file will be recompacted, this causes lots of IO and slow down major mob
compaction (may take months to finish). This needs to be improved. A few ideas
are:
1) Do not compact all _del files into one, instead, compact them based on
groups with startKey as the key. Then use firstKey/startKey to make each mob
file to see if the _del file needs to be included for this partition.
2). Based on the timerange of the _del file, compaction for files after that
timerange does not need to include the _del file as these are newer files.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)