Eric Newton created ACCUMULO-1802:
-------------------------------------
Summary: use case for future configurability of major compactions
Key: ACCUMULO-1802
URL: https://issues.apache.org/jira/browse/ACCUMULO-1802
Project: Accumulo
Issue Type: Sub-task
Components: tserver
Reporter: Eric Newton
The default compaction strategy has a tendency to put the oldest data in the
largest files. This leads to a lot of work when it is time to age off data.
One could imaging a compaction strategy that would split data into separate
files based on the timestamp. Additionally, if the min/max timestamps for a
file were known, old data could be aged off by deleting whole files.
Augment the configurable compaction strategy to support multiple output files,
and saving/using extra metadata in each file.
--
This message was sent by Atlassian JIRA
(v6.1#6144)