[
https://issues.apache.org/jira/browse/KUDU-1400?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15231031#comment-15231031
]
Todd Lipcon commented on KUDU-1400:
-----------------------------------
Making the 2 minute threshold configurable seems like an easy change (it's just
a constant right now).
Merging small DRS (especially those that have sat around for a while) does seem
like a good idea. It would be interesting to consider this along with some
other "lower priority" DRS reorganizations/rewrites such as policies that
switch to denser compression or different storage tiers, even if we dont
implement those features in the short term.
> Improve rowset compaction policy to consider merging small DRSs
> ---------------------------------------------------------------
>
> Key: KUDU-1400
> URL: https://issues.apache.org/jira/browse/KUDU-1400
> Project: Kudu
> Issue Type: Improvement
> Reporter: Binglin Chang
>
> We see some small table with light write load generate lot's of small
> DRS(~1MB), since those DRSes do not overlap much, they don't get the chance
> to be compacted, generating lot of very small files/blocks. So:
> # Compaction solution value should consider benefits of merging small DRS
> # Every 2 min flushing MRS(small or large) seems suboptimal, maybe flushing
> small MRS should have "lower priority" than rowset compaction with higher
> solution value?
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)