[ https://issues.apache.org/jira/browse/CASSANDRA-3442?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13142006#comment-13142006 ]
Sylvain Lebresne commented on CASSANDRA-3442:
---------------------------------------------
I think the idea has merit.
We have to keep in mind that unless the expired data is also gcable, this would
only transform expiring columns into tombstones, so for those that have very
small columns (~4 bytes) this would be useless. I would also maybe bump that
20% up, to be sure we don't too often rewrite sstables that are good candidates
for compaction anyway. But in principle this is a good idea, I believe.
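
To make the expired-versus-gcable point concrete, here is a rough Python sketch of how much space rewriting a single expired column might reclaim. It is not Cassandra code: the Column class, the bytes_reclaimed function, the 4-byte tombstone payload, and the rule that a column becomes gcable once gc_grace has elapsed after expiration are all simplifying assumptions made for illustration, and fixed per-column metadata is ignored.

```python
from dataclasses import dataclass

# Assumption: a tombstone keeps the column name and stores a small
# deletion-time payload (taken here as 4 bytes) in place of the value.
TOMBSTONE_PAYLOAD_BYTES = 4


@dataclass
class Column:
    """Hypothetical, simplified expiring column (not a Cassandra class)."""
    name_bytes: int   # size of the column name
    value_bytes: int  # size of the column value
    expires_at: int   # expiration time, seconds since the epoch


def bytes_reclaimed(col: Column, now: int, gc_grace: int) -> int:
    """Estimate the bytes saved by rewriting the sstable that holds `col`."""
    if now < col.expires_at:
        return 0  # still live: nothing to reclaim
    if now >= col.expires_at + gc_grace:
        # Expired and gcable: the whole column can be dropped.
        return col.name_bytes + col.value_bytes
    # Expired but not yet gcable: the column only becomes a tombstone,
    # so at most the value shrinks to the tombstone payload.
    return max(0, col.value_bytes - TOMBSTONE_PAYLOAD_BYTES)
```

Under these assumptions, a column with a 4-byte value that has expired but is not yet gcable reclaims nothing, while one with a 1000-byte value reclaims 996 bytes. Both reclaim their full size once gc_grace has passed.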
> TTL histogram for sstable metadata
> ----------------------------------
>
> Key: CASSANDRA-3442
> URL: https://issues.apache.org/jira/browse/CASSANDRA-3442
> Project: Cassandra
> Issue Type: Improvement
> Components: Core
> Reporter: Jonathan Ellis
> Assignee: Sylvain Lebresne
> Priority: Minor
> Labels: compaction
>
> Under size-tiered compaction, you can generate large sstables that compact
> infrequently. With expiring columns mixed in, we could waste a lot of space
> in this situation.
> If we kept a TTL EstimatedHistogram in the sstable metadata, we could do a
> single-sstable compaction against sstables with over 20% (?) expired data.
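
As an illustration of the proposal above, the following Python sketch estimates the expired fraction of an sstable from a histogram of expiration times and compares it to a threshold. The histogram layout (sorted bucket upper bounds plus per-bucket column counts) and both function names are hypothetical and do not reflect the EstimatedHistogram API. The estimate counts columns rather than bytes, and the 20% default is just the tentative figure from the description, left configurable.

```python
from bisect import bisect_right
from typing import Sequence


def expired_fraction(bucket_bounds: Sequence[int],
                     counts: Sequence[int],
                     now: int) -> float:
    """Conservative estimate of the fraction of columns already expired.

    bucket_bounds[i] is the latest expiration time covered by bucket i
    (sorted ascending); counts[i] is the number of columns in that bucket.
    Only buckets that lie entirely at or before `now` are counted, so a
    partially expired bucket is treated as fully live.
    """
    total = sum(counts)
    if total == 0:
        return 0.0
    fully_expired = bisect_right(bucket_bounds, now)
    return sum(counts[:fully_expired]) / total


def worth_single_sstable_compaction(bucket_bounds: Sequence[int],
                                    counts: Sequence[int],
                                    now: int,
                                    threshold: float = 0.20) -> bool:
    """True when the estimated expired fraction exceeds `threshold`."""
    return expired_fraction(bucket_bounds, counts, now) > threshold


bounds = [100, 200, 300, 400]   # hypothetical expiration-time buckets
counts = [10, 20, 30, 40]       # columns per bucket
print(worth_single_sstable_compaction(bounds, counts, now=250))
```

With these sample values, the first two buckets (30 of 100 columns) are fully expired at now=250, so the check passes at the 20% threshold but would not pass at a higher threshold such as 50%.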
--
This message is automatically generated by JIRA.