[ 
https://issues.apache.org/jira/browse/CASSANDRA-9211?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14502687#comment-14502687
 ] 

Björn Hegerfors commented on CASSANDRA-9211:
--------------------------------------------

I have started with a first pass at this system table in this branch: 
https://github.com/Bj0rnen/cassandra/tree/Bj0rnen/SSTableMetadataHistory. I 
haven't tried yet, but figuring out when to do insertions to this table could 
be difficult. I think it should be when the SSTable first gets loaded by 
Cassandra, but if you restart Cassandra, I guess all SSTables will be loaded 
again, and then it seems unnecessary to insert them another time. But it 
wouldn't be too bad. If anyone can point me to a good place for writing to this 
table, that would be helpful! I'm not only interested in SSTables that come 
from compaction, since I want to know metadata about the ones that are the 
input to a compaction operation, not just the output.

> Keep history of SSTable metadata
> --------------------------------
>
>                 Key: CASSANDRA-9211
>                 URL: https://issues.apache.org/jira/browse/CASSANDRA-9211
>             Project: Cassandra
>          Issue Type: Wish
>            Reporter: Björn Hegerfors
>
> Similar to the request in CASSANDRA-8078, I'm interested in SSTables' 
> lineage. Specifically, I want to visualize the behaviors of compaction 
> strategies based on real data. For example, for STCS I might want to generate 
> something like this image: 
> http://www.datastax.com/wp-content/uploads/2011/10/size-tiered-1.png. For LCS 
> and DTCS, other properties than size are interesting.
> As Marcus responded in CASSANDRA-8078, there is already tracking of ancestors 
> in the SSTable metadata. But as far as I know, the metadata gets garbage 
> collected along with the SSTable itself. So what I propose is to persist 
> metadata in a system table. Maybe some, maybe all metadata. Like the 
> compaction_history table, this should have a default TTL of something like 
> one week or just one day. But users can freely modify/remove the TTL.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to