[
https://issues.apache.org/jira/browse/CASSANDRA-14587?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Paulo Motta updated CASSANDRA-14587:
------------------------------------
Summary: Deduplicate sstables shared by multiple snapshots when computing
true disk space used (was: TrueDiskSpaceUsed overcounts snapshots)
> Deduplicate sstables shared by multiple snapshots when computing true disk
> space used
> -------------------------------------------------------------------------------------
>
> Key: CASSANDRA-14587
> URL: https://issues.apache.org/jira/browse/CASSANDRA-14587
> Project: Cassandra
> Issue Type: Improvement
> Components: Tool/nodetool
> Environment: Debian 8
> Cassandra 3.11.2
> Reporter: Elliott Sims
> Priority: Low
>
> Running 'nodetool listsnapshots' seems to overcount "TrueDiskSpaceUsed" under
> some circumstances. Specifically when there's a large number of snapshots.
> I suspect that it's not deduplicating space used when multiple snapshots
> share sstables that are not part of the current table.
> Results of "nodetool listsnapshots":
> Total TrueDiskSpaceUsed: 396.11 MiB
> Results of "du -hcs" on the table's directory:
> 18M total
> This is 50+ snapshots (every minute) run with "-t <datestamp> -sf
> --column-family <tablename> <keyspace>"
> The results of a "du -hcs -L <directory" come out pretty close to the
> "TrueDiskSpaceUsed"
> I have only tested against 3.11.2, but have no reason to believe it's unique
> to that version or even 3.x.
--
This message was sent by Atlassian Jira
(v8.20.1#820001)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]