[ https://issues.apache.org/jira/browse/CASSANDRA-4165?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Jonathan Ellis updated CASSANDRA-4165:
--------------------------------------
Component/s: Core
Fix Version/s: 2.1 (was: 2.0.4)
It turns out that uncompressed writes spend about 20% of their time computing
the SHA-1 digest.
I think we should switch to Adler-32 here as well as at the block level. (I note
that Adler-32-capable command-line tools exist for RHEL and Debian, as well as
for the scripting languages Perl, Python, and Ruby.)
(We still need to avoid computing the redundant CRC component for compressed
files, too.)
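
As a rough illustration of the scripting-language point, here is a minimal Python sketch that streams a file once and computes both an Adler-32 checksum and a SHA-1 digest. The function name `file_checksums` and the chunk size are hypothetical choices for this example; nothing here reflects Cassandra's actual writer code.

```python
import hashlib
import zlib


def file_checksums(path, chunk_size=64 * 1024):
    """Stream a file once and return (adler32, sha1_hex).

    Hypothetical helper for illustration only.
    """
    adler = 1  # Adler-32 starting value, as defined by zlib
    sha1 = hashlib.sha1()
    with open(path, "rb") as f:
        # Read fixed-size chunks until read() returns b"" at end of file.
        for chunk in iter(lambda: f.read(chunk_size), b""):
            adler = zlib.adler32(chunk, adler)  # running checksum
            sha1.update(chunk)
    # Mask so the result is an unsigned 32-bit value on every Python version.
    return adler & 0xFFFFFFFF, sha1.hexdigest()
```

Timing the two update calls separately on a large file would give a feel for the relative cost, though results from Python will not translate directly to the JVM.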
/cc [~pkolaczk] [~benedict]
> Generate Digest file for compressed SSTables
> --------------------------------------------
>
> Key: CASSANDRA-4165
> URL: https://issues.apache.org/jira/browse/CASSANDRA-4165
> Project: Cassandra
> Issue Type: Improvement
> Components: Core
> Reporter: Marcus Eriksson
> Assignee: Marcus Eriksson
> Priority: Minor
> Labels: performance
> Fix For: 2.1
>
> Attachments: 0001-Generate-digest-for-compressed-files-as-well.patch,
> 4165-rebased.txt
>
>
> We use the generated *Digest.sha1 files to verify backups; it would be nice if
> they were generated for compressed sstables as well.
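
The backup check described in the issue could be sketched roughly as follows. This assumes, without confirmation from the ticket, that the digest file's first whitespace-separated token is the hex SHA-1 of the data file; the names `sha1_of_file` and `matches_digest` are hypothetical.

```python
import hashlib


def sha1_of_file(path, chunk_size=64 * 1024):
    """Return the hex SHA-1 digest of a file, read in chunks."""
    sha1 = hashlib.sha1()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            sha1.update(chunk)
    return sha1.hexdigest()


def matches_digest(data_path, digest_path):
    """Compare a data file's SHA-1 against the value stored in a digest file.

    Assumption: the first whitespace-separated token in the digest file
    is the expected hex digest. An empty digest file counts as a mismatch.
    """
    with open(digest_path, "r", encoding="ascii") as f:
        tokens = f.read().split()
    if not tokens:
        return False
    return sha1_of_file(data_path) == tokens[0].lower()
```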