[
https://issues.apache.org/jira/browse/CASSANDRA-47?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13062730#comment-13062730
]
Pavel Yaskevich commented on CASSANDRA-47:
------------------------------------------
bq. Why would the uncompressed size for -C 250 be different from the
uncompressed size for -C 50: 3.8 GB vs the 1.7 GB from before?
I did simply choose the largest files from individual compactions.
Also I don't know why you getting only 3.7GB of data after `-S 1024 -n 1000000
-C 250 -V` because in all of my tests I get about 5.1GB which current trunk
code and with patch applied.
My results:
||build||disk volume (bytes)||write runtime (s)||read runtime (s)||read ops/s||
||trunk||5,241,718,144||166||2210||~450||
||#47||5,090,003,162 (1.2GB of blocks aka real size)||156||480||~2100||
Both sizes are after last major compaction, cassandra with default
configuration running on Quad-Core AMD Opteron(tm) Processor 2374 HE with
4229730MHz on each core, 2GB RAM - Debian 5.0 (Lenny) (kernel 2.6.35.4-rscloud)
hosted on rackspace.
> SSTable compression
> -------------------
>
> Key: CASSANDRA-47
> URL: https://issues.apache.org/jira/browse/CASSANDRA-47
> Project: Cassandra
> Issue Type: New Feature
> Components: Core
> Reporter: Jonathan Ellis
> Assignee: Pavel Yaskevich
> Labels: compression
> Fix For: 1.0
>
> Attachments: CASSANDRA-47.patch, snappy-java-1.0.3-rc4.jar
>
>
> We should be able to do SSTable compression which would trade CPU for I/O
> (almost always a good trade).
--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira