[ 
https://issues.apache.org/jira/browse/CASSANDRA-674?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13080757#comment-13080757
 ] 

Stu Hood commented on CASSANDRA-674:
------------------------------------

I reran the test mentioned in [#comment-13054228] with replicate-on-write 
disabled, which makes for a much fairer comparison (trunk/47 require 2 seeks 
to miss on a column, and 3 to hit). This version of trunk also includes the 
CASSANDRA-47 snappy compression.

|| build || disk volume (bytes) || bytes per column || runtime (s) || throughput (ops/s) || avg read (ms) || 99th percentile read (ms) ||
| trunk - uncompressed | 16,713,328,798 | 66.8 | 6154 | 40620 | 2.54 | 6 |
| trunk - gz 6 * | 2,747,319,000 | 10.98 |-|-|-|-|
| trunk - [snappy|https://issues.apache.org/jira/browse/CASSANDRA-47] | 4,356,461,652 | 17.4 | 7906 | 31618 | 4.64 | 15 |
| 674+2319 | 2,675,888,207 | 10.7 | 7703 | 32454 | 3.04 | 10 |
\* _trunk - gz 6_ is the size of the trunk result's data directory after 
compressing it at GZIP level 6
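
To make the rows easier to compare, here is a rough sketch that expresses each build's disk volume and throughput relative to the uncompressed trunk run. The numbers are copied from the table; the dictionary layout and the function names (`relative_size`, `relative_throughput`, `implied_ops`) are just illustrative and not part of any patch.

```python
# Figures copied from the table above; None marks cells the table leaves blank.
results = {
    "trunk - uncompressed": {"disk_bytes": 16_713_328_798, "runtime_s": 6154, "ops_per_s": 40620},
    "trunk - gz 6":         {"disk_bytes": 2_747_319_000,  "runtime_s": None, "ops_per_s": None},
    "trunk - snappy":       {"disk_bytes": 4_356_461_652,  "runtime_s": 7906, "ops_per_s": 31618},
    "674+2319":             {"disk_bytes": 2_675_888_207,  "runtime_s": 7703, "ops_per_s": 32454},
}

BASELINE = "trunk - uncompressed"


def relative_size(build):
    """Disk volume of `build` as a fraction of the uncompressed trunk volume."""
    return results[build]["disk_bytes"] / results[BASELINE]["disk_bytes"]


def relative_throughput(build):
    """Throughput of `build` as a fraction of the baseline, or None if not measured."""
    ops = results[build]["ops_per_s"]
    if ops is None:
        return None
    return ops / results[BASELINE]["ops_per_s"]


def implied_ops(build):
    """Approximate operation count for the run (runtime * throughput), or None."""
    row = results[build]
    if row["runtime_s"] is None or row["ops_per_s"] is None:
        return None
    return row["runtime_s"] * row["ops_per_s"]


for build in results:
    size = relative_size(build)
    tput = relative_throughput(build)
    tput_text = "n/a" if tput is None else f"{tput:.1%}"
    print(f"{build:22s} size {size:.1%} of baseline, throughput {tput_text}")
```

The `implied_ops` helper is only a sanity check: if the measured runs executed the same workload, runtime times throughput should come out about the same for each of them.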

In this workload, we're reading from the tail of the row, which means that 
CASSANDRA-47 needs to decode two blocks per read (one for the row index at the 
head of the row, and one for the columns at the tail).
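
As a toy illustration of why a tail read touches two blocks, the sketch below counts which compressed blocks overlap the row index and the requested columns. It assumes fixed-size blocks over the uncompressed data and a row index stored at the start of the row; the 64 KiB block size, the 4 KiB index size and both function names are hypothetical and are not taken from the CASSANDRA-47 patch.

```python
def blocks_for_range(start, length, block_size):
    """Indices of the fixed-size blocks overlapped by [start, start + length)."""
    first = start // block_size
    last = (start + length - 1) // block_size
    return set(range(first, last + 1))


def blocks_decoded(row_start, index_len, col_offset, col_len, block_size=64 * 1024):
    """Blocks that must be decoded to read the row index plus one column range.

    row_start  : byte offset of the row within the data file
    index_len  : size of the row index stored at the head of the row
    col_offset : offset of the wanted columns, relative to row_start
    col_len    : number of bytes of column data to read
    block_size : assumed block size (hypothetical value)
    """
    index_blocks = blocks_for_range(row_start, index_len, block_size)
    column_blocks = blocks_for_range(row_start + col_offset, col_len, block_size)
    return index_blocks | column_blocks


ROW_LEN = 1024 * 1024  # a 1 MiB row, chosen only for the example

# Reading 1 KiB from the tail: the index block plus a separate tail block.
tail_read = blocks_decoded(row_start=0, index_len=4096,
                           col_offset=ROW_LEN - 1024, col_len=1024)

# Reading 1 KiB just after the index: both ranges fall in the same block.
head_read = blocks_decoded(row_start=0, index_len=4096,
                           col_offset=4096, col_len=1024)

print(len(tail_read), len(head_read))
```

Under these assumptions a read near the head of the row shares a block with the index, while a read from the tail of a wide row does not.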

> New SSTable Format
> ------------------
>
>                 Key: CASSANDRA-674
>                 URL: https://issues.apache.org/jira/browse/CASSANDRA-674
>             Project: Cassandra
>          Issue Type: Improvement
>          Components: Core
>            Reporter: Stu Hood
>             Fix For: 1.0
>
>         Attachments: 674-v1.diff, 674-v2.tgz, 674-v3.tgz, 674-ycsb.log, 
> trunk-ycsb.log
>
>
> Various tickets exist due to limitations in the SSTable file format, 
> including #16, #47 and #328. Attached is a proposed design/implementation of 
> a new file format for SSTables that addresses a few of these limitations.
> This v2 implementation is not ready for serious use: see comments for 
> remaining issues. It is roughly the format described here: 
> http://wiki.apache.org/cassandra/FileFormatDesignDoc 

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

        
