[jira] Commented: (CASSANDRA-1207) Don't write BloomFilters for skinny rows

Stu Hood (JIRA) Sat, 19 Jun 2010 14:15:47 -0700

    [ 
https://issues.apache.org/jira/browse/CASSANDRA-1207?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12880551#action_12880551
 ]


Stu Hood commented on CASSANDRA-1207:
-------------------------------------

> i'm skeptical that reading an index block's worth of columns is cheaper than 
> reading a bloom filter, even for skinny rows
Well, maybe "index block" isn't the right threshold for making this 
decision... I'll do some testing.

I marked this as a critical improvement because, for rows with 5 columns, I 
saw a > 25% improvement in compaction speed and disk usage.

> Don't write BloomFilters for skinny rows
> ----------------------------------------
>
>                 Key: CASSANDRA-1207
>                 URL: https://issues.apache.org/jira/browse/CASSANDRA-1207
>             Project: Cassandra
>          Issue Type: Improvement
>            Reporter: Stu Hood
>            Priority: Minor
>             Fix For: 0.7
>
>         Attachments: 
> 0001-Return-alwaysMatchingBloomFilter-for-0-length-filter.patch, 
> 0002-Conditionally-write-the-row-bloom-filter.patch
>
>
> All rows currently contain a serialized BloomFilter, regardless of size. For 
> smaller rows, it is much more efficient in space and CPU time to not write a 
> BloomFilter, and to eagerly perform lookups against the existing columns.
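
To make the idea in the description concrete, here is a minimal sketch in 
Python (not Cassandra's actual Java code). It assumes a hypothetical column 
count cutoff, SKINNY_THRESHOLD, since the ticket does not settle on a 
threshold, and it mirrors the attached patch names by treating a 0-length 
serialized filter as an always-matching filter. All class and function names 
below are illustrative assumptions.

```python
import hashlib

# Hypothetical cutoff: the ticket does not fix a threshold.
SKINNY_THRESHOLD = 8


class BloomFilter:
    """Tiny illustrative Bloom filter over column names (bytes)."""

    def __init__(self, num_bits, num_hashes=3):
        self.num_bits = num_bits
        self.num_hashes = num_hashes
        self.bits = bytearray((num_bits + 7) // 8)

    def _positions(self, name):
        # Derive num_hashes bit positions by salting SHA-1 with the hash index.
        for i in range(self.num_hashes):
            digest = hashlib.sha1(bytes([i]) + name).digest()
            yield int.from_bytes(digest[:8], "big") % self.num_bits

    def add(self, name):
        for p in self._positions(name):
            self.bits[p // 8] |= 1 << (p % 8)

    def might_contain(self, name):
        return all(self.bits[p // 8] & (1 << (p % 8))
                   for p in self._positions(name))


class AlwaysMatchingFilter:
    """Stands in for a 0-length filter: never rules a column out."""

    def might_contain(self, name):
        return True


def build_row_filter(column_names, threshold=SKINNY_THRESHOLD):
    """Return (filter, serialized_bytes); skinny rows serialize to b''."""
    if len(column_names) <= threshold:
        return AlwaysMatchingFilter(), b""
    bf = BloomFilter(num_bits=len(column_names) * 10)
    for name in column_names:
        bf.add(name)
    return bf, bytes(bf.bits)


def has_column(column_names, row_filter, name):
    """Consult the filter first, then check the actual columns."""
    if not row_filter.might_contain(name):
        return False
    return name in column_names
```

For a 5-column row, build_row_filter returns an empty serialized filter, so 
every lookup falls through to the columns themselves; for a 100-column row it 
writes a 125-byte filter (10 bits per column in this sketch).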

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
