[ 
https://issues.apache.org/jira/browse/CASSANDRA-1472?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Stu Hood updated CASSANDRA-1472:
--------------------------------

    Attachment: v4-bench-c32.txt
                v4-bench-c50.txt

Attaching some benchmark numbers generated by CASSANDRA-1531.

{{v4-bench-c32.txt}} is for cardinality == 32, which is a bit of a magic number 
in the current implementation of KEYS_BITMAP indexes: it's the maximum number 
of bins that our current implementation will create, meaning that every value 
gets its own bin, so there are no false positives returned by the sstables.

{{v4-bench-c50.txt}} is for cardinality == 50, meaning that some of the bins 
are multi-valued, leading to a larger number of false positives. Since our bin 
selection is still naive, it is generating lumpy buckets: you can see a long 
tail for KEYS_BITMAP reads, since one thread got stuck with higher cardinality 
bins.

----

TODOs:
* Integrate with 1415 now that it is in trunk
* Improve bin selection to eliminate lumpy bins (as seen with c==50)
* Replace Memtable brute-force filtering with an in-memory index attached to 
the Memtable (I didn't do any tests with memtables, but I'm assuming this 
should still be a priority)
* Implement the "OrBiC" projection described in the paper to eliminate false 
positives
** Multi-valued bins would have their values projected into another SSTable 
component

> Add bitmap secondary indexes
> ----------------------------
>
>                 Key: CASSANDRA-1472
>                 URL: https://issues.apache.org/jira/browse/CASSANDRA-1472
>             Project: Cassandra
>          Issue Type: Improvement
>          Components: Core
>            Reporter: Stu Hood
>             Fix For: 0.7.0
>
>         Attachments: 1472-v3.tgz, 1472-v4.tgz, anatomy.png, v4-bench-c32.txt, 
> v4-bench-c50.txt
>
>
> Bitmap indexes are a very efficient structure for dealing with immutable 
> data. We can take advantage of the fact that SSTables are immutable by 
> attaching them directly to SSTables as a new component (supported by 
> CASSANDRA-1471).

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to