[jira] Commented: (CASSANDRA-1046) optimize Memtable.getSliceIterator

Jonathan Ellis (JIRA) Fri, 04 Jun 2010 13:43:18 -0700

    [ 
https://issues.apache.org/jira/browse/CASSANDRA-1046?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12875739#action_12875739
 ]


Jonathan Ellis commented on CASSANDRA-1046:
-------------------------------------------

To clarify: are the "these changes" that made it fast just 
0001-trunk-cassandra-1046.patch, or some combination of the patch and 
improved client code?

> optimize Memtable.getSliceIterator
> ----------------------------------
>
>                 Key: CASSANDRA-1046
>                 URL: https://issues.apache.org/jira/browse/CASSANDRA-1046
>             Project: Cassandra
>          Issue Type: Improvement
>            Reporter: Jonathan Ellis
>            Assignee: Matthew F. Dennis
>             Fix For: 0.7
>
>         Attachments: 0001-trunk-cassandra-1046.patch, insertarator.py, 
> readarator.py
>
>
> As reported by James Golick, about 30% of the time in a read is spent in 
> SliceQueryFilter.getMemColumnIterator, virtually all of which is in 
> ConcurrentSkipListMap$Values.toArray().
> I wrote on the ML:
> Besides the UUID optimization you posted, we should do an audit of 
> ColumnFamily.getSortedColumns and replace with iteration where possible (in 
> this case, we'd be left with one copy of most of the columns, but that's 
> better than two).
> We can get rid of the other copy by fixing the logic in 
> Memtable.getSliceIterator, which says "copy all the columns, so we can do a 
> binary search on them to find where to start," but since columns are natively 
> in sorted order we could just use an iterator and a while loop
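
For illustration only, here is a rough sketch of the idea in the description: walk the already-sorted map with an iterator instead of copying every column and binary searching the copy. This is not the attached patch and does not use Cassandra's real classes. The SliceSketch class, the Integer column names, and the String values are all hypothetical stand-ins. The second method assumes that seeking with ConcurrentSkipListMap.tailMap would also be acceptable, which the issue does not say.

```java
import java.util.ArrayList;
import java.util.Iterator;
import java.util.List;
import java.util.Map;
import java.util.concurrent.ConcurrentSkipListMap;

// Hypothetical stand-in for a memtable's sorted columns; not Cassandra code.
final class SliceSketch {

    // Iterator plus while loop: skip entries before `start`, then collect
    // up to `count` values. No intermediate array copy is made.
    static List<String> sliceByIteration(ConcurrentSkipListMap<Integer, String> columns,
                                         int start, int count) {
        List<String> out = new ArrayList<>();
        Iterator<Map.Entry<Integer, String>> it = columns.entrySet().iterator();
        Map.Entry<Integer, String> current = null;

        // Advance until the first column whose name is >= start.
        while (it.hasNext()) {
            Map.Entry<Integer, String> e = it.next();
            if (e.getKey() >= start) {
                current = e;
                break;
            }
        }

        // Collect columns in their native sorted order.
        while (current != null && out.size() < count) {
            out.add(current.getValue());
            current = it.hasNext() ? it.next() : null;
        }
        return out;
    }

    // Assumed alternative: let the skip list seek to `start` directly
    // through a tailMap view, which is also copy-free.
    static List<String> sliceByTailMap(ConcurrentSkipListMap<Integer, String> columns,
                                       int start, int count) {
        List<String> out = new ArrayList<>();
        for (String value : columns.tailMap(start, true).values()) {
            if (out.size() >= count) {
                break;
            }
            out.add(value);
        }
        return out;
    }
}
```

The first method is the literal "iterator and a while loop" reading; its skip phase is linear in the number of columns before the start. The tailMap variant avoids that linear skip, at the cost of creating a view object per call.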

