Sylvain Lebresne created CASSANDRA-8906:
-------------------------------------------

             Summary: Experiment with optimizing partition merging when we can 
prove that some sources don't overlap
                 Key: CASSANDRA-8906
                 URL: https://issues.apache.org/jira/browse/CASSANDRA-8906
             Project: Cassandra
          Issue Type: Improvement
            Reporter: Sylvain Lebresne


When we merge a partition from two sources and it turns out that those 2 
sources don't overlap for that partition, we still end up doing one comparison 
by row in the first source. However, if we can prove that the 2 sources don't 
overlap, for example by using the sstable min/max clustering values that we 
store, we could speed this up. Note that it practice it's little bit more hairy 
because we need to deal with N sources, but that's probably not too hard either.

I'll note that using the sstable min/max clustering values is not terribly 
precise. We could do better if we were to push the same reasoning inside the 
merge iterator, by for instance using the sstable per-partition index, which 
can in theory tell use things like "don't bother comparing rows until the end 
of this row block". This is quite a bit more involved though so maybe note 
worth the complexity.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to