[ 
https://issues.apache.org/jira/browse/CASSANDRA-10010?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15119631#comment-15119631
 ] 

Sylvain Lebresne commented on CASSANDRA-10010:
----------------------------------------------

+1

> Paging on DISTINCT queries repeats result when first row in partition changes
> -----------------------------------------------------------------------------
>
>                 Key: CASSANDRA-10010
>                 URL: https://issues.apache.org/jira/browse/CASSANDRA-10010
>             Project: Cassandra
>          Issue Type: Bug
>            Reporter: Tyler Hobbs
>            Assignee: Benjamin Lerer
>            Priority: Minor
>             Fix For: 2.2.x
>
>         Attachments: 10010-2.2.txt
>
>
> When paging, we always check new pages to see if they start with the same row 
> that the previous page ended with, and if so, we trim that row to avoid 
> duplicates.  With {{DISTINCT}} queries, we only fetch the first row in each 
> partition.  If that row happens to change (it's deleted, or another row is 
> inserted at the front of the partition) in between fetching the two pages, 
> our check for a matching row will fail, resulting in a duplicate row being 
> returned.
> It seems like the correct fix is to handle {{DISTINCT}} queries specially and 
> only check to see if the partition key matches the last returned one instead 
> checking that the rows match.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to