[
https://issues.apache.org/jira/browse/CASSANDRA-11679?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15264315#comment-15264315
]
Benjamin Lerer commented on CASSANDRA-11679:
--------------------------------------------
If I am not mistaken, the default fetch size in the java driver should be 5000.
Which is more than the number of rows that you have. Did you use another value
like 100?
What is the reason for your limit clause? Does it have an impact on the number
of row returned?
> Cassandra Driver returns different number of results depending on fetchsize
> ---------------------------------------------------------------------------
>
> Key: CASSANDRA-11679
> URL: https://issues.apache.org/jira/browse/CASSANDRA-11679
> Project: Cassandra
> Issue Type: Bug
> Components: CQL
> Reporter: Varun Barala
> Assignee: Benjamin Lerer
>
> I'm trying to fetch all distinct keys from a CF using cassandra-driver
> (2.1.7.1) and I observed some strange behavior :-
> The total distinct rows are 498 so If I perform a query get All distinctKeys
> It return 503 instead of 498(five keys twice).
> But If I define the fetch size in select statement more than 498 then it
> returns exact 498 rows.
> And If I execute same statement on Dev-center it returns 498 rows.
> Some Additional and useful information :-
> -------------------------------------------------------
> Cassandra-2.1.13 (C)* version
> Consistency level: ONE
> local machine(ubuntu 14.04)
> Table Schema:-
> ----------------------
> {code:xml}
> CREATE TABLE sample (
> pk1 text,
> pk2 text,
> row_id uuid,
> value blob,
> PRIMARY KEY (( pk1, pk2))
> ) WITH bloom_filter_fp_chance = 0.01
> AND caching = '{"keys":"ALL", "rows_per_partition":"NONE"}'
> AND comment = ''
> AND compaction = {'class':
> 'org.apache.cassandra.db.compaction.SizeTieredCompactionStrategy'}
> AND compression = {'sstable_compression':
> 'org.apache.cassandra.io.compress.LZ4Compressor'}
> AND dclocal_read_repair_chance = 0.1
> AND default_time_to_live = 0
> AND gc_grace_seconds = 864000
> AND max_index_interval = 2048
> AND memtable_flush_period_in_ms = 0
> AND min_index_interval = 128
> AND read_repair_chance = 0.0
> AND speculative_retry = '99.0PERCENTILE';
> {code}
> query :-
> ------------
> {code:xml}
> SELECT DISTINCT pk2, pk1 FROM sample LIMIT 2147483647;
> {code}
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)