Yifan Cai created CASSANDRA-19325:
-------------------------------------
Summary: [Analytics] Fix range splitting that can produce
overlapping ranges
Key: CASSANDRA-19325
URL: https://issues.apache.org/jira/browse/CASSANDRA-19325
Project: Cassandra
Issue Type: Bug
Reporter: Yifan Cai
Assignee: Yifan Cai
The range splitting implementation can produce the following false results.
- Given a tiny range, it can produce duplicated ranges, leading to spark
executors working on the same data set.
- The produced ranges are closed on both ends, making the same tokens being
shared by 2 ranges, leading to data duplication.
Besides the splitting error, during write process, the split sub-ranges are
unordered. It does not seem to affect correctness, but can be confusing.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]