[
https://issues.apache.org/jira/browse/FLINK-26822?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17573185#comment-17573185
]
Jesus H Christ edited comment on FLINK-26822 at 7/29/22 11:07 PM:
------------------------------------------------------------------
I was eagerly awating these changes in official Flink releases, thanks
[~echauchot] . One note, I did find the ds-bulk-partitioner classes useful
while writing SplitEnumerator, and based Splits on RingRanges just like you did
in the implementation I tried.
[https://github.com/datastax/dsbulk/blob/1.x/partitioner/src/main/java/com/datastax/oss/dsbulk/partitioner/PartitionGenerator.java]
They do note some differences to the Spark connector:
{noformat}
* <p>Note: the grouping algorithm used in DSBulk is different from the one
used by the Spark
* connector: the former favors groups of contiguous ranges sharing the same
replicas (in order to
* make it possible to route a range read to a coordinator that is also a
replica), whereas the
* latter favors data locality (i.e. groups even non-continguous ranges as long
as they share at
* least one common replica).{noformat}
was (Author: JIRAUSER288497):
I was eagerly awating these changes in official Flink releases, thanks
[~echauchot] . One note, I did find the ds-bulk-partitioner classes useful
while writing SplitEnumerator, and based Splits on RingRanges just like you did
in the implementation I tried.
[https://github.com/datastax/dsbulk/blob/1.x/partitioner/src/main/java/com/datastax/oss/dsbulk/partitioner/PartitionGenerator.java]
> Add Source implementation for Cassandra connector
> -------------------------------------------------
>
> Key: FLINK-26822
> URL: https://issues.apache.org/jira/browse/FLINK-26822
> Project: Flink
> Issue Type: Sub-task
> Components: Connectors / Cassandra
> Reporter: Martijn Visser
> Assignee: Etienne Chauchot
> Priority: Major
>
> The Cassandra connector is currently only available as a Sink implementation
> for DataStream users. We should also make it available as a Source. This
> should be done via Flink's Source API. More details can be found in FLIP-27
> https://cwiki.apache.org/confluence/display/FLINK/FLIP-27%3A+Refactor+Source+Interface
--
This message was sent by Atlassian Jira
(v8.20.10#820010)