[DISCUSS] CEP-44: Kafka integration for Cassandra CDC using Sidecar

James Berragan Fri, 27 Sep 2024 14:42:01 -0700

Hi everyone,

Wiki:
https://cwiki.apache.org/confluence/display/CASSANDRA/CEP-44%3A+Kafka+integration+for+Cassandra+CDC+using+Sidecar


We would like to propose this CEP for adoption by the community.

Change data capture (CDC) is a common technique in databases, but right now
there is no out-of-the-box solution for doing this easily and at scale with
Cassandra. Our proposal is to build a fully-fledged solution into the Apache
Cassandra Sidecar. This comes with a number of benefits:
- Sidecar is an official part of the existing Cassandra ecosystem.
- Sidecar runs co-located with Cassandra instances and so scales with the
cluster size.
- Sidecar can access the underlying Cassandra database to store CDC
configuration and the CDC state in a special table.
- Running in the Sidecar requires no additional external resources.

We anticipate that the core CDC module will be pluggable and reusable. It is
available for review here:
https://github.com/apache/cassandra-analytics/pull/87. The remaining
Sidecar code will follow.

As a reminder, please keep the discussion here on the dev list rather than in
the wiki, as we’ve found it easier to manage via email.

Sincerely,
James Berragan
Bernardo Botella Corbi
Yifan Cai
Jyothsna Konisa
