Yes! I’m really looking forward to trying this out. The CEP looks really well thought out. I think this will make CDC a lot more useful for a lot of teams.
Jon

On Fri, Sep 27, 2024 at 4:23 PM Josh McKenzie <jmcken...@apache.org> wrote:

> Really excited to see this hit the ML James.
>
> As author of the base CDC (get your stones ready for throwing :D) and
> someone moderately involved in the CEP here, definitely welcome any
> questions. CDC is a *thorny problem* in a multi-replica distributed
> system like this.
>
> On Fri, Sep 27, 2024, at 5:40 PM, James Berragan wrote:
>
> Hi everyone,
>
> Wiki:
> https://cwiki.apache.org/confluence/display/CASSANDRA/CEP-44%3A+Kafka+integration+for+Cassandra+CDC+using+Sidecar
>
> We would like to propose this CEP for adoption by the community.
>
> CDC is a common technique in databases but right now there is no
> out-of-the-box solution to do this easily and at scale with Cassandra. Our
> proposal is to build a fully-fledged solution into the Apache Cassandra
> Sidecar. This comes with a number of benefits:
> - Sidecar is an official part of the existing Cassandra eco-system.
> - Sidecar runs co-located with Cassandra instances and so scales with the
> cluster size.
> - Sidecar can access the underlying Cassandra database to store CDC
> configuration and the CDC state in a special table.
> - Running in the Sidecar does not require additional external resources to
> run.
>
> The core CDC module we anticipate will be pluggable and re-usable, it is
> available for review here:
> https://github.com/apache/cassandra-analytics/pull/87. The remaining
> Sidecar code will follow.
>
> As a reminder, please keep the discussion here on the dev list vs. in the
> wiki, as we’ve found it easier to manage via email.
>
> Sincerely,
> James Berragan
> Bernardo Botella Corbi
> Yifan Cai
> Jyothsna Konisa