CalvinKirs opened a new issue, #2394:
URL: https://github.com/apache/incubator-seatunnel/issues/2394

   Change data capture (CDC) refers to the process of identifying and capturing 
changes made to data in a database and then delivering those changes in 
real-time to a downstream process or system.
   
   CDC is mainly divided into two ways: query-based and Binlog-based.
   We know that MySQL has binlog (binary log) to record the user's changes to 
the database, so it is logical that one of the simplest and most efficient CDC 
implementations can be done using binlog. Of course, there are already many 
open source MySQL CDC implementations that work out of the box. Using binlog is 
not the only way to implement CDC (at least for MySQL), even database triggers 
can perform similar functions, but they may be dwarfed in terms of efficiency 
and impact on the database.
   
   Typically, after a CDC captures changes to a database, it will publish the 
change events to a message queue for consumers to consume, such as Debezium, 
which persists MySQL (and also supports PostgreSQL, Mongo, etc.) changes to 
Kafka, and by subscribing to the events in Kafka, we can get the content of the 
changes and implement the functionality we need.
   
   And as data synchronization, I think we need to support CDC as a feature, 
and I want to hear from you all how you think it can be implemented in 
SeaTunnel.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to