YannByron commented on PR #5885:
URL: https://github.com/apache/hudi/pull/5885#issuecomment-1186545393

   Hey @prasannarajaperumal Thank you very much for reviewing this.
   
   CDC is not a new concept, is a common concept for database. So I think it's 
better to distinguish CDC and `Incremental Query`. Some reasons: 
   - CDC is better known than incremental query. `incremental query` is defined 
by hudi.
   - Different from `Incremental Query` and `Snapshot Query`, CDC has its own 
output format in which every record have `op`, `ts_ms`, `before` and `after` 
fields.
   - According to RFC-51, CDC has its own read and write logical. We have to 
persist some other information for CDC when data is written to hudi.
   
   Looking forward to your reply.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to