YannByron commented on PR #5885: URL: https://github.com/apache/hudi/pull/5885#issuecomment-1186545393
Hey @prasannarajaperumal Thank you very much for reviewing this. CDC is not a new concept, is a common concept for database. So I think it's better to distinguish CDC and `Incremental Query`. Some reasons: - CDC is better known than incremental query. `incremental query` is defined by hudi. - Different from `Incremental Query` and `Snapshot Query`, CDC has its own output format in which every record have `op`, `ts_ms`, `before` and `after` fields. - According to RFC-51, CDC has its own read and write logical. We have to persist some other information for CDC when data is written to hudi. Looking forward to your reply. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
