xushiyan commented on code in PR #6256: URL: https://github.com/apache/hudi/pull/6256#discussion_r979374427
########## rfc/rfc-51/rfc-51.md: ########## @@ -62,73 +63,79 @@ We follow the debezium output format: four columns as shown below - u: represent `update`; when `op` is `u`, both `before` and `after` don't be null; - d: represent `delete`; when `op` is `d`, `after` is always null; -Note: the illustration here ignores all the Hudi metadata columns like `_hoodie_commit_time` in `before` and `after` columns. +**Note** -## Goals +* In case of the same record having operations like insert -> delete -> insert, CDC data should be produced to reflect the exact behaviors. +* The illustration above ignores all the Hudi metadata columns like `_hoodie_commit_time` in `before` and `after` columns. -1. Support row-level CDC records generation and persistence; -2. Support both MOR and COW tables; -3. Support all the write operations; -4. Support Spark DataFrame/SQL/Streaming Query; +## Design Goals -## Implementation +1. Support row-level CDC records generation and persistence +2. Support both MOR and COW tables +3. Support all the write operations +4. Support incremental queries in CDC format across supported engines Review Comment: done -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
