hudi-bot opened a new issue, #15384: URL: https://github.com/apache/hudi/issues/15384
For the case that query cdc on MOR tables, the initial implementation use the `Write-on-indexing` way to extract the cdc data by merging the base file and log files in-flight. This ticket wants to support the `Write-on-compaction` way to get the cdc data just by reading the persisted cdc files which are written at the compaction operation. ## JIRA info - Link: https://issues.apache.org/jira/browse/HUDI-4705 - Type: New Feature --- ## Comments 24/May/23 06:38;lizhiqiang;Does this Jira have plans for the next step? [[email protected]] ;;; --- 07/Jun/24 16:41;xushiyan;[~lizhiqiang] [[email protected]] to clarify, CDC for spark works on MOR, just that the implementation is using write-on-indexing strategy (ref: [https://github.com/apache/hudi/blob/master/rfc/rfc-51/rfc-51.md#persisting-cdc-in-mor-write-on-indexing-vs-write-on-compaction)] We want to unify the implementation as write-on-compaction, which allows flink writer to work too. (write-on-indexing strategy does not work for flink as explained in the RFC);;; --- 14/Nov/24 02:35;wangchao316;hi, flink write cdc for MOR , which is not support ?;;; -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
