[I] Support Write-on-compaction mode when query cdc on MOR tables [hudi]

via GitHub Sat, 29 Nov 2025 21:47:08 -0800


hudi-bot opened a new issue, #15384:
URL: https://github.com/apache/hudi/issues/15384


   For CDC queries on MOR tables, the initial implementation uses the 
`Write-on-indexing` approach: it extracts the CDC data by merging the base file 
and log files on the fly.
   
   This ticket proposes supporting the `Write-on-compaction` approach, which 
gets the CDC data simply by reading the persisted CDC files that are written 
during the compaction operation.
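
The difference between the two approaches can be illustrated with a small toy model. This is only a sketch under stated assumptions, not Hudi's actual code: the names `derive_changes`, `query_cdc_on_the_fly`, `ToyCompactor`, and `query_cdc_persisted` are hypothetical, records are plain dicts keyed by record key, and the operation labels `i`/`u`/`d` are used purely for illustration.

```python
from typing import Dict, List, Optional, Tuple

Record = Dict[str, object]
# One log entry: (record key, new value), where None means "delete this key".
LogEntry = Tuple[str, Optional[Record]]
# One CDC entry: (operation, record key, before image, after image).
Change = Tuple[str, str, Optional[Record], Optional[Record]]


def derive_changes(
    base: Dict[str, Record], log: List[LogEntry]
) -> Tuple[Dict[str, Record], List[Change]]:
    """Apply log entries to the base snapshot and record each change."""
    merged = dict(base)
    changes: List[Change] = []
    for key, after in log:
        before = merged.get(key)
        if after is None:
            # Delete: only meaningful if the key currently exists.
            if before is not None:
                changes.append(("d", key, before, None))
                del merged[key]
        elif before is None:
            changes.append(("i", key, None, after))
            merged[key] = after
        else:
            changes.append(("u", key, before, after))
            merged[key] = after
    return merged, changes


def query_cdc_on_the_fly(base: Dict[str, Record], log: List[LogEntry]) -> List[Change]:
    """Toy reader that merges base and log at query time, on every query."""
    _, changes = derive_changes(base, log)
    return changes


class ToyCompactor:
    """Toy compactor that persists CDC entries as a side effect of compaction."""

    def __init__(self) -> None:
        self.cdc_file: List[Change] = []  # stands in for persisted CDC files

    def compact(self, base: Dict[str, Record], log: List[LogEntry]) -> Dict[str, Record]:
        merged, changes = derive_changes(base, log)
        self.cdc_file.extend(changes)
        return merged


def query_cdc_persisted(compactor: ToyCompactor) -> List[Change]:
    """Toy reader that only reads what compaction already wrote; no merge."""
    return list(compactor.cdc_file)
```

In this model both readers return the same changes; what differs is where the merge cost is paid (at query time versus once at compaction) and that the second reader sees changes only after a compaction has run.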
   
   ## JIRA info
   
   - Link: https://issues.apache.org/jira/browse/HUDI-4705
   - Type: New Feature
   
   
   ---
   
   
   ## Comments
   
   24/May/23 06:38;lizhiqiang;Does this Jira have plans for the next step?
   
   [[email protected]] ;;;
   
   ---
   
   07/Jun/24 16:41;xushiyan;[~lizhiqiang] [[email protected]] To clarify, 
CDC for Spark does work on MOR tables; the current implementation just uses the 
write-on-indexing strategy (ref: 
[https://github.com/apache/hudi/blob/master/rfc/rfc-51/rfc-51.md#persisting-cdc-in-mor-write-on-indexing-vs-write-on-compaction]).
   
   We want to unify the implementation on write-on-compaction, which would 
also allow the Flink writer to work. (The write-on-indexing strategy does not 
work for Flink, as explained in the RFC.);;;
   
   ---
   
   14/Nov/24 02:35;wangchao316;Hi, is writing CDC with Flink on MOR tables 
not supported?;;;


