flyrain commented on issue #7822:
URL: https://github.com/apache/iceberg/issues/7822#issuecomment-1629416208

   The cdc procedure is based on the changelog table. The changelog table 
reader reads rows with the current schema even schemas may be different across 
multiple snapshots. In that case, it either throws extra columns or return null 
for missing columns. I think the simplest way now is to check if schemas of all 
snapshots in a given range are the same, and throw if not.  We can think about 
a better way to handle different schemas, but I feel it is harder to handle 
different schema within one procedure call. Users may have to handle that 
situation by themselves. cc @aokolnychyi 


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to