guanziyue commented on PR #6612: URL: https://github.com/apache/hudi/pull/6612#issuecomment-1244835989
Hi loukey-lj, this is an exciting idea. Have you tried prototyping any part of it? For example, updating a Parquet file is not actually tied to the Hudi framework, so we could write a unit test that rewrites a file directly, depending only on the Parquet API. As far as I know, a Parquet file requires a single schema shared by all of its row groups. Do we have a mechanism to handle the case where the row group written in the latest commit has an evolved or devolved schema?
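To make the schema concern concrete, here is a minimal toy sketch in plain Python. It does not use the Parquet API or any Hudi code. `ToyColumnarFile`, `append_row_group`, `SchemaMismatchError`, and `try_append` are hypothetical names that only model the assumption stated above, namely that a file carries one schema that every row group must match.

```python
from dataclasses import dataclass, field

# Ordered (column name, type name) pairs; a simplification of a real schema.
Schema = tuple[tuple[str, str], ...]


class SchemaMismatchError(ValueError):
    """Raised when a row group does not match the file-level schema."""


@dataclass
class ToyColumnarFile:
    """Hypothetical stand-in for a columnar file: one schema, many row groups."""

    schema: Schema
    row_groups: list[list[dict]] = field(default_factory=list)

    def append_row_group(self, rows: list[dict], schema: Schema) -> None:
        # Assumption: the file keeps a single schema, so every appended
        # row group must use exactly that schema.
        if schema != self.schema:
            raise SchemaMismatchError(
                f"row group schema {schema} differs from file schema {self.schema}"
            )
        self.row_groups.append(rows)


def try_append(file: ToyColumnarFile, rows: list[dict], schema: Schema) -> bool:
    """Return True if the row group was accepted, False if it was rejected."""
    try:
        file.append_row_group(rows, schema)
        return True
    except SchemaMismatchError:
        return False


base_schema: Schema = (("id", "int64"), ("name", "string"))
# An "evolved" schema: the latest commit adds a column.
evolved_schema: Schema = base_schema + (("age", "int32"),)

f = ToyColumnarFile(schema=base_schema)
same_ok = try_append(f, [{"id": 1, "name": "a"}], base_schema)
evolved_ok = try_append(f, [{"id": 2, "name": "b", "age": 30}], evolved_schema)

print(same_ok, evolved_ok, len(f.row_groups))  # prints: True False 1
```

In this toy model, the row group with the added column is rejected, which is the situation the question is about: appending to an existing file would need either a rewrite of the older row groups or some reconciliation of the new rows with the file schema.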
