kbuci commented on PR #18958: URL: https://github.com/apache/hudi/pull/18958#issuecomment-4747532553
Thanks for feedback. For now I had created this initial PR to just focues on writing the OOL blob references (since even if read blob support in Flink were not to come for a while, I thought it would be a good step to achieve write parity with Spark in this manner) I created a ticket to track the support for materializing the bytes from OOL blobs during FlinkSQL reads https://github.com/apache/hudi/issues/19032 My initial idea was to find a way to create a `read_blob` function similar to spark, but as discussed in this thread, that would not allow us to "batch" requests. But after reading @cshuo's suggestion I realized we don't necessarily need this since users can pass in configs via DDL(for HUDI/Flink to consume) - so I'll be fully leveraging that to implement this. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
