geserdugarov commented on PR #12697: URL: https://github.com/apache/hudi/pull/12697#issuecomment-2639950653
@danny0405 , @cshuo , I've updated description here with added structure of `HoodieFlinkRecordTypeInfo ` and `HoodieFlinkRecordSerializer`. I've already implemented optimization for simple bucket index and non bucket case. Opened corresponding PR: https://github.com/apache/hudi/pull/12796. First version, that I could do everything without custom serializer was wrong. I've faced issues with serde during conversion of `DataStream` into `KeyedStream`. With implemented `HoodieFlinkRecordTypeInfo ` and `HoodieFlinkRecordSerializer` everything works correctly. And for non bucket case I got 31% performance improvement. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
