YuweiXiao commented on issue #5107: URL: https://github.com/apache/hudi/issues/5107#issuecomment-1078579124
@qjqqyy Tagging also includes the indexing costs. Could you check again and make sure you clean up the table between tests (or use INSERT rather than UPSERT)? In my previous experiments, I did the same change but the cost still remained. Because even we initialize `AvroSerializer ` once, the internal lambda function will be call for each row, and the local variables used in the lambda will be initialized for each row. @danny0405 Let me try this. I did meet some NonSerializable Exception in my previous experiments, so I guess it may not be easy to directly copy it. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
