nsivabalan commented on issue #4027: URL: https://github.com/apache/hudi/issues/4027#issuecomment-1002678785
Hey, I tried to reproduce locally and could not. https://gist.github.com/nsivabalan/7d6ea90ebfa76f9a53abedfa562562b7 can you confirm few things: 1. is your table MOR? 2. If yes, do you have any file groups with any base files but just log files? From the code, I see hudi clustering considers only parquet file size and not the log file sizes. 3. Can you enable info logging and let us know what you see for "Adding one clustering group" -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
