ertanden commented on issue #9130: URL: https://github.com/apache/hudi/issues/9130#issuecomment-1623086920
@voonhous thanks for the information, very helpful. Indeed, it would be nice to have this documented. I think the issue can be closed once the documentation is in place. On the other side, I thought that this should be the default behavior. The extra configuration needed seems a bit sketchy. For example it is recommended to have a sortable key etc for clustering to work better, but then to prevent constant file re-writes we need to disable the sort columns? Sorry, I may not have enough information about the internals how clustering works, but just trying to express what makes sense... I just feel that this default behavior is not optimized or even buggy. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
