ertanden commented on issue #9130:
URL: https://github.com/apache/hudi/issues/9130#issuecomment-1623086920

   @voonhous thanks for the information, very helpful. Indeed, it would be nice 
to have this documented. I think the issue can be closed once the documentation 
is in place.
   
   On the other hand, I thought that this should be the default behavior. The 
extra configuration needed seems a bit sketchy. For example, it is recommended 
to have a sortable key, etc., for clustering to work better, but then to prevent 
constant file rewrites we need to disable the sort columns? 
   
   Sorry, I may not know enough about the internals of how clustering 
works; I am just trying to express what makes sense. I just feel that this 
default behavior is not optimized, or is even buggy.
   
   
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]
