codope opened a new pull request #3875:
URL: https://github.com/apache/hudi/pull/3875


   ## What is the purpose of the pull request
   
   In order to support better out-of-box clustering configs, we need to have 
better default values for those configs. Most of them already have a good 
default value, this PR changes defaut for two configs: 
   - Clustering plan strategy class: The previously set default value made 
certain assumptions on the partition path like it should be day-based 
partition. The new default is purely file size based. 
   - Enable commit metadata preservation: Previously, it was disabled by 
default. Enanling it by default will be useful for incremental query support 
with replace commits in the timeline.
   
   On ease of use: 
   
   Now user only need to enable clustering. That's it! Though it is advisable 
to set sort columns to cluster by to make use of the default clustering 
execution strategy. 
   
   ## Committer checklist
   
    - [ ] Has a corresponding JIRA in PR title & commit
    
    - [ ] Commit message is descriptive of the change
    
    - [ ] CI is green
   
    - [ ] Necessary doc changes done or have another open PR
          
    - [ ] For large changes, please consider breaking it into sub-tasks under 
an umbrella JIRA.
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


Reply via email to