codope opened a new pull request #3875:
URL: https://github.com/apache/hudi/pull/3875
## What is the purpose of the pull request
To support better out-of-the-box clustering, the clustering configs need
better default values. Most of them already have a good default; this PR
changes the default for two configs:
- Clustering plan strategy class: The previous default made assumptions
about the partition path, for example that the table is partitioned by day.
The new default is purely file-size based.
- Enable commit metadata preservation: Previously, it was disabled by
default. Enabling it by default is useful for incremental query support
when there are replace commits in the timeline.
On ease of use:
Users now only need to enable clustering. That's it! It is still advisable
to set the sort columns to cluster by, so that the default clustering
execution strategy can make use of them.
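
As a rough illustration of the "enable clustering, optionally set sort columns" flow, here is a minimal sketch of the writer options a user might pass. The helper `clustering_options` is hypothetical and not part of this PR. The keys `hoodie.clustering.inline` and `hoodie.clustering.plan.strategy.sort.columns` are assumed from the Hudi configuration reference, and inline clustering is only one way to enable clustering, so please verify both against the Hudi version in use.

```python
def clustering_options(sort_columns=None):
    """Hypothetical helper: build Hudi writer options that turn on clustering.

    Only the enable flag is set explicitly; the plan strategy class and
    commit metadata preservation are left at their defaults.
    """
    # Assumed key: enables inline clustering after a write.
    opts = {"hoodie.clustering.inline": "true"}
    if sort_columns:
        # Assumed key: comma-separated columns to sort by when clustering.
        opts["hoodie.clustering.plan.strategy.sort.columns"] = ",".join(sort_columns)
    return opts


# Example with made-up column names.
options = clustering_options(["city", "event_ts"])
```

The resulting dictionary could then be passed as options to a Hudi write, alongside the usual table name and record key settings.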
## Committer checklist
- [ ] Has a corresponding JIRA in PR title & commit
- [ ] Commit message is descriptive of the change
- [ ] CI is green
- [ ] Necessary doc changes done or have another open PR
- [ ] For large changes, please consider breaking it into sub-tasks under
an umbrella JIRA.