[
https://issues.apache.org/jira/browse/HUDI-1757?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Danny Chen updated HUDI-1757:
-----------------------------
Description: Currently we assign the buckets by record partition path
which could cause hotspot if the partition field is datetime type. Changes to
assign buckets by grouping the record whth their key first, the assignment is
valid if only there is no conflict(two task write to the same bucket).
> Assigns the buckets by record key for Flink writer
> --------------------------------------------------
>
> Key: HUDI-1757
> URL: https://issues.apache.org/jira/browse/HUDI-1757
> Project: Apache Hudi
> Issue Type: Improvement
> Components: Flink Integration
> Reporter: Danny Chen
> Assignee: Danny Chen
> Priority: Major
>
> Currently we assign the buckets by record partition path which could cause
> hotspot if the partition field is datetime type. Changes to assign buckets by
> grouping the record whth their key first, the assignment is valid if only
> there is no conflict(two task write to the same bucket).
--
This message was sent by Atlassian Jira
(v8.3.4#803005)