[ 
https://issues.apache.org/jira/browse/HUDI-1753?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Danny Chen closed HUDI-1753.
----------------------------
    Assignee: Danny Chen

> Assigns the buckets by record key for Flink writer
> --------------------------------------------------
>
>                 Key: HUDI-1753
>                 URL: https://issues.apache.org/jira/browse/HUDI-1753
>             Project: Apache Hudi
>          Issue Type: Improvement
>          Components: Flink Integration
>            Reporter: Danny Chen
>            Assignee: Danny Chen
>            Priority: Major
>
> Currently we assign the buckets by record partition path, which could cause 
> hotspot if the partition field is datetime type. Actually we can decide the 
> buckets by grouping records with their record keys first, the assign is valid 
> only if there is no conflict (two task write to same buckets).



--
This message was sent by Atlassian Jira
(v8.20.1#820001)

Reply via email to