Danny Chen created HUDI-1753:
--------------------------------
Summary: Assigns the buckets by record key for Flink writer
Key: HUDI-1753
URL: https://issues.apache.org/jira/browse/HUDI-1753
Project: Apache Hudi
Issue Type: Improvement
Components: Flink Integration
Reporter: Danny Chen
Fix For: 0.9.0
Currently we assign the buckets by record partition path, which could cause
hotspot if the partition field is datetime type. Actually we can decide the
buckets by grouping records with their record keys first, the assign is valid
only if there is no conflict (two task write to same buckets).
--
This message was sent by Atlassian Jira
(v8.3.4#803005)