Danny Chen created HUDI-1753:
--------------------------------

             Summary: Assigns the buckets by record key for Flink writer
                 Key: HUDI-1753
                 URL: https://issues.apache.org/jira/browse/HUDI-1753
             Project: Apache Hudi
          Issue Type: Improvement
          Components: Flink Integration
            Reporter: Danny Chen
             Fix For: 0.9.0


Currently we assign the buckets by record partition path, which could cause 
hotspot if the partition field is datetime type. Actually we can decide the 
buckets by grouping records with their record keys first, the assign is valid 
only if there is no conflict (two task write to same buckets).



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to