weiming created HUDI-5315:
-----------------------------

             Summary: The record size is dynamically estimated when the table 
is first written
                 Key: HUDI-5315
                 URL: https://issues.apache.org/jira/browse/HUDI-5315
             Project: Apache Hudi
          Issue Type: Improvement
          Components: spark-sql, writer-core
            Reporter: weiming
            Assignee: weiming
             Fix For: 0.12.2


Hudi can dynamically estimate the record size, but the estimate only takes 
effect when certain conditions are met. When the user commits for the first 
time, those conditions are not met, so the default 
[hoodie.copyonwrite.record.size.estimate = 1024] is used. If the first commit 
contains a very large amount of data and the 
[hoodie.copyonwrite.record.size.estimate] setting does not match the actual 
record size, the write produces a lot of small files.
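
To illustrate why a mismatched estimate produces small files, here is a rough 
sketch of the arithmetic. It is a simplified model, not Hudi's actual code. It 
assumes the writer packs (target file size / estimated record size) records 
into each file. The 120 MB target (believed to match the default of 
hoodie.parquet.max.file.size), the 100-byte actual record size, and all 
function names are assumptions made for illustration.

```python
# Simplified model of insert sizing; NOT Hudi's real implementation.
# Assumption: the writer assigns (target file size / estimated record size)
# records to each new file.

MAX_FILE_SIZE_BYTES = 120 * 1024 * 1024  # assumed target file size (120 MB)
DEFAULT_RECORD_SIZE_ESTIMATE = 1024      # default estimate quoted in this issue


def records_per_file(max_file_size: int, estimated_record_size: int) -> int:
    """Number of records the writer would place in one file."""
    return max_file_size // estimated_record_size


def resulting_file_size(max_file_size: int,
                        estimated_record_size: int,
                        actual_record_size: int) -> int:
    """Approximate on-disk size of one file given the real record size."""
    return records_per_file(max_file_size, estimated_record_size) * actual_record_size


def file_count(total_records: int,
               max_file_size: int,
               estimated_record_size: int) -> int:
    """Number of files needed for total_records (ceiling division)."""
    per_file = records_per_file(max_file_size, estimated_record_size)
    return -(-total_records // per_file)


if __name__ == "__main__":
    actual = 100          # hypothetical real record size in bytes
    total = 10 ** 9       # hypothetical size of the first commit, in records

    for estimate in (DEFAULT_RECORD_SIZE_ESTIMATE, actual):
        size = resulting_file_size(MAX_FILE_SIZE_BYTES, estimate, actual)
        count = file_count(total, MAX_FILE_SIZE_BYTES, estimate)
        print(f"estimate={estimate}B -> ~{size / 1024 / 1024:.1f} MB per file, "
              f"{count} files")
```

Under these assumptions, the 1024-byte default yields files roughly one tenth 
of the target size and about ten times as many of them as an estimate that 
matches the real record size.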



--
This message was sent by Atlassian Jira
(v8.20.10#820010)
