weiming created HUDI-5315:
-----------------------------
Summary: The record size is dynamically estimated when the table
is first written
Key: HUDI-5315
URL: https://issues.apache.org/jira/browse/HUDI-5315
Project: Apache Hudi
Issue Type: Improvement
Components: spark-sql, writer-core
Reporter: weiming
Assignee: weiming
Fix For: 0.12.2
Although hudi has the function of dynamically estimating the size of the
record, but it can only take effect if certain conditions are met, when the
user commits for the first time, the default is to use
[hoodie.copyonwrite.record.size.estimate = 1024], if the amount of data for the
first commit is very large, and the user
[hoodie.copyonwrite.record.size.estimate] parameter setting is not reasonable,
it will lead to a lot of small files.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)