Timothy Brown created HUDI-8673:
-----------------------------------

             Summary: Update Simple and Global-Simple Index parallelism to 
default to input files
                 Key: HUDI-8673
                 URL: https://issues.apache.org/jira/browse/HUDI-8673
             Project: Apache Hudi
          Issue Type: Improvement
            Reporter: Timothy Brown


The default parallelism while using the Simple and Global-Simple index is equal 
to the number of input partitions for the incoming data. This means that a 
small update against a large table may be throttled to a single spark partition 
with the current defaults. Instead, we can make the default based off of the 
data that needs to be read.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to