Timothy Brown created HUDI-8673:
-----------------------------------
Summary: Update Simple and Global-Simple Index parallelism to
default to input files
Key: HUDI-8673
URL: https://issues.apache.org/jira/browse/HUDI-8673
Project: Apache Hudi
Issue Type: Improvement
Reporter: Timothy Brown
The default parallelism while using the Simple and Global-Simple index is equal
to the number of input partitions for the incoming data. This means that a
small update against a large table may be throttled to a single spark partition
with the current defaults. Instead, we can make the default based off of the
data that needs to be read.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)