Vinish Reddy created HUDI-8071:
----------------------------------
Summary: Handle skew for user defined sort columns in BULK_INSERT
Key: HUDI-8071
URL: https://issues.apache.org/jira/browse/HUDI-8071
Project: Apache Hudi
Issue Type: Improvement
Components: deltastreamer, writer-core
Reporter: Vinish Reddy
If the user-defined sort columns (sortKey) are skewed, the Spark sort runs
with fewer tasks, which increases contention when writing Parquet files.
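
The effect described above can be illustrated with a small simulation. This is
only a sketch in plain Python, not Hudi or Spark code: it assumes the sort
range-partitions rows on split points taken from the sorted keys and drops
duplicate split points. The helper names `range_bounds` and `partition_sizes`
are hypothetical and exist only for this example.

```python
import bisect
from collections import Counter


def range_bounds(keys, num_partitions):
    """Pick up to num_partitions - 1 distinct split points from the sorted keys."""
    ordered = sorted(keys)
    bounds = []
    for i in range(1, num_partitions):
        candidate = ordered[i * len(ordered) // num_partitions]
        # Assumption: duplicate split points are dropped, so a heavily
        # repeated key collapses several partitions into one.
        if not bounds or candidate > bounds[-1]:
            bounds.append(candidate)
    return bounds


def partition_sizes(keys, bounds):
    """Count rows per range partition; partition i holds keys <= bounds[i]."""
    counts = Counter(bisect.bisect_left(bounds, k) for k in keys)
    return [counts.get(i, 0) for i in range(len(bounds) + 1)]


# 1000 rows with distinct keys versus 1000 rows where 90% share one key.
uniform_keys = list(range(1000))
skewed_keys = [0] * 900 + list(range(1, 101))

uniform_sizes = partition_sizes(uniform_keys, range_bounds(uniform_keys, 10))
skewed_sizes = partition_sizes(skewed_keys, range_bounds(skewed_keys, 10))

print(len(uniform_sizes), max(uniform_sizes))  # 10 101
print(len(skewed_sizes), max(skewed_sizes))    # 3 900
```

With 10 partitions requested, the uniform keys fill all 10, while the skewed
keys end up in 3, one of which holds 900 of the 1000 rows. In this toy model
that single partition corresponds to the one task that would write most of the
data.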