Xingchao, Zhang created SPARK-40868:
---------------------------------------
Summary: Avoid introducing too many partitions when bucketed scan
disabled by sql planner
Key: SPARK-40868
URL: https://issues.apache.org/jira/browse/SPARK-40868
Project: Spark
Issue Type: Improvement
Components: SQL
Affects Versions: 3.4.0
Reporter: Xingchao, Zhang
For bucket tables with huge size, lots of partitions maybe generated if
bucketed scan disabled by sql planner. We can add one limit(default as
BUCKETING_MAX_BUCKETS) to reduce the partitions for non-bucketed scan to avoid
too many tasks introduced in single stage.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]