XiDuo You created SPARK-35961:
---------------------------------
Summary: Only use local shuffle reader for
REBALANCE_PARTITIONS_BY_NONE without CustomShuffleReaderExec
Key: SPARK-35961
URL: https://issues.apache.org/jira/browse/SPARK-35961
Project: Spark
Issue Type: Improvement
Components: SQL
Affects Versions: 3.2.0
Reporter: XiDuo You
After [SPARK-35725](https://issues.apache.org/jira/browse/SPARK-35725), we
might expand partition if that partition is skewed. So the partition number
check `bytesByPartitionId.length == partitionSpecs.size` would be wrong if some
partitions are coalesced and some partitions
are splitted into smaller (it's unlikely in real world since it used
RoundRobin).
Otherhand, after
[SPARK-34899](https://issues.apache.org/jira/browse/SPARK-34899), we use origin
plan if can not coalesce partitions. So the assuming of that shuffle stage has
`CustomShuffleReaderExec` with no effect is always false. That said, if no rule
is efficient, there would be no `CustomShuffleReaderExec`.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]