[
https://issues.apache.org/jira/browse/SPARK-47294?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
angerszhu updated SPARK-47294:
------------------------------
Description:
Current OptimizeSkewInRebalanceRepartitions only support match
ShuffleQueryStageExec, this case only support SQL query, can't work when insert
since there have a project between ShuffleQueryStageExec and insert command
{code:java}
plan transformUp {
case p @ ProjectExec(_, stage: ShuffleQueryStageExec) if
isSupported(stage.shuffle) =>
p.copy(child = tryOptimizeSkewedPartitions(stage))
case stage: ShuffleQueryStageExec if isSupported(stage.shuffle) =>
tryOptimizeSkewedPartitions(stage)
} {code}
> OptimizeSkewInRebalanceRepartitions should support
> ProjectExec(_,ShuffleQueryStageExec)
> ---------------------------------------------------------------------------------------
>
> Key: SPARK-47294
> URL: https://issues.apache.org/jira/browse/SPARK-47294
> Project: Spark
> Issue Type: Bug
> Components: SQL
> Affects Versions: 4.0.0, 3.5.1
> Reporter: angerszhu
> Priority: Major
>
> Current OptimizeSkewInRebalanceRepartitions only support match
> ShuffleQueryStageExec, this case only support SQL query, can't work when
> insert since there have a project between ShuffleQueryStageExec and insert
> command
> {code:java}
> plan transformUp {
> case p @ ProjectExec(_, stage: ShuffleQueryStageExec) if
> isSupported(stage.shuffle) =>
> p.copy(child = tryOptimizeSkewedPartitions(stage))
> case stage: ShuffleQueryStageExec if isSupported(stage.shuffle) =>
> tryOptimizeSkewedPartitions(stage)
> } {code}
--
This message was sent by Atlassian Jira
(v8.20.10#820010)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]