[
https://issues.apache.org/jira/browse/SPARK-51500?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Rakesh Kumar updated SPARK-51500:
---------------------------------
Summary: AQE does not reuse exchange/stage when the stage has BatchScanExec
operator with runtime filters (was: AQE does not reuse exchange/stage when the
stage has BatchScanExec operator)
> AQE does not reuse exchange/stage when the stage has BatchScanExec operator
> with runtime filters
> ------------------------------------------------------------------------------------------------
>
> Key: SPARK-51500
> URL: https://issues.apache.org/jira/browse/SPARK-51500
> Project: Spark
> Issue Type: Bug
> Components: SQL
> Affects Versions: 3.4.1, 3.5.1, 3.4.4
> Reporter: Rakesh Kumar
> Priority: Major
>
> Spark at Meta is using DatasourcesV2 for warehouse integration and certain
> queries are experiencing performance degradation. This issue arises from
> AQE's failure to reuse stages.
> The underlying problem lies in the implementation of the BatchScanExec
> operator. Specifically, the canonicalization and equality/hashcode functions
> of this operator do not reconcile, resulting in the canonicalized
> BatchScanExec becoming semantically equal to its non-canonicalized
> counterpart. This causes the AQE's stageCache to use the non-canonicalized
> plan for lookup, which fails to find existing stages even if a reusable stage
> exists.
>
--
This message was sent by Atlassian Jira
(v8.20.10#820010)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]