[
https://issues.apache.org/jira/browse/SPARK-51500?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Rakesh Kumar updated SPARK-51500:
---------------------------------
Description: (was: Spark at Meta uses DataSource V2 for warehouse
integration, and certain queries are experiencing performance degradation
because AQE fails to reuse stages.
The underlying problem lies in the implementation of the BatchScanExec
operator. Specifically, the canonicalization and equals/hashCode functions of
this operator are inconsistent, so the canonicalized BatchScanExec ends up
semantically equal to its non-canonicalized counterpart. As a result, AQE's
stageCache uses the non-canonicalized plan for lookup and fails to find an
existing stage even when a reusable one exists.
)
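
The failure mode described above can be illustrated with a small toy model.
This is only a sketch under stated assumptions, not Spark code: ToyFilter,
ToyScan, canonicalize, and plan_stages are hypothetical names, and the
per-instance expr_id on a filter is an assumed stand-in for whatever detail the
real canonicalization fails to normalize. It shows that a cache keyed by a
"canonical" plan only yields reuse when the key really is canonical.

```python
from dataclasses import dataclass
from typing import Dict, List, Tuple


@dataclass(frozen=True)
class ToyFilter:
    """Hypothetical runtime filter; expr_id differs per plan instance."""
    column: str
    expr_id: int


@dataclass(frozen=True)
class ToyScan:
    """Hypothetical stand-in for a scan operator (not BatchScanExec)."""
    table: str
    runtime_filters: Tuple[ToyFilter, ...]


def canonicalize(scan: ToyScan, normalize_filters: bool) -> ToyScan:
    """Return the cache key for a scan.

    With normalize_filters=False the "canonical" form is identical to the
    original plan, which models a lookup done with a non-canonicalized plan.
    """
    if not normalize_filters:
        return scan
    normalized = tuple(ToyFilter(f.column, 0) for f in scan.runtime_filters)
    return ToyScan(scan.table, normalized)


def plan_stages(scans: List[ToyScan], normalize_filters: bool) -> Tuple[int, int]:
    """Return (stages_created, stages_reused) for a toy stage cache."""
    stage_cache: Dict[ToyScan, str] = {}
    created = 0
    reused = 0
    for scan in scans:
        key = canonicalize(scan, normalize_filters)
        if key in stage_cache:
            reused += 1
        else:
            stage_cache[key] = f"stage-{len(stage_cache)}"
            created += 1
    return created, reused


# Two scans that are semantically the same and differ only in an instance id.
scans = [
    ToyScan("sales", (ToyFilter("part", 11),)),
    ToyScan("sales", (ToyFilter("part", 42),)),
]

print(plan_stages(scans, normalize_filters=False))  # no reuse: two stages
print(plan_stages(scans, normalize_filters=True))   # second scan reuses the first
```

In this model the only difference between the two runs is whether the cache
key is actually normalized; the equality and hash functions themselves are
unchanged, which is why the key and those functions have to agree.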
> AQE does not reuse exchange/stage when the stage has BatchScanExec operator
> with runtime filters
> ------------------------------------------------------------------------------------------------
>
> Key: SPARK-51500
> URL: https://issues.apache.org/jira/browse/SPARK-51500
> Project: Spark
> Issue Type: Bug
> Components: SQL
> Affects Versions: 3.4.1, 3.5.1, 3.4.4
> Reporter: Rakesh Kumar
> Priority: Major
>