[ 
https://issues.apache.org/jira/browse/SPARK-51500?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Rakesh Kumar updated SPARK-51500:
---------------------------------
    Summary: AQE does not reuse exchange/stage when the stage has BatchScanExec 
operator with runtime filters  (was: AQE does not reuse exchange/stage when the 
stage has BatchScanExec operator)

> AQE does not reuse exchange/stage when the stage has BatchScanExec operator 
> with runtime filters
> ------------------------------------------------------------------------------------------------
>
>                 Key: SPARK-51500
>                 URL: https://issues.apache.org/jira/browse/SPARK-51500
>             Project: Spark
>          Issue Type: Bug
>          Components: SQL
>    Affects Versions: 3.4.1, 3.5.1, 3.4.4
>            Reporter: Rakesh Kumar
>            Priority: Major
>
> Spark at Meta is using DatasourcesV2 for warehouse integration and certain 
> queries are experiencing performance degradation. This issue arises from 
> AQE's failure to reuse stages.
> The underlying problem lies in the implementation of the BatchScanExec 
> operator. Specifically, the canonicalization and equality/hashcode functions 
> of this operator do not reconcile, resulting in the canonicalized 
> BatchScanExec becoming semantically equal to its non-canonicalized 
> counterpart. This causes the AQE's stageCache to use the non-canonicalized 
> plan for lookup, which fails to find existing stages even if a reusable stage 
> exists. 
>  



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to