liuzqt opened a new pull request, #48486: URL: https://github.com/apache/spark/pull/48486
### What changes were proposed in this pull request?

Re-cache the AQE plan upon failure.

### Why are the changes needed?

When we use a cached AQE plan, it calls `cachedPlan.execute` to build the RDD, which executes all AQE stages except the result stage. If any of those stages fails, the failure is cached by the lazy RDD val. The next time the cached plan is reused (even by a totally unrelated caller), it fails immediately. We need to re-cache the AQE plan upon failure.

### Does this PR introduce _any_ user-facing change?

No

### How was this patch tested?

New UT.

### Was this patch authored or co-authored using generative AI tooling?

No
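
The failure mode described under "Why are the changes needed?" can be pictured with a toy model. This is only a sketch in plain Python, not Spark code: `ToyCachedPlan`, `ToyCache`, and `flaky_stages` are hypothetical names invented for illustration, and the point at which the actual patch rebuilds the cached plan is not stated in this description.

```python
from typing import Callable


class ToyCachedPlan:
    """Hypothetical stand-in for a cached plan; not a Spark class."""

    def __init__(self, run_stages: Callable[[], list]):
        self._run_stages = run_stages
        self._outcome = None  # memoized ("ok", rows) or ("err", exception)

    def execute(self) -> list:
        # Memoize the first outcome, success or failure, so a failure
        # is replayed to every later caller without re-running the stages.
        if self._outcome is None:
            try:
                self._outcome = ("ok", self._run_stages())
            except Exception as exc:
                self._outcome = ("err", exc)
        kind, value = self._outcome
        if kind == "err":
            raise value
        return value


class ToyCache:
    """Hypothetical cache that rebuilds its plan after a failed execute."""

    def __init__(self, build_plan: Callable[[], ToyCachedPlan]):
        self._build_plan = build_plan
        self._plan = build_plan()

    def execute(self) -> list:
        try:
            return self._plan.execute()
        except Exception:
            # Re-cache: replace the poisoned plan so the next caller
            # starts from a fresh one, then surface the original error.
            self._plan = self._build_plan()
            raise


# Assumed workload: the stages fail on the first run and succeed afterwards.
attempts = {"n": 0}


def flaky_stages() -> list:
    attempts["n"] += 1
    if attempts["n"] == 1:
        raise RuntimeError("stage failed")
    return [1, 2, 3]
```

With `ToyCachedPlan` alone, a second call to `execute` raises the same error without running the stages again, which mirrors the "fails immediately" behavior. Wrapping it in `ToyCache` lets the second caller succeed, because the failed plan has been replaced.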
