GitHub user maryannxue opened a pull request:
https://github.com/apache/spark/pull/21602
[SPARK-24613][SQL] Cache with UDF could not be matched with subsequent
dependent caches
## What changes were proposed in this pull request?
Wrap the logical plan with a `AnalysisBarrier` for execution plan
compilation in CacheManager, in order to avoid the plan being analyzed again.
## How was this patch tested?
Add one test in `DatasetCacheSuite`
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/maryannxue/spark cache-mismatch
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/spark/pull/21602.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #21602
----
commit 4a5c388693159c22e69c29da32a4c0c766d30249
Author: Maryann Xue <maryannxue@...>
Date: 2018-06-20T23:18:50Z
[SPARK-24613] Cache with UDF could not be matched with subsequent dependent
caches
----
---
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]