GitHub user viirya opened a pull request:
https://github.com/apache/spark/pull/20831
[SPARK-23614][SQL] Fix incorrect reuse exchange when caching is used
## What changes were proposed in this pull request?
We should provide customized canonicalize plan for `InMemoryRelation` and
`InMemoryTableScanExec`. Otherwise, we can wrongly treat two different cached
plans as same result. It causes wrongly reused exchange then.
## How was this patch tested?
Added unit test.
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/viirya/spark-1 SPARK-23614
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/spark/pull/20831.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #20831
----
commit e1f28e2c6c9ba99c92fea339946323c9490062d4
Author: Liang-Chi Hsieh <viirya@...>
Date: 2018-03-15T06:16:22Z
Fix incorrect reuse exchange when caching is used.
----
---
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]