GitHub user cloud-fan opened a pull request:
https://github.com/apache/spark/pull/20394
[SPARK-23214][SQL] cached data should not carry extra hint info
## What changes were proposed in this pull request?
This is a regression introduced by
https://github.com/apache/spark/pull/19864
When we lookup cache, we should not carry the hint info, as this cache
entry might be added by a plan having hint info, while the input plan for this
lookup may not have hint info, or have different hint info.
## How was this patch tested?
a new test.
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/cloud-fan/spark cache
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/spark/pull/20394.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #20394
----
commit 87ac064db0fcb1d9ffe5c7f77069c09ba47a9f01
Author: Wenchen Fan <wenchen@...>
Date: 2018-01-25T10:07:26Z
cached data should not carry extra hint info
----
---
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]