GitHub user ericl opened a pull request:
https://github.com/apache/spark/pull/15273
[SPARK-17673] [SQL] [WIP] Incorrect exchange reuse with RowDataSourceScan
## What changes were proposed in this pull request?
It seems the equality check for reuse of `RowDataSourceScanExec` nodes
doesn't respect the output schema. This can cause self-joins or unions over the
same underlying data source to return incorrect results if they select
different fields.
## How was this patch tested?
WIP...
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/ericl/spark spark-17673
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/spark/pull/15273.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #15273
----
commit c0e08b84c5d600789fbafa38692669022ae6973d
Author: Eric Liang <[email protected]>
Date: 2016-09-28T02:18:20Z
wip
----
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at [email protected] or file a JIRA ticket
with INFRA.
---
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]