GitHub user cloud-fan opened a pull request:
https://github.com/apache/spark/pull/20513
[SPARK-23312][SQL][followup] add a config to turn off vectorized cache
reader
## What changes were proposed in this pull request?
https://github.com/apache/spark/pull/20483 tried to provide a way to turn
off the new columnar cache reader, to restore the behavior in 2.2. However even
we turn off that config, the behavior is still different than 2.2.
If the output data are rows, we still enable whole stage codegen for the
scan node, which is different with 2.2, we should also fix it.
## How was this patch tested?
existing tests.
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/cloud-fan/spark cache
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/spark/pull/20513.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #20513
----
commit 8525b2c7e540991c75c8d61bfc5a8361cae78c7b
Author: Wenchen Fan <wenchen@...>
Date: 2018-02-06T04:17:03Z
followup
----
---
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]