GitHub user gengliangwang opened a pull request:
https://github.com/apache/spark/pull/20933
[SPARK-23817][SQL]Migrate ORC file format read path to data source V2
## What changes were proposed in this pull request?
Migrate ORC file format read path to data source V2.
### Supports:
1. Scan ColumnarBatch
2. Scan UnsafeRow
3. Push down filters
4. Push down required columns
### Not supported( due to limitation of data source V2):
1. Read multiple file path
2. Read bucketed file.
## How was this patch tested?
unit test
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/gengliangwang/spark orcV2
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/spark/pull/20933.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #20933
----
commit 40b33c30406e00923744c30668056b33c47a2ac1
Author: Wang Gengliang <gengliang.wang@...>
Date: 2018-03-29T08:33:23Z
orc v2
----
---
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]