GitHub user gengliangwang opened a pull request:

    https://github.com/apache/spark/pull/20933

    [SPARK-23817][SQL]Migrate ORC file format read path to data source V2

    ## What changes were proposed in this pull request?
    Migrate ORC file format read path to data source V2. 
    
    ### Supports:
    
    1. Scan ColumnarBatch
    2. Scan UnsafeRow
    3. Push down filters
    4. Push down required columns
    
    ### Not supported( due to limitation of data source V2):
    
    1. Read multiple file path
    2. Read bucketed file.
    
    ## How was this patch tested?
    
    unit test

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/gengliangwang/spark orcV2

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/spark/pull/20933.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #20933
    
----
commit 40b33c30406e00923744c30668056b33c47a2ac1
Author: Wang Gengliang <gengliang.wang@...>
Date:   2018-03-29T08:33:23Z

    orc v2

----


---

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to