GitHub user dongjoon-hyun opened a pull request:

    https://github.com/apache/spark/pull/20634

    [SPARK-23456][SPARK-21783] Turn on `native` ORC impl and PPD by default

    ## What changes were proposed in this pull request?
    
    Apache Spark 2.3 introduced `native` ORC support with vectorization and 
many fixes. However, it shipped as a non-default option. This PR enables the 
`native` ORC implementation and predicate pushdown by default for Apache Spark 
2.4. We will improve and stabilize the ORC data source before Apache Spark 2.4. 
Eventually, Apache Spark will drop the old Hive-based ORC code.
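
For readers who want to pin the ORC behavior explicitly rather than rely on 
the default, here is a minimal sketch. It assumes the two settings involved 
are `spark.sql.orc.impl` (`native` or `hive`) and 
`spark.sql.orc.filterPushdown` (`true` or `false`); this message does not name 
the keys. The helper functions are hypothetical and are not part of this pull 
request.

```python
# Hypothetical helpers; not part of this pull request.
# Assumption: these are the Spark SQL keys behind the change described above.
ORC_IMPL_KEY = "spark.sql.orc.impl"
ORC_PPD_KEY = "spark.sql.orc.filterPushdown"


def orc_conf(use_native=True, pushdown=True):
    """Return the ORC settings as a dict of string keys and string values."""
    return {
        ORC_IMPL_KEY: "native" if use_native else "hive",
        ORC_PPD_KEY: "true" if pushdown else "false",
    }


def to_submit_args(conf):
    """Render the settings as spark-submit `--conf key=value` arguments."""
    args = []
    for key in sorted(conf):
        args += ["--conf", f"{key}={conf[key]}"]
    return args


if __name__ == "__main__":
    # Opt back out: Hive-based ORC reader with predicate pushdown disabled.
    print(" ".join(to_submit_args(orc_conf(use_native=False, pushdown=False))))
```

Passing the rendered arguments to `spark-submit` would be one way to restore 
the Hive-based reader for a single job while the `native` path stabilizes.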
    
    ## How was this patch tested?
    
    Pass Jenkins with the existing tests.

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/dongjoon-hyun/spark SPARK-23456

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/spark/pull/20634.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #20634
    
----
commit bde681816bc8251010aa14fd0f4fb27b732fd061
Author: Dongjoon Hyun <dongjoon@...>
Date:   2018-02-17T16:56:31Z

    [SPARK-23456][SPARK-21783] Turn on `native` ORC impl and PPD by default

----

