[GitHub] spark pull request #20484: [SPARK-23313][DOC] Add a migration guide for ORC

dongjoon-hyun Mon, 12 Feb 2018 14:41:25 -0800

Github user dongjoon-hyun commented on a diff in the pull request:

    https://github.com/apache/spark/pull/20484#discussion_r167710001
  
    --- Diff: docs/sql-programming-guide.md ---
    @@ -1776,6 +1776,35 @@ working with timestamps in `pandas_udf`s to get the best performance, see
     
     ## Upgrading From Spark SQL 2.2 to 2.3
     
    +  - Since Spark 2.3, Spark supports a vectorized ORC reader with a new ORC file format for ORC files. To do that, the following configurations are newly added or change their default values. For ORC tables, the vectorized reader will be used for the tables created by `USING ORC`. With `spark.sql.hive.convertMetastoreOrc=true`, it will for the tables created by `USING HIVE OPTIONS (fileFormat 'ORC')`, too.
    --- End diff --
    
    Thanks!
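
For readers following the thread, the rule stated in the quoted diff line can be sketched as a small decision function. This is only an illustration of the wording under review, not Spark code: the function name `uses_vectorized_orc_reader` and its parameters are hypothetical, and the sketch deliberately models nothing beyond the two cases named in the diff (other settings that may also affect reader selection are out of scope here).

```python
def uses_vectorized_orc_reader(created_with: str, convert_metastore_orc: bool) -> bool:
    """Model the rule from the quoted migration-guide text (hypothetical helper).

    created_with: the clause used at table creation, e.g. "USING ORC".
    convert_metastore_orc: value of spark.sql.hive.convertMetastoreOrc.
    """
    # Normalize case and whitespace so equivalent spellings compare equal.
    clause = " ".join(created_with.upper().split())

    # Tables created by `USING ORC` use the vectorized reader.
    if clause == "USING ORC":
        return True

    # Tables created by `USING HIVE OPTIONS (fileFormat 'ORC')` use it
    # only when spark.sql.hive.convertMetastoreOrc=true.
    if clause == "USING HIVE OPTIONS (FILEFORMAT 'ORC')":
        return convert_metastore_orc

    # Any other clause is not covered by the quoted text.
    return False
```

For example, `uses_vectorized_orc_reader("USING HIVE OPTIONS (fileFormat 'ORC')", False)` evaluates to `False`, which matches the diff's statement that Hive-created ORC tables are covered only when the conversion flag is enabled.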



