Github user dongjoon-hyun commented on a diff in the pull request:
https://github.com/apache/spark/pull/20484#discussion_r167710001
--- Diff: docs/sql-programming-guide.md ---
@@ -1776,6 +1776,35 @@ working with timestamps in `pandas_udf`s to get the
best performance, see
## Upgrading From Spark SQL 2.2 to 2.3
+ - Since Spark 2.3, Spark supports a vectorized ORC reader with a new ORC
file format for ORC files. To do that, the following configurations are newly
added or change their default values. For ORC tables, the vectorized reader
will be used for the tables created by `USING ORC`. With
`spark.sql.hive.convertMetastoreOrc=true`, it will for the tables created by
`USING HIVE OPTIONS (fileFormat 'ORC')`, too.
--- End diff --
Thanks!