GitHub user dongjoon-hyun opened a pull request:
https://github.com/apache/spark/pull/20517
[SPARK-23342][SQL][TEST] Add ORC configuration tests for ORC data source
## What changes were proposed in this pull request?
This issue adds test coverage for ORC configuration with ORC names and Hive
names.
**Example:**
- orc.stripe.size, hive.exec.orc.default.stripe.size
- orc.row.index.stride, hive.exec.orc.default.row.index.stride
- orc.compress.size, hive.exec.orc.default.buffer.size
Note that these tests are added to both `OrcSourceSuite` and
`HiveOrcSourceSuite` because both Spark `native` and `hive` implementation
supports them. The followings are the related code.
- Apache ORC 1.4.1
[OrcConf.java](https://github.com/apache/orc/blob/master/java/core/src/java/org/apache/orc/OrcConf.java#L29)
- Hive 1.2.1
[OrcFile.java](https://github.com/JoshRosen/hive/blob/release-1.2.1-spark2/ql/src/java/org/apache/hadoop/hive/ql/io/orc/OrcFile.java#L151-L160)
[HiveConf.java](https://github.com/JoshRosen/hive/blob/release-1.2.1-spark2/common/src/java/org/apache/hadoop/hive/conf/HiveConf.java)
## How was this patch tested?
Pass the newly added test cases.
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/dongjoon-hyun/spark SPARK-23342
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/spark/pull/20517.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #20517
----
----
---
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]