Github user cloud-fan commented on a diff in the pull request:
https://github.com/apache/spark/pull/19470#discussion_r144457643
--- Diff:
sql/hive/src/main/scala/org/apache/spark/sql/hive/orc/OrcFileFormat.scala ---
@@ -138,8 +138,7 @@ class OrcFileFormat extends FileFormat with
DataSourceRegister with Serializable
if (maybePhysicalSchema.isEmpty) {
Iterator.empty
} else {
- val physicalSchema = maybePhysicalSchema.get
- OrcRelation.setRequiredColumns(conf, physicalSchema,
requiredSchema)
+ OrcRelation.setRequiredColumns(conf, dataSchema, requiredSchema)
--- End diff --
oh i see, we only need to pass the required column indices to orc reader.
---
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]