[GitHub] spark pull request #19943: [SPARK-16060][SQL] Support Vectorized ORC Reader

dongjoon-hyun Tue, 12 Dec 2017 11:41:17 -0800

Github user dongjoon-hyun commented on a diff in the pull request:

    https://github.com/apache/spark/pull/19943#discussion_r156474306
  
    --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/orc/OrcFileFormat.scala ---
    @@ -139,15 +146,25 @@ class OrcFileFormat
           }
         }
     
    +    val resultSchema = StructType(requiredSchema.fields ++ partitionSchema.fields)
    +    val enableVectorizedReader = sparkSession.sessionState.conf.orcVectorizedReaderEnabled &&
    +      supportBatch(sparkSession, resultSchema)
    --- End diff --
    
    Hmm, it seems that would complicate the implementation just to provide a slower version.
    If you don't mind, I'll handle that in a follow-up PR.



---
