[GitHub] spark pull request #19218: [SPARK-21786][SQL] The 'spark.sql.parquet.compres...

maropu Mon, 18 Dec 2017 05:07:02 -0800

Github user maropu commented on a diff in the pull request:

    https://github.com/apache/spark/pull/19218#discussion_r157478781
  
    --- Diff: 
sql/hive/src/main/scala/org/apache/spark/sql/hive/execution/SaveAsHiveFile.scala
 ---
    @@ -86,6 +110,19 @@ private[hive] trait SaveAsHiveFile extends 
DataWritingCommand {
           options = Map.empty)
       }
     
    +  // Because compression configurations can come in a variety of ways,
    +  // we choose the compression configuration in this order:
    +  // For parquet: `compression` > `parquet.compression` > 
`spark.sql.parquet.compression.codec`
    +  // For orc: `compression` > `orc.compress` > 
`spark.sql.orc.compression.codec`
    --- End diff --
    
    Is it okay to leave this priority in the spark document or somewhere? 
https://spark.apache.org/docs/latest/sql-programming-guide.html#configuration



---

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

[GitHub] spark pull request #19218: [SPARK-21786][SQL] The 'spark.sql.parquet.compres...

Reply via email to