[GitHub] spark issue #22255: [SPARK-25102][Spark Core] Write Spark version informatio...

dongjoon-hyun Fri, 02 Nov 2018 00:16:43 -0700

Github user dongjoon-hyun commented on the issue:

    https://github.com/apache/spark/pull/22255
  
    Currently, we put the metadata like the following.
    ```
    file:        
file:/tmp/p/part-00005-dbb9a9ab-0d6a-49df-9f39-397c8505f22b-c000.snappy.parquet
    creator:     parquet-mr version 1.10.0 (build 
031a6654009e3b82020012a18434c582bd74c73a)
    extra:       org.apache.spark.sql.parquet.row.metadata = 
{"type":"struct","fields":[{"name":"id","type":"long","nullable":false,"metadata":{}}]}
    ```
    
    For the hive table, it looks like the following. So, I prefer to add 
`spark.sql.create.version=2.4.0` to `key_value_metadata`. I'll make a PR in 
this way.
    ```
    
parameters:{spark.sql.sources.schema.part.0={"type":"struct","fields":[{"name":"a","type":"integer","nullable":true,"metadata":{}}]},
 transient_lastDdlTime=1541142761, spark.sql.sources.schema.numParts=1, 
spark.sql.create.version=2.4.0}
    ```



---

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

[GitHub] spark issue #22255: [SPARK-25102][Spark Core] Write Spark version informatio...

Reply via email to