[GitHub] spark issue #19250: [SPARK-12297] Table timezone correction for Timestamps

cloud-fan Wed, 08 Nov 2017 03:58:04 -0800

Github user cloud-fan commented on the issue:

    https://github.com/apache/spark/pull/19250
  
    IIUC, using the `parquet.timezone-adjustment` table property requires 
changing the writer. e.g. Impala creates a table and Hive wants to write data 
to it, then Hive needs to write timezone-agnostic timestamp instead of UTC 
timestamp. And vice versa.
    
    I think a better solution is the Parquet-specific approach. Impala keeps 
writing timezone-agnostic timestamp, and Spark/Hive keep writing UTC timestamp. 
At the read path, we should detect who wrote the parquet file and do timestamp 
adjustment based on this information(can also have a config to protect this 
behavior).
    
    cc @rxin



---

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

[GitHub] spark issue #19250: [SPARK-12297] Table timezone correction for Timestamps

Reply via email to