[GitHub] spark pull request #19769: [SPARK-12297][SQL] Adjust timezone for int96 data...

squito Mon, 20 Nov 2017 09:34:02 -0800

Github user squito commented on a diff in the pull request:

    https://github.com/apache/spark/pull/19769#discussion_r152058935
  
    --- Diff: 
sql/core/src/main/java/org/apache/spark/sql/execution/datasources/parquet/VectorizedColumnReader.java
 ---
    @@ -298,7 +304,10 @@ private void decodeDictionaryIds(
                 // TODO: Convert dictionary of Binaries to dictionary of Longs
                 if (!column.isNullAt(i)) {
                   Binary v = 
dictionary.decodeToBinary(dictionaryIds.getDictId(i));
    -              column.putLong(i, 
ParquetRowConverter.binaryToSQLTimestamp(v));
    +              long rawTime = ParquetRowConverter.binaryToSQLTimestamp(v);
    +              long adjTime =
    +                  convertTz == null ? rawTime : 
DateTimeUtils.convertTz(rawTime, convertTz, UTC);
    --- End diff --
    
    good idea.  I'll push that check further up, so UTC is replaced by null, so 
we do simpler checks inside here



---

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

[GitHub] spark pull request #19769: [SPARK-12297][SQL] Adjust timezone for int96 data...

Reply via email to