gustavoatt opened a new pull request #1184:
URL: https://github.com/apache/iceberg/pull/1184


   ## Summary
   
   Add read support for Parquet INT96 timestamps (fixes 
https://github.com/apache/iceberg/issues/1138). This is needed so that parquet 
files written by Spark, that used INT96 timestamps, are able to be read by 
Iceberg without having to rewrite these files. This is specially useful for 
migrations.
   
   https://github.com/apache/parquet-format/pull/49 has more information about 
how parquet int96 timestamps are stored. Note that I only implemented read 
support since this representation has many issues (as visible in the 
conversation in the `parquet-format` PR).
   
   ## Testing
   
   - [x] Added unit test for spark readers
   - [ ] Unsure about what is the best place to add unit-tests for the 
non-spark parquet readers. Would gladly add one.
   
   


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
[email protected]



---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to