[GitHub] [iceberg] wypoon opened a new pull request #1508: Use schema at the time of the snapshot when reading a snapshot.

GitBox Thu, 24 Sep 2020 17:54:09 -0700


wypoon opened a new pull request #1508:
URL: https://github.com/apache/iceberg/pull/1508



   Iceberg uses the current schema of a table when reading an older snapshot of 
the table.
   This change makes Iceberg use the schema of the table at the time of the 
snapshot instead.
   To enable this, we add two new methods to the `Table` interface, 
`schemaForSnapshot` and `schemaForSnapshotAsOfTime`; the former takes the 
snapshot id and the latter takes a timestamp (millis since the epoch).
   For Spark 2, we change `Reader` to use the appropriate schema if the 
snapshot-id or as-of-timestamp options are used.
   For Spark 3, we change `SparkTable` as well as `SparkScanBuilder` and 
`SparkBatchScan` to use the appropriate schema.
   We add two new unit tests that test the new behavior. They would fail 
without the above changes.


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org
For additional commands, e-mail: issues-h...@iceberg.apache.org

[GitHub] [iceberg] wypoon opened a new pull request #1508: Use schema at the time of the snapshot when reading a snapshot.

Reply via email to