wypoon opened a new pull request #1508: URL: https://github.com/apache/iceberg/pull/1508
Iceberg uses the current schema of a table when reading an older snapshot of the table. This change makes Iceberg use the schema of the table at the time of the snapshot instead. To enable this, we add two new methods to the `Table` interface, `schemaForSnapshot` and `schemaForSnapshotAsOfTime`; the former takes the snapshot id and the latter takes a timestamp (millis since the epoch). For Spark 2, we change `Reader` to use the appropriate schema if the snapshot-id or as-of-timestamp options are used. For Spark 3, we change `SparkTable` as well as `SparkScanBuilder` and `SparkBatchScan` to use the appropriate schema. We add two new unit tests that test the new behavior. They would fail without the above changes. ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: us...@infra.apache.org --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org For additional commands, e-mail: issues-h...@iceberg.apache.org