Hi everyone, We found a change in default behavior for the data file location in the new 0.9.0 release. The `ObjectStoreLocationProvider` was turned on by default whereas it was off and not available in previous versions. Previous versions will write to `data/` directory, whereas the new version will write to `data/<hash>/` directory. I created an issue at https://github.com/apache/iceberg-python/issues/1721 and the fix at https://github.com/apache/iceberg-python/pull/1722 I'll follow up and cut a new RC.
Thanks, Kevin Liu On Tue, Feb 25, 2025 at 1:43 PM Fokko Driesprong <fo...@apache.org> wrote: > +1 (binding) > > - Ran signatures/checksum/licenses checks > - Ran tests locally (3.10) > - Did some local testing > > Thanks for running this release Drew! > > Kind regards, > Fokko > > Op di 25 feb 2025 om 07:20 schreef Kevin Liu <kevinjq...@apache.org>: > >> +1 (non-binding) >> >> Downloaded from SVN >> * gpg: Good signature from "Drew Gallardo <img...@gmail.com>" [unknown] >> * Checksum OK >> * RAT checks passed. 1 extra ERROR line but is unrelated >> ``` >> ERROR: Ignored 0 lines in your exclusion files as comments or empty lines. >> ``` >> * Both unit and integration test passed with Python 3.12.8 on M1 and >> Orbstack >> >> BTW, email for RC1 was not sent because we found an issue right away. >> More details can be found in this devlist thread, >> https://lists.apache.org/thread/rr8lcf96jl6079dz6vfkwr5spbvlxzpm >> >> Thanks for driving the release, Drew! >> >> Best, >> Kevin Liu >> >> >> On Mon, Feb 24, 2025 at 1:36 PM Drew <img...@gmail.com> wrote: >> >>> Hi Everyone, >>> >>> I propose that we release the following RC as the official PyIceberg >>> 0.9.0 release. >>> >>> A summary of the high level features: >>> >>> - 235 new commits >>> >>> *High Level Features:* >>> >>> - Implemented support for Alibaba OSS protocol in PyArrowFileIO >>> - Enabled Dynamic Overwrite capability >>> - Introduced V3 read support >>> - Added support for Polars DataFrame and LazyFrame >>> - Enabled column projection for efficient data access >>> - Added table statistics >>> - Implemented table UPSERT support with improved deduplication logic >>> - Added support for Location Providers >>> - Extended support for write paths with Location Providers >>> (write.data.path, write.metadata.path) >>> - Removed table name parsing from row_filter >>> - Cross region S3 read support >>> >>> *Bug Fixes:* >>> >>> - Resolved issues reading from multiple S3 regions >>> - Corrected URL encoding of partition field names >>> - Fixed a TypeError issue with composite keys >>> - Ensured proper handling of missing Parquet metadata >>> >>> *Documentation & Infrastructure:* >>> >>> - Improved contribution guidelines >>> - Automated CI with nightly builds and release candidate processes >>> - Automate building Release Candidate in github action >>> >>> The commit ID is 5cfb52710d84a48426df086e2154ee261da0c666 >>> >>> - This corresponds to the tag: pyiceberg-0.9.0rc2 >>> (2bf90ec5081c358630388d71deb1db1b0381f849) >>> - >>> https://github.com/apache/iceberg-python/releases/tag/pyiceberg-0.9.0rc2 >>> - >>> >>> https://github.com/apache/iceberg-python/tree/5cfb52710d84a48426df086e2154ee261da0c666 >>> >>> >>> The release tarball, signature, and checksums are here: >>> >>> - https://dist.apache.org/repos/dist/dev/iceberg/pyiceberg-0.9.0rc2/ >>> >>> >>> You can find the KEYS file here: >>> >>> - https://downloads.apache.org/iceberg/KEYS >>> >>> >>> Convenience binary artifacts are staged on pypi: >>> >>> - https://pypi.org/project/pyiceberg/0.9.0rc2/ >>> >>> And can be installed using: pip3 install pyiceberg==0.9.0rc2 >>> >>> >>> Instructions for verifying a release can be found here: >>> >>> - https://py.iceberg.apache.org/verify-release/ >>> >>> >>> Please download, verify, and test. >>> >>> Please vote in the next 72 hours. >>> [ ] +1 Release this as PyIceberg 0.9.0 >>> [ ] +0 >>> [ ] -1 Do not release this because... >>> >>