Hi everyone,

We found a change in the default data file location in the new
0.9.0 release. The `ObjectStoreLocationProvider` is turned on by default,
whereas it was not available in previous versions. Previous
versions wrote to the `data/` directory, whereas the new version
writes to the `data/<hash>/` directory.

I created an issue at https://github.com/apache/iceberg-python/issues/1721
and a fix at https://github.com/apache/iceberg-python/pull/1722.
I'll follow up and cut a new RC.
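
To make the layout difference above concrete, here is a rough, self-contained
sketch. It does not call PyIceberg; the function names and the SHA-256 based
hash are hypothetical stand-ins chosen for illustration, and the real
`ObjectStoreLocationProvider` may compute and format the `<hash>` component
differently.

```python
import hashlib


def simple_data_path(table_location: str, file_name: str) -> str:
    """Previous behavior: data files go directly under data/."""
    return f"{table_location.rstrip('/')}/data/{file_name}"


def hashed_data_path(table_location: str, file_name: str) -> str:
    """New 0.9.0 default: a hash component sits between data/ and the file.

    Assumption: a truncated SHA-256 hex digest stands in for <hash>;
    this is NOT the algorithm PyIceberg actually uses.
    """
    digest = hashlib.sha256(file_name.encode("utf-8")).hexdigest()[:8]
    return f"{table_location.rstrip('/')}/data/{digest}/{file_name}"


if __name__ == "__main__":
    location = "s3://bucket/warehouse/db/tbl"  # hypothetical table location
    name = "00000-0-example.parquet"  # hypothetical data file name
    print(simple_data_path(location, name))
    print(hashed_data_path(location, name))
```

Anything that assumes data files sit directly under `data/` (for example a
glob over `data/*.parquet`) would miss files written with the second layout.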

Thanks,
Kevin Liu

On Tue, Feb 25, 2025 at 1:43 PM Fokko Driesprong <fo...@apache.org> wrote:

> +1 (binding)
>
>    - Ran signatures/checksum/licenses checks
>    - Ran tests locally (3.10)
>    - Did some local testing
>
> Thanks for running this release Drew!
>
> Kind regards,
> Fokko
>
> On Tue, Feb 25, 2025 at 07:20, Kevin Liu <kevinjq...@apache.org> wrote:
>
>> +1 (non-binding)
>>
>> Downloaded from SVN
>> * gpg: Good signature from "Drew Gallardo <img...@gmail.com>" [unknown]
>> * Checksum OK
>> * RAT checks passed. There is 1 extra ERROR line, but it is unrelated:
>> ```
>> ERROR: Ignored 0 lines in your exclusion files as comments or empty lines.
>> ```
>> * Both unit and integration tests passed with Python 3.12.8 on M1 and
>> OrbStack
>>
>> BTW, email for RC1 was not sent because we found an issue right away.
>> More details can be found in this devlist thread,
>> https://lists.apache.org/thread/rr8lcf96jl6079dz6vfkwr5spbvlxzpm
>>
>> Thanks for driving the release, Drew!
>>
>> Best,
>> Kevin Liu
>>
>>
>> On Mon, Feb 24, 2025 at 1:36 PM Drew <img...@gmail.com> wrote:
>>
>>> Hi Everyone,
>>>
>>> I propose that we release the following RC as the official PyIceberg
>>> 0.9.0 release.
>>>
>>> A summary of the high level features:
>>>
>>>    - 235 new commits
>>>
>>> *High Level Features:*
>>>
>>>    - Implemented support for Alibaba OSS protocol in PyArrowFileIO
>>>    - Enabled Dynamic Overwrite capability
>>>    - Introduced V3 read support
>>>    - Added support for Polars DataFrame and LazyFrame
>>>    - Enabled column projection for efficient data access
>>>    - Added table statistics
>>>    - Implemented table UPSERT support with improved deduplication logic
>>>    - Added support for Location Providers
>>>    - Extended support for write paths with Location Providers
>>>    (write.data.path, write.metadata.path)
>>>    - Removed table name parsing from row_filter
>>>    - Cross region S3 read support
>>>
>>> *Bug Fixes:*
>>>
>>>    - Resolved issues reading from multiple S3 regions
>>>    - Corrected URL encoding of partition field names
>>>    - Fixed a TypeError issue with composite keys
>>>    - Ensured proper handling of missing Parquet metadata
>>>
>>> *Documentation & Infrastructure:*
>>>
>>>    - Improved contribution guidelines
>>>    - Automated CI with nightly builds and release candidate processes
>>>    - Automated building of release candidates in GitHub Actions
>>>
>>> The commit ID is 5cfb52710d84a48426df086e2154ee261da0c666
>>>
>>>    - This corresponds to the tag: pyiceberg-0.9.0rc2
>>>    (2bf90ec5081c358630388d71deb1db1b0381f849)
>>>    - https://github.com/apache/iceberg-python/releases/tag/pyiceberg-0.9.0rc2
>>>    - https://github.com/apache/iceberg-python/tree/5cfb52710d84a48426df086e2154ee261da0c666
>>>
>>>
>>> The release tarball, signature, and checksums are here:
>>>
>>>    - https://dist.apache.org/repos/dist/dev/iceberg/pyiceberg-0.9.0rc2/
>>>
>>>
>>> You can find the KEYS file here:
>>>
>>>    - https://downloads.apache.org/iceberg/KEYS
>>>
>>>
>>> Convenience binary artifacts are staged on pypi:
>>>
>>>    - https://pypi.org/project/pyiceberg/0.9.0rc2/
>>>
>>> And can be installed using: pip3 install pyiceberg==0.9.0rc2
>>>
>>>
>>> Instructions for verifying a release can be found here:
>>>
>>>    - https://py.iceberg.apache.org/verify-release/
>>>
>>>
>>> Please download, verify, and test.
>>>
>>> Please vote in the next 72 hours.
>>> [ ] +1 Release this as PyIceberg 0.9.0
>>> [ ] +0
>>> [ ] -1 Do not release this because...
>>>
>>
