Hi Steven,

The Spark cherrypick_snapshot and publish_changes procedures have documented
constraints (https://iceberg.apache.org/docs/nightly/spark-procedures/#output_3):
* For cherrypick_snapshot, the docs say "Only append and dynamic
overwrite snapshots can be cherry-picked."
* For publish_changes, the docs say "Only append and dynamic overwrite
snapshots can be successfully published."

So I was hoping we could add support for cherry-picking static overwrite
snapshots as well.
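
To make the intended semantics concrete, here is a toy Python sketch of the
two steps in the proposal quoted below. It is not Iceberg code: DataFile,
Snapshot, overwrite_filter, and cherrypick are hypothetical names used only
for illustration, and the static overwrite filter is simplified to equality
on a single partition value.

```python
from dataclasses import dataclass
from typing import FrozenSet, Optional


@dataclass(frozen=True)
class DataFile:
    path: str
    partition: str  # single partition value, e.g. "2024-05-28"


@dataclass(frozen=True)
class Snapshot:
    added_files: FrozenSet[DataFile]
    # Hypothetical: the static overwrite filter, preserved only on opt-in.
    overwrite_filter: Optional[str] = None


def cherrypick(current_files: FrozenSet[DataFile], staged: Snapshot) -> FrozenSet[DataFile]:
    """Replay a staged static overwrite on top of the current table state."""
    if staged.overwrite_filter is None:
        raise ValueError("static overwrite filter was not preserved")
    # Step 1: delete current files that match the preserved filter.
    kept = {f for f in current_files if f.partition != staged.overwrite_filter}
    # Step 2: redo the add-files part of the staged snapshot.
    return frozenset(kept | staged.added_files)


current = frozenset({
    DataFile("a.parquet", "2024-05-27"),
    DataFile("b.parquet", "2024-05-28"),
})
staged = Snapshot(
    added_files=frozenset({DataFile("c.parquet", "2024-05-28")}),
    overwrite_filter="2024-05-28",
)
result = cherrypick(current, staged)
print(sorted(f.path for f in result))  # ['a.parquet', 'c.parquet']
```

Without the preserved filter (overwrite_filter left unset), the sketch refuses
to cherry-pick, which mirrors the current restriction.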

Sorry for the confusion.

On Thu, May 30, 2024 at 1:08 PM Steven Wu <stevenz...@gmail.com> wrote:

> Pucheng,
>
> I am not sure about others, but I had a hard time understanding
> what the problem/proposal is. What is "cherrypick static partition
> overwrite"?
>
> Thanks,
> Steven
>
> On Thu, May 30, 2024 at 11:59 AM Pucheng Yang <py...@pinterest.com.invalid>
> wrote:
>
>> Hi community,
>>
>> I would like to follow up on this proposal and check whether anyone
>> has concerns about the proposed implementation from a high-level
>> perspective.
>>
>> Thank you very much.
>>
>> Best,
>> Pucheng
>>
>> On Tue, May 28, 2024 at 9:02 PM Pucheng Yang <py...@pinterest.com> wrote:
>>
>>> Hi community,
>>>
>>> My client is looking for support for cherry-picking static
>>> partition overwrite snapshots. Based on my understanding, we cannot do
>>> this today because we do not preserve static overwrite filters.
>>>
>>> I would like to propose the following to support cherry-picking static
>>> overwrite snapshots:
>>> 1. We will allow users to opt in to preserving the static overwrite
>>> filter in the snapshot summary.
>>> 2. When the snapshot is cherry-picked, we will first delete from
>>> the current snapshot using the static overwrite filter preserved in the
>>> snapshot, then we will redo the add files operation.
>>>
>>> Please share any thoughts or comments on this. If we agree,
>>> I can open a PR for this.
>>>
>>> Best,
>>> Pucheng
>>>
>>
