I'm +0. I definitely agree with the premise that we need a spec change to
ensure added rows exist at the snapshot level for row lineage, but I feel
like there is an advantage to just formalizing the added-records snapshot
summary property, and make it required for writers in case row lineage is
enabled on the table. The advantage is that in the ecosystem more
implementations are likely to populate the summary already (beyond the Java
implementation, I see Python does as well) so for those implementations,
the lift to support row lineage is a little bit reduced since the field
will probably already be populated. It also avoids any awkwardness around
having 2 of essentially the same field in metadata.

In the end, I think that is a minor advantage so I'm not very opinionated
on this. We're talking about one field, and the additional lift for that is
some slightly additional parsing handling in implementations which in the
grand scheme of things is a smaller portion of the work involved. I also
understand the argument that it's awkward to have required fields be in the
summary in the first place (thinking back to our discussions around
operation handling).

Thanks,

Amogh Jahagirdar

On Thu, Jan 16, 2025 at 2:52 PM Prashant Singh <prashant010...@gmail.com>
wrote:

> +1 (non-binding) !
>
> Best,
> Prashant Singh
>
> On Thu, Jan 16, 2025 at 1:14 PM Daniel Weeks <dwe...@apache.org> wrote:
>
>> +1
>>
>> On Thu, Jan 16, 2025 at 10:39 AM Steve Zhang
>> <hongyue_zh...@apple.com.invalid> wrote:
>>
>>> Thank you Russell! +1 (non-binding)
>>>
>>> Thanks,
>>> Steve Zhang
>>>
>>>
>>>
>>> On Jan 15, 2025, at 10:53 PM, huaxin gao <huaxin.ga...@gmail.com> wrote:
>>>
>>> +1 (non-binding)
>>>
>>>
>>>

Reply via email to