Thanks Peter for driving this FLIP.

One minor comment: The code for API SinkCommitterMetricGroup should be updated 
as description,  SinkWriterMetricGroup looks like a typo.

Best,
Leonard

> 2023年10月10日 下午1:21,Péter Váry <peter.vary.apa...@gmail.com> 写道:
> 
> Hi everyone,
> 
> It seems we have no more comments. So I would like to start a vote tomorrow
> if there are no further things to discuss.
> 
> Please let me know if you have any concerns, thanks!
> 
> 
> Best,
> Peter
> 
> On Thu, Oct 5, 2023, 12:53 Gabor Somogyi <gabor.g.somo...@gmail.com> wrote:
> 
>> Thanks for the efforts Peter!
>> 
>> I've just analyzed it through and I think it's useful feature.
>> 
>> +1 from my side.
>> 
>> G
>> 
>> 
>> On Thu, Oct 5, 2023 at 12:35 PM Péter Váry <peter.vary.apa...@gmail.com>
>> wrote:
>> 
>>> For the record, after the rename, the new FLIP link is:
>>> 
>>> 
>> https://cwiki.apache.org/confluence/display/FLINK/FLIP-371%3A+Provide+initialization+context+for+Committer+creation+in+TwoPhaseCommittingSink
>>> 
>>> Thanks,
>>> Peter
>>> 
>>> Péter Váry <peter.vary.apa...@gmail.com> ezt írta (időpont: 2023. okt.
>> 5.,
>>> Cs, 11:02):
>>> 
>>>> Thanks Gordon for the comments!
>>>> 
>>>>   1. I have changed the FLIP name to the one proposed by you.
>>>>   2. In the Iceberg sink we need access only to the Flink metrics. We
>> do
>>>>   not specifically need the job ID in the Committer after the SinkV2
>>>>   migration (more about that later). This is the reason why I have
>>> stated in
>>>>   the FLIP, that *"We should also provide similar context information
>> as
>>>>   we do in the Writer’s case, which should be discussed further on the
>>>>   mailing list."*. What I did is: I have just copied most of the
>>>>   org.apache.flink.api.connector.sink2.InitContext [1] properties to
>> the
>>>>   CommitterInitContext and removed the ones which seemed excessive to
>>> me.
>>>>   The API in the FLIP is only a draft, and I am open to any
>> suggestions.
>>>> 
>>>> In the new Iceberg Sink we need unique ids for the Committables, but we
>>>> generate them in the SinkV2Aggregator [2] which is placed into the
>>>> PreCommitTopology. The aggregator has access to the job ID, operator ID
>>> and
>>>> checkpoint ID. So no new info is needed on the Committer side there.
>>>> 
>>>> Thanks,
>>>> Peter
>>>> 
>>>> - [1]
>>>> 
>>> 
>> https://github.com/apache/flink/blob/cd95b560d0c11a64b42bf6b98107314d32a4de86/flink-core/src/main/java/org/apache/flink/api/connector/sink2/Sink.java#L95-L131
>>>> - [2]
>>>> 
>>> 
>> https://github.com/apache/iceberg/pull/8653/files#diff-bfd6a564485ec60b2d53f7aa800451548d4713c5f797e76bcff95d2c8ae05ed1R77-R81
>>>> 
>>>> Tzu-Li (Gordon) Tai <tzuli...@apache.org> ezt írta (időpont: 2023.
>> okt.
>>>> 5., Cs, 8:16):
>>>> 
>>>>> Thanks Peter for starting the FLIP.
>>>>> 
>>>>> Overall, this seems pretty straightforward and overdue, +1.
>>>>> 
>>>>> Two quick question / comments:
>>>>> 
>>>>>   1. Can you rename the FLIP to something less generic? Perhaps
>>> "Provide
>>>>>   initialization context for Committer creation in
>>>>> TwoPhaseCommittingSink"?
>>>>>   2. Can you describe why the job ID is needed to be exposed?
>> Although
>>>>>   it's out of scope for this FLIP, I'm wondering if there's a need to
>>> do
>>>>> the
>>>>>   same for the sink writer InitContext.
>>>>> 
>>>>> Thanks,
>>>>> Gordon
>>>>> 
>>>>> On Wed, Oct 4, 2023 at 11:20 AM Martijn Visser <
>>> martijnvis...@apache.org>
>>>>> wrote:
>>>>> 
>>>>>> Hi all,
>>>>>> 
>>>>>> Peter, Marton, Gordon and I had an offline sync on SinkV2 and I'm
>>>>>> happy with this first FLIP on the topic. +1
>>>>>> 
>>>>>> Best regards,
>>>>>> 
>>>>>> Martijn
>>>>>> 
>>>>>> On Wed, Oct 4, 2023 at 5:48 PM Márton Balassi <
>>> balassi.mar...@gmail.com
>>>>>> 
>>>>>> wrote:
>>>>>>> 
>>>>>>> Thanks, Peter. I agree that this is needed for Iceberg and
>>> beneficial
>>>>> for
>>>>>>> other connectors too.
>>>>>>> 
>>>>>>> +1
>>>>>>> 
>>>>>>> On Wed, Oct 4, 2023 at 3:56 PM Péter Váry <
>>>>> peter.vary.apa...@gmail.com>
>>>>>>> wrote:
>>>>>>> 
>>>>>>>> Hi Team,
>>>>>>>> 
>>>>>>>> In my previous email[1] I have described our challenges
>> migrating
>>>>> the
>>>>>>>> existing Iceberg SinkFunction based implementation, to the new
>>>>> SinkV2
>>>>>> based
>>>>>>>> implementation.
>>>>>>>> 
>>>>>>>> As a result of the discussion around that topic, I have created
>>> the
>>>>>> first
>>>>>>>> [2] of the FLIP-s addressing the missing features there.
>>>>>>>> 
>>>>>>>> The main goal of the FLIP-371 is to extend the currently
>> existing
>>>>>> Committer
>>>>>>>> API by providing an initial context on Committer creation. This
>>>>> context
>>>>>>>> will contain - among other, less interesting data -
>>>>>>>> the SinkCommitterMetricGroup which could be used to store the
>>>>> generic
>>>>>>>> commit related metrics, and also provide a way for the Committer
>>> to
>>>>>> publish
>>>>>>>> its own metrics.
>>>>>>>> 
>>>>>>>> The feature has already been requested through FLINK-25857 [3].
>>>>>>>> 
>>>>>>>> Please use this thread to discuss the FLIP related questions,
>>>>>> proposals,
>>>>>>>> feedback.
>>>>>>>> 
>>>>>>>> Thanks,
>>>>>>>> Peter
>>>>>>>> 
>>>>>>>> - [1]
>>>>> https://lists.apache.org/thread/h3kg7jcgjstpvwlhnofq093vk93ylgsn
>>>>>>>> - [2]
>>>>>>>> 
>>>>>>>> 
>>>>>> 
>>>>> 
>>> 
>> https://cwiki.apache.org/confluence/display/FLINK/FLIP-371%3A+SinkV2+Committer+imporvements
>>>>>>>> - [3] https://issues.apache.org/jira/browse/FLINK-25857
>>>>>>>> 
>>>>>> 
>>>>> 
>>>> 
>>> 
>> 

Reply via email to