Re: [VOTE] FLIP-281: Sink Supports Speculative Execution For Batch Job

2023-01-11 Thread Martijn Visser
+0 (binding)

Op di 10 jan. 2023 om 13:11 schreef yuxia :

> +1 (non-binding).
>
> Best regards,
> Yuxia
>
> - 原始邮件 -
> 发件人: "Zhu Zhu" 
> 收件人: "dev" 
> 发送时间: 星期二, 2023年 1 月 10日 下午 5:50:39
> 主题: Re: [VOTE] FLIP-281: Sink Supports Speculative Execution For Batch Job
>
> +1 (binding)
>
> Thanks,
> Zhu
>
> Biao Liu  于2023年1月5日周四 10:37写道:
> >
> > Hi Martijn,
> >
> > Sure, thanks for the reminder about the holiday period.
> > Looking forward to your feedback!
> >
> > Thanks,
> > Biao /'bɪ.aʊ/
> >
> >
> >
> > On Thu, 5 Jan 2023 at 03:07, Martijn Visser 
> > wrote:
> >
> > > Hi Biao,
> > >
> > > To be honest, I haven't read the FLIP yet since this is still a holiday
> > > period in Europe. I would like to read it in the next few days. Can you
> > > keep the vote open a little longer?
> > >
> > > Best regards,
> > >
> > > Martijn
> > >
> > > On Wed, Jan 4, 2023 at 1:31 PM Biao Liu  wrote:
> > >
> > > > Hi everyone,
> > > >
> > > > Thanks for all the feedback!
> > > >
> > > > Based on the discussion[1], we seem to have a consensus. So I'd like
> to
> > > > start a vote on FLIP-281: Sink Supports Speculative Execution For
> Batch
> > > > Job[2]. The vote will last for 72 hours, unless there is an
> objection or
> > > > insufficient votes.
> > > >
> > > > [1] https://lists.apache.org/thread/l05m6cf8fwkkbpnjtzbg9l2lo40oxzd1
> > > > [2]
> > > >
> > > >
> > >
> https://cwiki.apache.org/confluence/display/FLINK/FLIP-281+Sink+Supports+Speculative+Execution+For+Batch+Job
> > > >
> > > > Thanks,
> > > > Biao /'bɪ.aʊ/
> > > >
> > >
>


Re: [DISCUSS] FLIP-281 Sink Supports Speculative Execution For Batch Job

2023-01-11 Thread Martijn Visser
Hi Biao,

While I rather wouldn't add new features to (to-be) deprecated features, I
would be +0 for this.

Best regards,

Martijn

Op do 12 jan. 2023 om 08:42 schreef Biao Liu :

> Hi Martijn,
>
> Thanks for your feedback!
>
> Yes, we propose to support speculative execution for SinkFunction.
> 1. From the perspective of compatibility, SinkFunction is the most original
> Sink implementation.There are lots of implementations based on
> SinkFunction, not only in Flink official codebase but also in user's
> private codebase. It's a more serious issue than Sink V1. Of course we hope
> users could migrate the legacy implementation to the new interface. However
> migration is always hard.
> 2. From the perspective of cost, we don't need to do much extra work to
> support speculative execution for SinkFunction. All we need to do is check
> whether the SinkFunction implementation
> inherits SupportsConcurrentExecutionAttempts or not. The other parts of
> work are the same with Sink V2.
>
> To summarize, it's cheap to support speculative execution for SinkFunction.
> And it may allow more existing scenarios to run with speculative execution.
>
> Thanks,
> Biao /'bɪ.aʊ/
>
>
>
> On Wed, 11 Jan 2023 at 21:22, Martijn Visser 
> wrote:
>
> > Hi Biao,
> >
> > Apologies for the late jumping in. My only question is about
> SinkFunction,
> > does this imply that we want to add support for this to the SinkFunction?
> > If so, I would not be in favour of that since we would like to deprecate
> (I
> > actually thought that was already the case) the SinkFunction in favour of
> > SinkV2.
> >
> > Besides that, I have no other comments.
> >
> > Best regards,
> >
> > Martijn
> >
> > On Wed, Jan 4, 2023 at 7:28 AM Jing Zhang  wrote:
> >
> > > Hi Biao,
> > >
> > > Thanks for explanation.
> > >
> > > +1 for the proposal.
> > >
> > > Best,
> > > Jing Zhang
> > >
> > > Lijie Wang  于2023年1月4日周三 12:11写道:
> > >
> > > > Hi Biao,
> > > >
> > > > Thanks for the explanation of how SinkV2  knows the right subtask
> > > > attempt. I have no more questions, +1 for the proposal.
> > > >
> > > > Best,
> > > > Lijie
> > > >
> > > > Biao Liu  于2022年12月28日周三 17:22写道:
> > > >
> > > > > Thanks for all your feedback!
> > > > >
> > > > > To @Yuxia,
> > > > >
> > > > > > What the sink expect to do to isolate data produced by
> speculative
> > > > > > executions?  IIUC, if the taks failover, it also generate a new
> > > > attempt.
> > > > > > Does it make difference in isolating data produced?
> > > > >
> > > > >
> > > > > Yes there is something different from the task failover scenario.
> The
> > > > > attempt number is more necessary for speculative execution than
> > > failover.
> > > > > Because there can be only one subtask instance running at the same
> > time
> > > > in
> > > > > the failover scenario.
> > > > >
> > > > > Let's take FileSystemOutputFormat as an example. For the failover
> > > > scenario,
> > > > > the temporary directory to store produced data can be something
> like
> > > > > "$root_dir/task-$taskNumber/". At the initialization phase, subtask
> > > > deletes
> > > > > and re-creates the temporary directory.
> > > > >
> > > > > However in the speculative execution scenario, it does not work
> > because
> > > > > there might be several subtasks running at the same time. These
> > > subtasks
> > > > > might delete, re-create and write the same temporary directory at
> the
> > > > > same time. The correct temporary directory should be like
> > > > > "$root_dir/task-$taskNumber/attempt-$attemptNumber". So it's
> > necessary
> > > to
> > > > > expose the attempt number to the Sink implementation to do the data
> > > > > isolation.
> > > > >
> > > > >
> > > > > To @Lijie,
> > > > >
> > > > > > I have a question about this: does SinkV2 need to do the same
> > thing?
> > > > >
> > > > >
> > > > > Actually, yes.
> > > > >
> > > > > Should we/users do it in the committer? If yes, how does the
> commiter
> > > > know
> > > > > > which one is the right subtask attempt?
> > > > >
> > > > >
> > > > > Yes, we/users should do it in the committer.
> > > > >
> > > > > In the current design, the Committer of Sink V2 should get the
> "which
> > > one
> > > > > is the right subtask attempt" information from the "committable
> > data''
> > > > > produced by SinkWriter. Let's take the FileSink as example, the
> > > > > "committable data" sent to the Committer contains the full path of
> > the
> > > > > files produced by SinkWriter. Users could also pass the attempt
> > number
> > > > > through "committable data" from SinkWriter to Committer.
> > > > >
> > > > > In the "Rejected Alternatives -> Introduce a way to clean leaked
> data
> > > of
> > > > > Sink V2" section of the FLIP document, we discussed some of the
> > reasons
> > > > > that we didn't provide the API like OutputFormat.
> > > > >
> > > > > To @Jing Zhang
> > > > >
> > > > > I have a question about this: Speculative execution of Committer
> will
> > > be
> > > > > > disabled.
> > > > >
> > > 

Re: [DISCUSS] FLIP-281 Sink Supports Speculative Execution For Batch Job

2023-01-11 Thread Biao Liu
Hi Martijn,

Thanks for your feedback!

Yes, we propose to support speculative execution for SinkFunction.
1. From the perspective of compatibility, SinkFunction is the most original
Sink implementation.There are lots of implementations based on
SinkFunction, not only in Flink official codebase but also in user's
private codebase. It's a more serious issue than Sink V1. Of course we hope
users could migrate the legacy implementation to the new interface. However
migration is always hard.
2. From the perspective of cost, we don't need to do much extra work to
support speculative execution for SinkFunction. All we need to do is check
whether the SinkFunction implementation
inherits SupportsConcurrentExecutionAttempts or not. The other parts of
work are the same with Sink V2.

To summarize, it's cheap to support speculative execution for SinkFunction.
And it may allow more existing scenarios to run with speculative execution.

Thanks,
Biao /'bɪ.aʊ/



On Wed, 11 Jan 2023 at 21:22, Martijn Visser 
wrote:

> Hi Biao,
>
> Apologies for the late jumping in. My only question is about SinkFunction,
> does this imply that we want to add support for this to the SinkFunction?
> If so, I would not be in favour of that since we would like to deprecate (I
> actually thought that was already the case) the SinkFunction in favour of
> SinkV2.
>
> Besides that, I have no other comments.
>
> Best regards,
>
> Martijn
>
> On Wed, Jan 4, 2023 at 7:28 AM Jing Zhang  wrote:
>
> > Hi Biao,
> >
> > Thanks for explanation.
> >
> > +1 for the proposal.
> >
> > Best,
> > Jing Zhang
> >
> > Lijie Wang  于2023年1月4日周三 12:11写道:
> >
> > > Hi Biao,
> > >
> > > Thanks for the explanation of how SinkV2  knows the right subtask
> > > attempt. I have no more questions, +1 for the proposal.
> > >
> > > Best,
> > > Lijie
> > >
> > > Biao Liu  于2022年12月28日周三 17:22写道:
> > >
> > > > Thanks for all your feedback!
> > > >
> > > > To @Yuxia,
> > > >
> > > > > What the sink expect to do to isolate data produced by speculative
> > > > > executions?  IIUC, if the taks failover, it also generate a new
> > > attempt.
> > > > > Does it make difference in isolating data produced?
> > > >
> > > >
> > > > Yes there is something different from the task failover scenario. The
> > > > attempt number is more necessary for speculative execution than
> > failover.
> > > > Because there can be only one subtask instance running at the same
> time
> > > in
> > > > the failover scenario.
> > > >
> > > > Let's take FileSystemOutputFormat as an example. For the failover
> > > scenario,
> > > > the temporary directory to store produced data can be something like
> > > > "$root_dir/task-$taskNumber/". At the initialization phase, subtask
> > > deletes
> > > > and re-creates the temporary directory.
> > > >
> > > > However in the speculative execution scenario, it does not work
> because
> > > > there might be several subtasks running at the same time. These
> > subtasks
> > > > might delete, re-create and write the same temporary directory at the
> > > > same time. The correct temporary directory should be like
> > > > "$root_dir/task-$taskNumber/attempt-$attemptNumber". So it's
> necessary
> > to
> > > > expose the attempt number to the Sink implementation to do the data
> > > > isolation.
> > > >
> > > >
> > > > To @Lijie,
> > > >
> > > > > I have a question about this: does SinkV2 need to do the same
> thing?
> > > >
> > > >
> > > > Actually, yes.
> > > >
> > > > Should we/users do it in the committer? If yes, how does the commiter
> > > know
> > > > > which one is the right subtask attempt?
> > > >
> > > >
> > > > Yes, we/users should do it in the committer.
> > > >
> > > > In the current design, the Committer of Sink V2 should get the "which
> > one
> > > > is the right subtask attempt" information from the "committable
> data''
> > > > produced by SinkWriter. Let's take the FileSink as example, the
> > > > "committable data" sent to the Committer contains the full path of
> the
> > > > files produced by SinkWriter. Users could also pass the attempt
> number
> > > > through "committable data" from SinkWriter to Committer.
> > > >
> > > > In the "Rejected Alternatives -> Introduce a way to clean leaked data
> > of
> > > > Sink V2" section of the FLIP document, we discussed some of the
> reasons
> > > > that we didn't provide the API like OutputFormat.
> > > >
> > > > To @Jing Zhang
> > > >
> > > > I have a question about this: Speculative execution of Committer will
> > be
> > > > > disabled.
> > > >
> > > > I agree with your point and I saw the similar requirements to disable
> > > > speculative
> > > > > execution for specified operators.
> > > >
> > > > However the requirement is not supported currently. I think there
> > > > should be some
> > > > > place to describe how to support it.
> > > >
> > > >
> > > > In this FLIP design, the speculative execution of Committer of Sink
> V2
> > > will
> > > > be disabled by Flink. It's not an optional operation. Users can not

Re: [VOTE] Apache Flink Table Store 0.3.0, release candidate #1

2023-01-11 Thread Jingsong Li
Thanks Yu for your validation.

I created a new staging directory [1]

[1] https://repository.apache.org/content/repositories/orgapacheflink-1577/

Best,
Jingsong

On Thu, Jan 12, 2023 at 3:07 PM Yu Li  wrote:
>
> Hi Jingsong,
>
> It seems the given staging directory [1] is not exposed, could you double
> check and republish if necessary? Thanks.
>
> Best Regards,
> Yu
>
> [1] https://repository.apache.org/content/repositories/orgapacheflink-1576/
>
>
> On Tue, 10 Jan 2023 at 16:53, Jingsong Li  wrote:
>
> > Hi everyone,
> >
> > Please review and vote on the release candidate #1 for the version
> > 0.3.0 of Apache Flink Table Store, as follows:
> >
> > [ ] +1, Approve the release
> > [ ] -1, Do not approve the release (please provide specific comments)
> >
> > **Release Overview**
> >
> > As an overview, the release consists of the following:
> > a) Table Store canonical source distribution to be deployed to the
> > release repository at dist.apache.org
> > b) Table Store binary convenience releases to be deployed to the
> > release repository at dist.apache.org
> > c) Maven artifacts to be deployed to the Maven Central Repository
> >
> > **Staging Areas to Review**
> >
> > The staging areas containing the above mentioned artifacts are as follows,
> > for your review:
> > * All artifacts for a) and b) can be found in the corresponding dev
> > repository at dist.apache.org [2]
> > * All artifacts for c) can be found at the Apache Nexus Repository [3]
> >
> > All artifacts are signed with the key
> > 2C2B6A653B07086B65E4369F7C76245E0A318150 [4]
> >
> > Other links for your review:
> > * JIRA release notes [5]
> > * source code tag "release-0.3.0-rc1" [6]
> > * PR to update the website Downloads page to include Table Store links [7]
> >
> > **Vote Duration**
> >
> > The voting time will run for at least 72 hours.
> > It is adopted by majority approval, with at least 3 PMC affirmative votes.
> >
> > Best,
> > Jingsong Lee
> >
> > [1]
> > https://cwiki.apache.org/confluence/display/FLINK/Verifying+a+Flink+Table+Store+Release
> > [2]
> > https://dist.apache.org/repos/dist/dev/flink/flink-table-store-0.3.0-rc1/
> > [3]
> > https://repository.apache.org/content/repositories/orgapacheflink-1576/
> > [4] https://dist.apache.org/repos/dist/release/flink/KEYS
> > [5]
> > https://issues.apache.org/jira/secure/ReleaseNote.jspa?projectId=12315522=12352111
> > [6] https://github.com/apache/flink-table-store/tree/release-0.3.0-rc1
> > [7] https://github.com/apache/flink-web/pull/601
> >


[jira] [Created] (FLINK-30641) docs_404_check fail

2023-01-11 Thread Lijie Wang (Jira)
Lijie Wang created FLINK-30641:
--

 Summary: docs_404_check fail
 Key: FLINK-30641
 URL: https://issues.apache.org/jira/browse/FLINK-30641
 Project: Flink
  Issue Type: Bug
Reporter: Lijie Wang


{code:java}
Cloning into 'flink-connector-rabbitmq'...
Note: switching to '325b6ba8d866bb02204736229aa54ade304119a3'.

You are in 'detached HEAD' state. You can look around, make experimental
changes and commit them, and you can discard any commits you make in this
state without impacting any branches by switching back to a branch.

If you want to create a new branch to retain commits you create, you may
do so (now or later) by using -c with the switch command. Example:

  git switch -c 

Or undo this operation with:

  git switch -

Turn off this advice by setting config variable advice.detachedHead to false

Start building sites … 
ERROR 2023/01/12 06:49:21 [en] REF_NOT_FOUND: Ref 
"docs/connectors/table/kinesis": 
"/home/vsts/work/1/s/docs/content/docs/connectors/table/formats/overview.md:45:20":
 page not found
ERROR 2023/01/12 06:49:21 [en] REF_NOT_FOUND: Ref 
"docs/connectors/table/firehose": 
"/home/vsts/work/1/s/docs/content/docs/connectors/table/formats/overview.md:46:20":
 page not found
ERROR 2023/01/12 06:49:21 [en] REF_NOT_FOUND: Ref 
"docs/connectors/table/kinesis": 
"/home/vsts/work/1/s/docs/content/docs/connectors/table/formats/overview.md:53:20":
 page not found
ERROR 2023/01/12 06:49:21 [en] REF_NOT_FOUND: Ref 
"docs/connectors/table/firehose": 
"/home/vsts/work/1/s/docs/content/docs/connectors/table/formats/overview.md:54:20":
 page not found
ERROR 2023/01/12 06:49:21 [en] REF_NOT_FOUND: Ref 
"docs/connectors/table/kinesis": 
"/home/vsts/work/1/s/docs/content/docs/connectors/table/formats/overview.md:62:21":
 page not found
ERROR 2023/01/12 06:49:21 [en] REF_NOT_FOUND: Ref 
"docs/connectors/table/firehose": 
"/home/vsts/work/1/s/docs/content/docs/connectors/table/formats/overview.md:63:21":
 page not found
ERROR 2023/01/12 06:49:21 [en] REF_NOT_FOUND: Ref 
"docs/connectors/table/kinesis": 
"/home/vsts/work/1/s/docs/content/docs/connectors/table/formats/overview.md:103:20":
 page not found
ERROR 2023/01/12 06:49:21 [en] REF_NOT_FOUND: Ref 
"docs/connectors/table/firehose": 
"/home/vsts/work/1/s/docs/content/docs/connectors/table/formats/overview.md:104:20":
 page not found
ERROR 2023/01/12 06:49:21 [en] REF_NOT_FOUND: Ref 
"docs/connectors/datastream/dynamodb": 
"/home/vsts/work/1/s/docs/content/docs/connectors/datastream/overview.md:43:22":
 page not found
ERROR 2023/01/12 06:49:21 [en] REF_NOT_FOUND: Ref 
"docs/connectors/datastream/kinesis": 
"/home/vsts/work/1/s/docs/content/docs/connectors/datastream/overview.md:44:34":
 page not found
ERROR 2023/01/12 06:49:21 [en] REF_NOT_FOUND: Ref 
"docs/connectors/datastream/firehose": 
"/home/vsts/work/1/s/docs/content/docs/connectors/datastream/overview.md:45:35":
 page not found
ERROR 2023/01/12 06:49:21 [en] REF_NOT_FOUND: Ref 
"docs/connectors/table/dynamodb": 
"/home/vsts/work/1/s/docs/content/docs/connectors/table/overview.md:70:20": 
page not found
ERROR 2023/01/12 06:49:21 [en] REF_NOT_FOUND: Ref 
"docs/connectors/table/kinesis": 
"/home/vsts/work/1/s/docs/content/docs/connectors/table/overview.md:76:20": 
page not found
ERROR 2023/01/12 06:49:21 [en] REF_NOT_FOUND: Ref 
"docs/connectors/table/firehose": 
"/home/vsts/work/1/s/docs/content/docs/connectors/table/overview.md:82:20": 
page not found
ERROR 2023/01/12 06:49:30 [zh] REF_NOT_FOUND: Ref 
"docs/connectors/table/kinesis": 
"/home/vsts/work/1/s/docs/content.zh/docs/connectors/table/formats/overview.md:45:20":
 page not found
ERROR 2023/01/12 06:49:30 [zh] REF_NOT_FOUND: Ref 
"docs/connectors/table/firehose": 
"/home/vsts/work/1/s/docs/content.zh/docs/connectors/table/formats/overview.md:46:20":
 page not found
ERROR 2023/01/12 06:49:30 [zh] REF_NOT_FOUND: Ref 
"docs/connectors/table/kinesis": 
"/home/vsts/work/1/s/docs/content.zh/docs/connectors/table/formats/overview.md:53:20":
 page not found
ERROR 2023/01/12 06:49:30 [zh] REF_NOT_FOUND: Ref 
"docs/connectors/table/firehose": 
"/home/vsts/work/1/s/docs/content.zh/docs/connectors/table/formats/overview.md:54:20":
 page not found
ERROR 2023/01/12 06:49:30 [zh] REF_NOT_FOUND: Ref 
"docs/connectors/table/kinesis": 
"/home/vsts/work/1/s/docs/content.zh/docs/connectors/table/formats/overview.md:62:21":
 page not found
ERROR 2023/01/12 06:49:30 [zh] REF_NOT_FOUND: Ref 
"docs/connectors/table/firehose": 
"/home/vsts/work/1/s/docs/content.zh/docs/connectors/table/formats/overview.md:63:21":
 page not found
ERROR 2023/01/12 06:49:30 [zh] REF_NOT_FOUND: Ref 
"docs/connectors/table/kinesis": 
"/home/vsts/work/1/s/docs/content.zh/docs/connectors/table/formats/overview.md:103:20":
 page not found
ERROR 2023/01/12 06:49:30 [zh] REF_NOT_FOUND: Ref 
"docs/connectors/table/firehose": 

Re: [VOTE] Apache Flink Table Store 0.3.0, release candidate #1

2023-01-11 Thread Yu Li
Hi Jingsong,

It seems the given staging directory [1] is not exposed, could you double
check and republish if necessary? Thanks.

Best Regards,
Yu

[1] https://repository.apache.org/content/repositories/orgapacheflink-1576/


On Tue, 10 Jan 2023 at 16:53, Jingsong Li  wrote:

> Hi everyone,
>
> Please review and vote on the release candidate #1 for the version
> 0.3.0 of Apache Flink Table Store, as follows:
>
> [ ] +1, Approve the release
> [ ] -1, Do not approve the release (please provide specific comments)
>
> **Release Overview**
>
> As an overview, the release consists of the following:
> a) Table Store canonical source distribution to be deployed to the
> release repository at dist.apache.org
> b) Table Store binary convenience releases to be deployed to the
> release repository at dist.apache.org
> c) Maven artifacts to be deployed to the Maven Central Repository
>
> **Staging Areas to Review**
>
> The staging areas containing the above mentioned artifacts are as follows,
> for your review:
> * All artifacts for a) and b) can be found in the corresponding dev
> repository at dist.apache.org [2]
> * All artifacts for c) can be found at the Apache Nexus Repository [3]
>
> All artifacts are signed with the key
> 2C2B6A653B07086B65E4369F7C76245E0A318150 [4]
>
> Other links for your review:
> * JIRA release notes [5]
> * source code tag "release-0.3.0-rc1" [6]
> * PR to update the website Downloads page to include Table Store links [7]
>
> **Vote Duration**
>
> The voting time will run for at least 72 hours.
> It is adopted by majority approval, with at least 3 PMC affirmative votes.
>
> Best,
> Jingsong Lee
>
> [1]
> https://cwiki.apache.org/confluence/display/FLINK/Verifying+a+Flink+Table+Store+Release
> [2]
> https://dist.apache.org/repos/dist/dev/flink/flink-table-store-0.3.0-rc1/
> [3]
> https://repository.apache.org/content/repositories/orgapacheflink-1576/
> [4] https://dist.apache.org/repos/dist/release/flink/KEYS
> [5]
> https://issues.apache.org/jira/secure/ReleaseNote.jspa?projectId=12315522=12352111
> [6] https://github.com/apache/flink-table-store/tree/release-0.3.0-rc1
> [7] https://github.com/apache/flink-web/pull/601
>


[jira] [Created] (FLINK-30640) Unstable test in CliClientITCase

2023-01-11 Thread yuzelin (Jira)
yuzelin created FLINK-30640:
---

 Summary: Unstable test in CliClientITCase
 Key: FLINK-30640
 URL: https://issues.apache.org/jira/browse/FLINK-30640
 Project: Flink
  Issue Type: Bug
  Components: Connectors / Hive, Table SQL / Client
Affects Versions: 1.17.0
Reporter: yuzelin


[https://dev.azure.com/apache-flink/apache-flink/_build/results?buildId=44743=logs=0c940707-2659-5648-cbe6-a1ad63045f0a=075c2716-8010-5565-fe08-3c4bb45824a4]

 

The failed test can work normally in my local environment.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


Re: [VOTE] FLIP-274: Introduce metric group for OperatorCoordinator

2023-01-11 Thread Xintong Song
Hi all,

AFAICS, there’s no consensus so far on how we should proceed discussions
while some of the participants are unresponsive, in either the Bylaws[1] or
any of the ML threads that I’m aware of. Without such consensus, it is
understandable that contributors take actions based on their different
personal opinions, which IMO causes this argument right now. So let’s try
not to point fingers at each other and focus on setting up a commonly
agreed process on this.

I agree that every opinion, especially the concerns and disagreements,
should be respected and carefully discussed and addressed. Meantime, it is
also important for the Flink project to move forward at a good pace, of
course without sacrificing the quality. So the question is how do we
actively resolve the disagreements and reach consensus efficiently. With
the growth of the community, there’re more developers from various
companies concurrently working on more FLIPs / features / efforts than
before. It’s getting harder and harder for one contributor (e.g., myself)
to participate in every thread that he’s interested in without slowing down
the evolution of the project. In such cases, I personally would tend to
trust other contributors, especially the committers, in doing a good job
without my participant, rather than slowing down others due to my lack of
capacity.

So here’s my proposal:

1. When raising an opinion (question / concern / disagreement) that blocks
the discussion from reaching consensus, the contributor that the opinion
comes from should try to respond to the replies to their opinions within 1
week (5 work days since the replies are made, excluding weekends and public
holidays) if possible.

2. If the response cannot be made within 1 week (due to other works,
personal vacation plans, etc.), an explicit date of response should be
given. The given response date should be no later than 2 weeks (10 work
days since the replies are made, excluding weekends and public holidays),
unless agreed by all active participants in the discussion.

3. A question / concern / disagreement can be considered addressed if: a)
there’s no response in 1 week or before the given response date, and b) all
the active participants agreed that it is resolved.

4. A discussion should be opened for at least 1 week, for opinions to be
raised, before starting a vote. When there’re unresponsive participants, a
72h notice is required before claiming reaching out consensus for a
discussion and starting a vote. This can overlap with the time waiting for
responses from the irresponsive participants.

5. Here *response* is restricted to technical opinions. Quick replies such
as "will take a look asap" do not count.

Looking forward to your opinions.

Best,

Xintong


[1] https://cwiki.apache.org/confluence/display/FLINK/Flink+Bylaws



On Thu, Jan 12, 2023 at 10:07 AM Hang Ruan  wrote:

> Hi, all,
>
> Thanks for all discussions about this FLIP first. We all are trying to make
> Flink better. But not getting a reply quickly really discourages
> contributors.
> OperatorCoordinatorMetricGroup and SplitEnumeratorMetricGroup are important
> for many developers. And many metrics can not be reported without it. This
> FLIP raised on 26 Dec 2022 and has last over a month.
> So let's make the discussion on FLIP-274 be settled.
>
> Thanks for all helps and let's make this FLIP move on together.
>
> Best,
> Hang
>
> Chesnay Schepler  于2023年1月12日周四 07:29写道:
>
> > You know, back when I wrote that line I actually felt a bit guilty about
> > it and nearly dropped it, since it implicitly accused _someone_ of
> > actually being capable/willing to push something through despite voiced
> > concerns while no ones looking.
> >
> > Turns out things are a lot worse that I thought. You're actually
> > doubling down on it and even insult me.
> >
> >
> > As I haven't had a time to revisit the discussion and afaict my concerns
> > weren't addressed, I hereby vote -1 (binding).
> >
> > Please be aware that since I'm effectively only back from vacation
> > tomorrow and need to catch up on things in general, I may not be able to
> > revisit the discussion this week. In part because I guess I now need to
> > double-check everything.
> >
> > CC'ing the private ML because I find this very troubling behavior.
> >
> > On 11/01/2023 16:20, Martijn Visser wrote:
> > > Hi Jark,
> > >
> > > I disagree with your statement that someone doesn't care about the
> FLIP,
> > > especially since that person participated in the initial FLIP
> discussion
> > > and mentioned explicitly "Since I'm on holidays soon, just so no one
> > tries
> > > to pull a fast one on me, if this were to go to a vote as-is I'd be
> > against
> > > it.".
> > >
> > > Anyway: I can't tell when Chesnay exactly will participate, but I do
> > expect
> > > that this week.
> > >
> > > Best regards,
> > >
> > > Martijn
> > >
> > > On Wed, Jan 11, 2023 at 4:11 PM Jark Wu  wrote:
> > >
> > >> Hi Martijn,
> > >>
> > >> Thanks for helping us to find 

Re: [ANNOUNCE] New Apache Flink Committer - Lincoln Lee

2023-01-11 Thread Lincoln Lee
Thank you all!

I'm honored to join the committers and look forward to continue working
with the community.

Best,
Lincoln Lee


Shengkai Fang  于2023年1月12日周四 09:55写道:

> Congratulations, Lincoln!
>
> Best,
> Shengkai
>
> liu ron  于2023年1月12日周四 09:48写道:
>
> > Congratulations, Lincoln!
> >
> > Best
> >
> > Yu Li  于2023年1月12日周四 09:22写道:
> >
> > > Congratulations, Lincoln!
> > >
> > > Best Regards,
> > > Yu
> > >
> > >
> > > On Wed, 11 Jan 2023 at 21:17, Martijn Visser  >
> > > wrote:
> > >
> > > > Congratulations Lincoln, happy to have you on board!
> > > >
> > > > Best regards, Martijn
> > > >
> > > >
> > > > On Wed, Jan 11, 2023 at 1:49 PM Dong Lin 
> wrote:
> > > >
> > > > > Congratulations, Lincoln!
> > > > >
> > > > > Cheers,
> > > > > Dong
> > > > >
> > > > > On Tue, Jan 10, 2023 at 11:52 AM Jark Wu  wrote:
> > > > >
> > > > > > Hi everyone,
> > > > > >
> > > > > > On behalf of the PMC, I'm very happy to announce Lincoln Lee as a
> > new
> > > > > Flink
> > > > > > committer.
> > > > > >
> > > > > > Lincoln Lee has been a long-term Flink contributor since 2017. He
> > > > mainly
> > > > > > works on Flink
> > > > > > SQL parts and drives several important FLIPs, e.g., FLIP-232
> (Retry
> > > > Async
> > > > > > I/O), FLIP-234 (
> > > > > > Retryable Lookup Join), FLIP-260 (TableFunction Finish). Besides,
> > He
> > > > also
> > > > > > contributed
> > > > > > much to Streaming Semantics, including the non-determinism
> problem
> > > and
> > > > > the
> > > > > > message
> > > > > > ordering problem.
> > > > > >
> > > > > > Please join me in congratulating Lincoln for becoming a Flink
> > > > committer!
> > > > > >
> > > > > > Cheers,
> > > > > > Jark Wu
> > > > > >
> > > > >
> > > >
> > >
> >
>


Re: [VOTE] FLIP-274: Introduce metric group for OperatorCoordinator

2023-01-11 Thread Hang Ruan
Hi, all,

Thanks for all discussions about this FLIP first. We all are trying to make
Flink better. But not getting a reply quickly really discourages
contributors.
OperatorCoordinatorMetricGroup and SplitEnumeratorMetricGroup are important
for many developers. And many metrics can not be reported without it. This
FLIP raised on 26 Dec 2022 and has last over a month.
So let's make the discussion on FLIP-274 be settled.

Thanks for all helps and let's make this FLIP move on together.

Best,
Hang

Chesnay Schepler  于2023年1月12日周四 07:29写道:

> You know, back when I wrote that line I actually felt a bit guilty about
> it and nearly dropped it, since it implicitly accused _someone_ of
> actually being capable/willing to push something through despite voiced
> concerns while no ones looking.
>
> Turns out things are a lot worse that I thought. You're actually
> doubling down on it and even insult me.
>
>
> As I haven't had a time to revisit the discussion and afaict my concerns
> weren't addressed, I hereby vote -1 (binding).
>
> Please be aware that since I'm effectively only back from vacation
> tomorrow and need to catch up on things in general, I may not be able to
> revisit the discussion this week. In part because I guess I now need to
> double-check everything.
>
> CC'ing the private ML because I find this very troubling behavior.
>
> On 11/01/2023 16:20, Martijn Visser wrote:
> > Hi Jark,
> >
> > I disagree with your statement that someone doesn't care about the FLIP,
> > especially since that person participated in the initial FLIP discussion
> > and mentioned explicitly "Since I'm on holidays soon, just so no one
> tries
> > to pull a fast one on me, if this were to go to a vote as-is I'd be
> against
> > it.".
> >
> > Anyway: I can't tell when Chesnay exactly will participate, but I do
> expect
> > that this week.
> >
> > Best regards,
> >
> > Martijn
> >
> > On Wed, Jan 11, 2023 at 4:11 PM Jark Wu  wrote:
> >
> >> Hi Martijn,
> >>
> >> Thanks for helping us to find Chesnay.
> >> Could you clarify how "soon" will Chesnay participate? A day or a week?
> >>
> >> I agree with your improvement proposal and Hang exactly follows your
> >> points.
> >> The last-call-for-discussion for Chensay has been sent for more than 72
> >> hours,
> >> even after the Christmas holiday. Therefore, from my point of view, this
> >> FLIP
> >> should have been passed. On the other hand, the Flink Bylaws[1] don't
> >> require
> >> voting to begin until a specific person responds. The 3 days (even work
> >> days) voting
> >> length already considers the committers' reaction time. If someone
> doesn't
> >> vote -1
> >> during the time period, which means he/she doesn't have concerns about
> the
> >> proposal.
> >> If you care about the proposal, PLEASE VOTE and explain the reasons.
> This
> >> is how
> >> the 200+ FLIPs work until today. I'm just very disappointed and
> surprised
> >> we have to
> >> wait for someone to vote for the FLIP who doesn't care about the FLIP
> for
> >> more than a month.
> >>
> >>
> >> Best,
> >> Jark
> >>
> >> [1]: https://cwiki.apache.org/confluence/display/FLINK/Flink+Bylaws
> >>
> >> On Wed, 11 Jan 2023 at 21:14, Martijn Visser 
> >> wrote:
> >>
> >>> Hi Jark and Dong,
> >>>
> >>> I fully understand your concerns in this case. I also think that this
> >>> situation is an exception. This discussion started just before the
> >> holiday
> >>> season in Europe started. The request to Chesnay if he had more
> comments
> >>> was sent just a couple of days before Christmas.
> >>>
> >>>  From my perspective, given how the discussion started in the
> discussion
> >>> thread and the context that was provided, I would have sent an email
> that
> >>> if there are no more comments in the next 72 hours, you would open a
> vote
> >>> thread. Especially if someone raised a concern first. That has
> happened a
> >>> lot on other discussion threads as well, even when there were no more
> >> open
> >>> discussion topics.
> >>>
> >>> While I fully understand the disappointment from your point of view,
> the
> >>> other way around it feels disappointing that this was brought to a
> vote.
> >> So
> >>> let's use the disappointments from both ends to learn and to improve
> >>> overall. Something like:
> >>>
> >>> * If someone raises concerns during a discussion, of course first try
> to
> >>> resolve all concerns.
> >>> * If that person no longer participates in the discussion, send a
> >>> last-call-for-discussions in the discussion thread for 72 hours and
> else
> >>> you will open a vote thread
> >>> * Follow the regular voting process
> >>>
> >>> For this specific FLIP, I've briefly talked to Chesnay offline and I'm
> >> sure
> >>> he will participate soon to unblock it.
> >>>
> >>> Best regards,
> >>>
> >>> Martijn
> >>>
> >>> On Wed, Jan 11, 2023 at 1:47 PM Dong Lin  wrote:
> >>>
>  Hi Martijn,
> 
>  A collaborative and active community is very important for any
> >>> open-source
>  project. Flink 

Re: [ANNOUNCE] New Apache Flink Committer - Lincoln Lee

2023-01-11 Thread Shengkai Fang
Congratulations, Lincoln!

Best,
Shengkai

liu ron  于2023年1月12日周四 09:48写道:

> Congratulations, Lincoln!
>
> Best
>
> Yu Li  于2023年1月12日周四 09:22写道:
>
> > Congratulations, Lincoln!
> >
> > Best Regards,
> > Yu
> >
> >
> > On Wed, 11 Jan 2023 at 21:17, Martijn Visser 
> > wrote:
> >
> > > Congratulations Lincoln, happy to have you on board!
> > >
> > > Best regards, Martijn
> > >
> > >
> > > On Wed, Jan 11, 2023 at 1:49 PM Dong Lin  wrote:
> > >
> > > > Congratulations, Lincoln!
> > > >
> > > > Cheers,
> > > > Dong
> > > >
> > > > On Tue, Jan 10, 2023 at 11:52 AM Jark Wu  wrote:
> > > >
> > > > > Hi everyone,
> > > > >
> > > > > On behalf of the PMC, I'm very happy to announce Lincoln Lee as a
> new
> > > > Flink
> > > > > committer.
> > > > >
> > > > > Lincoln Lee has been a long-term Flink contributor since 2017. He
> > > mainly
> > > > > works on Flink
> > > > > SQL parts and drives several important FLIPs, e.g., FLIP-232 (Retry
> > > Async
> > > > > I/O), FLIP-234 (
> > > > > Retryable Lookup Join), FLIP-260 (TableFunction Finish). Besides,
> He
> > > also
> > > > > contributed
> > > > > much to Streaming Semantics, including the non-determinism problem
> > and
> > > > the
> > > > > message
> > > > > ordering problem.
> > > > >
> > > > > Please join me in congratulating Lincoln for becoming a Flink
> > > committer!
> > > > >
> > > > > Cheers,
> > > > > Jark Wu
> > > > >
> > > >
> > >
> >
>


Re: [ANNOUNCE] New Apache Flink Committer - Lincoln Lee

2023-01-11 Thread liu ron
Congratulations, Lincoln!

Best

Yu Li  于2023年1月12日周四 09:22写道:

> Congratulations, Lincoln!
>
> Best Regards,
> Yu
>
>
> On Wed, 11 Jan 2023 at 21:17, Martijn Visser 
> wrote:
>
> > Congratulations Lincoln, happy to have you on board!
> >
> > Best regards, Martijn
> >
> >
> > On Wed, Jan 11, 2023 at 1:49 PM Dong Lin  wrote:
> >
> > > Congratulations, Lincoln!
> > >
> > > Cheers,
> > > Dong
> > >
> > > On Tue, Jan 10, 2023 at 11:52 AM Jark Wu  wrote:
> > >
> > > > Hi everyone,
> > > >
> > > > On behalf of the PMC, I'm very happy to announce Lincoln Lee as a new
> > > Flink
> > > > committer.
> > > >
> > > > Lincoln Lee has been a long-term Flink contributor since 2017. He
> > mainly
> > > > works on Flink
> > > > SQL parts and drives several important FLIPs, e.g., FLIP-232 (Retry
> > Async
> > > > I/O), FLIP-234 (
> > > > Retryable Lookup Join), FLIP-260 (TableFunction Finish). Besides, He
> > also
> > > > contributed
> > > > much to Streaming Semantics, including the non-determinism problem
> and
> > > the
> > > > message
> > > > ordering problem.
> > > >
> > > > Please join me in congratulating Lincoln for becoming a Flink
> > committer!
> > > >
> > > > Cheers,
> > > > Jark Wu
> > > >
> > >
> >
>


Re: [ANNOUNCE] New Apache Flink Committer - Lincoln Lee

2023-01-11 Thread Yu Li
Congratulations, Lincoln!

Best Regards,
Yu


On Wed, 11 Jan 2023 at 21:17, Martijn Visser 
wrote:

> Congratulations Lincoln, happy to have you on board!
>
> Best regards, Martijn
>
>
> On Wed, Jan 11, 2023 at 1:49 PM Dong Lin  wrote:
>
> > Congratulations, Lincoln!
> >
> > Cheers,
> > Dong
> >
> > On Tue, Jan 10, 2023 at 11:52 AM Jark Wu  wrote:
> >
> > > Hi everyone,
> > >
> > > On behalf of the PMC, I'm very happy to announce Lincoln Lee as a new
> > Flink
> > > committer.
> > >
> > > Lincoln Lee has been a long-term Flink contributor since 2017. He
> mainly
> > > works on Flink
> > > SQL parts and drives several important FLIPs, e.g., FLIP-232 (Retry
> Async
> > > I/O), FLIP-234 (
> > > Retryable Lookup Join), FLIP-260 (TableFunction Finish). Besides, He
> also
> > > contributed
> > > much to Streaming Semantics, including the non-determinism problem and
> > the
> > > message
> > > ordering problem.
> > >
> > > Please join me in congratulating Lincoln for becoming a Flink
> committer!
> > >
> > > Cheers,
> > > Jark Wu
> > >
> >
>


Re: [VOTE] FLIP-274: Introduce metric group for OperatorCoordinator

2023-01-11 Thread Chesnay Schepler
You know, back when I wrote that line I actually felt a bit guilty about 
it and nearly dropped it, since it implicitly accused _someone_ of 
actually being capable/willing to push something through despite voiced 
concerns while no ones looking.


Turns out things are a lot worse that I thought. You're actually 
doubling down on it and even insult me.



As I haven't had a time to revisit the discussion and afaict my concerns 
weren't addressed, I hereby vote -1 (binding).


Please be aware that since I'm effectively only back from vacation 
tomorrow and need to catch up on things in general, I may not be able to 
revisit the discussion this week. In part because I guess I now need to 
double-check everything.


CC'ing the private ML because I find this very troubling behavior.

On 11/01/2023 16:20, Martijn Visser wrote:

Hi Jark,

I disagree with your statement that someone doesn't care about the FLIP,
especially since that person participated in the initial FLIP discussion
and mentioned explicitly "Since I'm on holidays soon, just so no one tries
to pull a fast one on me, if this were to go to a vote as-is I'd be against
it.".

Anyway: I can't tell when Chesnay exactly will participate, but I do expect
that this week.

Best regards,

Martijn

On Wed, Jan 11, 2023 at 4:11 PM Jark Wu  wrote:


Hi Martijn,

Thanks for helping us to find Chesnay.
Could you clarify how "soon" will Chesnay participate? A day or a week?

I agree with your improvement proposal and Hang exactly follows your
points.
The last-call-for-discussion for Chensay has been sent for more than 72
hours,
even after the Christmas holiday. Therefore, from my point of view, this
FLIP
should have been passed. On the other hand, the Flink Bylaws[1] don't
require
voting to begin until a specific person responds. The 3 days (even work
days) voting
length already considers the committers' reaction time. If someone doesn't
vote -1
during the time period, which means he/she doesn't have concerns about the
proposal.
If you care about the proposal, PLEASE VOTE and explain the reasons. This
is how
the 200+ FLIPs work until today. I'm just very disappointed and surprised
we have to
wait for someone to vote for the FLIP who doesn't care about the FLIP for
more than a month.


Best,
Jark

[1]: https://cwiki.apache.org/confluence/display/FLINK/Flink+Bylaws

On Wed, 11 Jan 2023 at 21:14, Martijn Visser 
wrote:


Hi Jark and Dong,

I fully understand your concerns in this case. I also think that this
situation is an exception. This discussion started just before the

holiday

season in Europe started. The request to Chesnay if he had more comments
was sent just a couple of days before Christmas.

 From my perspective, given how the discussion started in the discussion
thread and the context that was provided, I would have sent an email that
if there are no more comments in the next 72 hours, you would open a vote
thread. Especially if someone raised a concern first. That has happened a
lot on other discussion threads as well, even when there were no more

open

discussion topics.

While I fully understand the disappointment from your point of view, the
other way around it feels disappointing that this was brought to a vote.

So

let's use the disappointments from both ends to learn and to improve
overall. Something like:

* If someone raises concerns during a discussion, of course first try to
resolve all concerns.
* If that person no longer participates in the discussion, send a
last-call-for-discussions in the discussion thread for 72 hours and else
you will open a vote thread
* Follow the regular voting process

For this specific FLIP, I've briefly talked to Chesnay offline and I'm

sure

he will participate soon to unblock it.

Best regards,

Martijn

On Wed, Jan 11, 2023 at 1:47 PM Dong Lin  wrote:


Hi Martijn,

A collaborative and active community is very important for any

open-source

project. Flink can succeed today because we have many experienced and
passionate developers who collaborate together to develop Flink. It is
important that developers can give constructive feedback and help each
other be productive.

I find it kind of surprising and disappointing that we have to wait for

one

particular developer for more than 30 days to get a reply before being
allowed forward and making progress. It is hard to image what would

happen

if every committer can take 30+ days to reply to a FLIP and still

expects

the FLIP to wait for the reply. Flink community will likely be dead if

this

is the culture that Flink community uses to treat contributors.

Could you explain how long we have to wait before making progress for

this

FLIP? And in the future, what would be the resonable timeframe to wait

for

a reply before we can open the voting thread?

Thanks,
Dong


On Wed, Jan 11, 2023 at 4:17 PM Martijn Visser <

martijnvis...@apache.org

wrote:


-1 (binding) currently: I don't think this should have gone to a vote

yet

given that Chesnay 

Re: "Introduction to Apache Flink" presentation

2023-01-11 Thread Ryan Skraba
Hello Joe -- sorry for the late reply, a colleague just showed me this.

I don't know if you're aware, but there's an incubating project called
Apache Training (Incubating) [1] that collects introductory and training
materials for various Apache projects.  All ASF committers have write
permissions on the incubator-training repository.

For your first point, if you're looking for some common, shared reveal.js
(and asciidoctor)-based tools, this might be a good home for a Flink
presentation.  There's still some room for improvement in the shared code,
of course, but that's the point of sharing the code :D  I did my ApacheCon
Avro presentation with it (but STILL haven't gotten around to contributing
it).

There's not a lot of momentum in the Training project at the moment, but
IMO it's fundamentally a pretty good idea for the community.  If you go
that route, I'd be willing to help out with review and conversion to
reveal.js and asciidoctor; a lot of the conversion work is just fiddling
around with layout until it looks OK.

All that being said, the presentation looks really good and I'd probably be
motivated to see it!  (is there a video yet?)

All my best, Ryan

[1]: https://training.apache.org/ "Training website"
[2]:
https://training.apache.org/presentations/incubator/navigating-asf-incubation/index.html
"Example deployed presentation"
[3]: https://github.com/apache/incubator-training "GitHub repository"

On Thu, Nov 17, 2022 at 11:57 AM Johannes Moser  wrote:

> Hi all,
>
> I was talking at a meetup yesterday about Apache Flink.
> There is this Google Presentation that was created earlier and never has
> been updated. As I had to come up with a presentation anyhow I used this
> and updated it. See the result in [1].
>
> First, I'd like to get feedback on the updated presentation. So please if
> you got some time, have a look at it and comment. Everyone should be able
> to do that.
>
> Having a default presentation that people can use to talk about Apache
> Flink is a good thing, so I'd like to keep that. I want to test the
> sentiment of the community with a couple of connected ideas.
>
> 1) Moving the presentation to a more sustainable technology that can also
> be incorporated better into the Apache Flink workflow.
> I'd suggest using something like reveal.js and adding the presentation to
> flink-web, so everyone has an easier time contributing. I guess the
> presentation would also benefit from a review process and an update could
> be included in release processes.
>
> 2) Adding a list of people that might be open to giving that presentation,
> so that Meetup organizers can reach out to them. Other communities have
> such "community champion" setups. The Apache Flink community would benefit
> from that as well.
>
> Happy to hear your feedback.
>
> Thanks,
> Joe
>
> [1]
>
> https://eu01.z.antigena.com/l/FScYr7uhmgjism6OHA9nu7LhyL6Qnf6vwbVF9hyy6sxoGIX461XJZFXXYApTrdOSPMSaWQXc2cSpICl4dkelyd3n0FcDTlT4~ICYKcio~3oA3xaZByzwUqiDXr-po8ciAUIA12eHuVtoXH7ir9RJCzVwuj45r8Eql9UT9ENdseAP6mzz6vccT-L0V5YgdR98gCssQg5FhqpWPVD9Kx~twT7W5ZuyvHR9jvz91BNEXG-9ITh5
>


[jira] [Created] (FLINK-30639) Publish SBOM for externalized connectors

2023-01-11 Thread Jira
Márton Balassi created FLINK-30639:
--

 Summary: Publish SBOM for externalized connectors
 Key: FLINK-30639
 URL: https://issues.apache.org/jira/browse/FLINK-30639
 Project: Flink
  Issue Type: Sub-task
Reporter: Márton Balassi
Assignee: Zsombor Chikan


Probably adding to the connector parent will be sufficient.
https://github.com/apache/flink-connector-aws/blob/main/pom.xml#L27
https://github.com/zentol/flink-connector-parent/blob/master/pom.xml



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


Re: [VOTE] FLIP-274: Introduce metric group for OperatorCoordinator

2023-01-11 Thread Martijn Visser
Hi Jark,

I disagree with your statement that someone doesn't care about the FLIP,
especially since that person participated in the initial FLIP discussion
and mentioned explicitly "Since I'm on holidays soon, just so no one tries
to pull a fast one on me, if this were to go to a vote as-is I'd be against
it.".

Anyway: I can't tell when Chesnay exactly will participate, but I do expect
that this week.

Best regards,

Martijn

On Wed, Jan 11, 2023 at 4:11 PM Jark Wu  wrote:

> Hi Martijn,
>
> Thanks for helping us to find Chesnay.
> Could you clarify how "soon" will Chesnay participate? A day or a week?
>
> I agree with your improvement proposal and Hang exactly follows your
> points.
> The last-call-for-discussion for Chensay has been sent for more than 72
> hours,
> even after the Christmas holiday. Therefore, from my point of view, this
> FLIP
> should have been passed. On the other hand, the Flink Bylaws[1] don't
> require
> voting to begin until a specific person responds. The 3 days (even work
> days) voting
> length already considers the committers' reaction time. If someone doesn't
> vote -1
> during the time period, which means he/she doesn't have concerns about the
> proposal.
> If you care about the proposal, PLEASE VOTE and explain the reasons. This
> is how
> the 200+ FLIPs work until today. I'm just very disappointed and surprised
> we have to
> wait for someone to vote for the FLIP who doesn't care about the FLIP for
> more than a month.
>
>
> Best,
> Jark
>
> [1]: https://cwiki.apache.org/confluence/display/FLINK/Flink+Bylaws
>
> On Wed, 11 Jan 2023 at 21:14, Martijn Visser 
> wrote:
>
> > Hi Jark and Dong,
> >
> > I fully understand your concerns in this case. I also think that this
> > situation is an exception. This discussion started just before the
> holiday
> > season in Europe started. The request to Chesnay if he had more comments
> > was sent just a couple of days before Christmas.
> >
> > From my perspective, given how the discussion started in the discussion
> > thread and the context that was provided, I would have sent an email that
> > if there are no more comments in the next 72 hours, you would open a vote
> > thread. Especially if someone raised a concern first. That has happened a
> > lot on other discussion threads as well, even when there were no more
> open
> > discussion topics.
> >
> > While I fully understand the disappointment from your point of view, the
> > other way around it feels disappointing that this was brought to a vote.
> So
> > let's use the disappointments from both ends to learn and to improve
> > overall. Something like:
> >
> > * If someone raises concerns during a discussion, of course first try to
> > resolve all concerns.
> > * If that person no longer participates in the discussion, send a
> > last-call-for-discussions in the discussion thread for 72 hours and else
> > you will open a vote thread
> > * Follow the regular voting process
> >
> > For this specific FLIP, I've briefly talked to Chesnay offline and I'm
> sure
> > he will participate soon to unblock it.
> >
> > Best regards,
> >
> > Martijn
> >
> > On Wed, Jan 11, 2023 at 1:47 PM Dong Lin  wrote:
> >
> > > Hi Martijn,
> > >
> > > A collaborative and active community is very important for any
> > open-source
> > > project. Flink can succeed today because we have many experienced and
> > > passionate developers who collaborate together to develop Flink. It is
> > > important that developers can give constructive feedback and help each
> > > other be productive.
> > >
> > > I find it kind of surprising and disappointing that we have to wait for
> > one
> > > particular developer for more than 30 days to get a reply before being
> > > allowed forward and making progress. It is hard to image what would
> > happen
> > > if every committer can take 30+ days to reply to a FLIP and still
> expects
> > > the FLIP to wait for the reply. Flink community will likely be dead if
> > this
> > > is the culture that Flink community uses to treat contributors.
> > >
> > > Could you explain how long we have to wait before making progress for
> > this
> > > FLIP? And in the future, what would be the resonable timeframe to wait
> > for
> > > a reply before we can open the voting thread?
> > >
> > > Thanks,
> > > Dong
> > >
> > >
> > > On Wed, Jan 11, 2023 at 4:17 PM Martijn Visser <
> martijnvis...@apache.org
> > >
> > > wrote:
> > >
> > > > -1 (binding) currently: I don't think this should have gone to a vote
> > yet
> > > > given that Chesnay deliberately mentioned that he would vote against
> it
> > > > as-is. The discussion should have been settled first.
> > > >
> > > > On Tue, Jan 10, 2023 at 10:51 AM Zhu Zhu  wrote:
> > > >
> > > > > +1 (binding)
> > > > >
> > > > > Thanks,
> > > > > Zhu
> > > > >
> > > > > Jark Wu  于2023年1月10日周二 17:43写道:
> > > > > >
> > > > > > +1 (binding)
> > > > > >
> > > > > > Best,
> > > > > > Jark
> > > > > >
> > > > > > > 2023年1月10日 12:02,Qingsheng Ren  写道:
> > > > > > >
> > 

Re: [VOTE] FLIP-274: Introduce metric group for OperatorCoordinator

2023-01-11 Thread Jark Wu
Hi Martijn,

Thanks for helping us to find Chesnay.
Could you clarify how "soon" will Chesnay participate? A day or a week?

I agree with your improvement proposal and Hang exactly follows your
points.
The last-call-for-discussion for Chensay has been sent for more than 72
hours,
even after the Christmas holiday. Therefore, from my point of view, this
FLIP
should have been passed. On the other hand, the Flink Bylaws[1] don't
require
voting to begin until a specific person responds. The 3 days (even work
days) voting
length already considers the committers' reaction time. If someone doesn't
vote -1
during the time period, which means he/she doesn't have concerns about the
proposal.
If you care about the proposal, PLEASE VOTE and explain the reasons. This
is how
the 200+ FLIPs work until today. I'm just very disappointed and surprised
we have to
wait for someone to vote for the FLIP who doesn't care about the FLIP for
more than a month.


Best,
Jark

[1]: https://cwiki.apache.org/confluence/display/FLINK/Flink+Bylaws

On Wed, 11 Jan 2023 at 21:14, Martijn Visser 
wrote:

> Hi Jark and Dong,
>
> I fully understand your concerns in this case. I also think that this
> situation is an exception. This discussion started just before the holiday
> season in Europe started. The request to Chesnay if he had more comments
> was sent just a couple of days before Christmas.
>
> From my perspective, given how the discussion started in the discussion
> thread and the context that was provided, I would have sent an email that
> if there are no more comments in the next 72 hours, you would open a vote
> thread. Especially if someone raised a concern first. That has happened a
> lot on other discussion threads as well, even when there were no more open
> discussion topics.
>
> While I fully understand the disappointment from your point of view, the
> other way around it feels disappointing that this was brought to a vote. So
> let's use the disappointments from both ends to learn and to improve
> overall. Something like:
>
> * If someone raises concerns during a discussion, of course first try to
> resolve all concerns.
> * If that person no longer participates in the discussion, send a
> last-call-for-discussions in the discussion thread for 72 hours and else
> you will open a vote thread
> * Follow the regular voting process
>
> For this specific FLIP, I've briefly talked to Chesnay offline and I'm sure
> he will participate soon to unblock it.
>
> Best regards,
>
> Martijn
>
> On Wed, Jan 11, 2023 at 1:47 PM Dong Lin  wrote:
>
> > Hi Martijn,
> >
> > A collaborative and active community is very important for any
> open-source
> > project. Flink can succeed today because we have many experienced and
> > passionate developers who collaborate together to develop Flink. It is
> > important that developers can give constructive feedback and help each
> > other be productive.
> >
> > I find it kind of surprising and disappointing that we have to wait for
> one
> > particular developer for more than 30 days to get a reply before being
> > allowed forward and making progress. It is hard to image what would
> happen
> > if every committer can take 30+ days to reply to a FLIP and still expects
> > the FLIP to wait for the reply. Flink community will likely be dead if
> this
> > is the culture that Flink community uses to treat contributors.
> >
> > Could you explain how long we have to wait before making progress for
> this
> > FLIP? And in the future, what would be the resonable timeframe to wait
> for
> > a reply before we can open the voting thread?
> >
> > Thanks,
> > Dong
> >
> >
> > On Wed, Jan 11, 2023 at 4:17 PM Martijn Visser  >
> > wrote:
> >
> > > -1 (binding) currently: I don't think this should have gone to a vote
> yet
> > > given that Chesnay deliberately mentioned that he would vote against it
> > > as-is. The discussion should have been settled first.
> > >
> > > On Tue, Jan 10, 2023 at 10:51 AM Zhu Zhu  wrote:
> > >
> > > > +1 (binding)
> > > >
> > > > Thanks,
> > > > Zhu
> > > >
> > > > Jark Wu  于2023年1月10日周二 17:43写道:
> > > > >
> > > > > +1 (binding)
> > > > >
> > > > > Best,
> > > > > Jark
> > > > >
> > > > > > 2023年1月10日 12:02,Qingsheng Ren  写道:
> > > > > >
> > > > > > Thanks for the FLIP!
> > > > > >
> > > > > > +1 (binding)
> > > > > >
> > > > > > Best,
> > > > > > Qingsheng
> > > > > >
> > > > > > On Wed, Jan 4, 2023 at 10:08 AM Hang Ruan <
> ruanhang1...@gmail.com>
> > > > wrote:
> > > > > >
> > > > > >> Hi all,
> > > > > >>
> > > > > >> Thanks for all the feedback so far.
> > > > > >> Based on the discussion[1], we have come to a consensus, so I
> > would
> > > > like to
> > > > > >> start a vote on FLIP-274: Introduce metric group for
> > > > > >> OperatorCoordinator[2].
> > > > > >>
> > > > > >> The vote will last for at least 72 hours (Jan 7th at 11:00 GMT)
> > > unless
> > > > > >> there is an objection or insufficient votes.
> > > > > >>
> > > > > >> [1]
> > > 

[jira] [Created] (FLINK-30638) Operator doesn't forward allowNonRestoredState flag

2023-01-11 Thread Alexis Sarda-Espinosa (Jira)
Alexis Sarda-Espinosa created FLINK-30638:
-

 Summary: Operator doesn't forward allowNonRestoredState flag
 Key: FLINK-30638
 URL: https://issues.apache.org/jira/browse/FLINK-30638
 Project: Flink
  Issue Type: Bug
  Components: Kubernetes Operator
Affects Versions: kubernetes-operator-1.3.1
Reporter: Alexis Sarda-Espinosa
 Attachments: flink-kubernetes-operator_json.txt

My Flink image is currently based on 1.15, and I have HA enabled. After an 
upgrade where I removed some transformations from the pipeline, I see the 
following exception in the deployec Flink cluster:

Job 38585f46d840b3c2e9f843e230398d35 reached terminal state FAILED.
org.apache.flink.runtime.client.JobInitializationException: Could not start the 
JobMaster.
...
Caused by: java.util.concurrent.CompletionException: 
java.lang.IllegalStateException: There is no operator for the state 
d3bab17db5b56f81eb48b3e7d5cce15d

I'm attaching some of the operator's logs, but I don't see anything strange 
there.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


Re: [DISCUSS] FLIP-281 Sink Supports Speculative Execution For Batch Job

2023-01-11 Thread Martijn Visser
Hi Biao,

Apologies for the late jumping in. My only question is about SinkFunction,
does this imply that we want to add support for this to the SinkFunction?
If so, I would not be in favour of that since we would like to deprecate (I
actually thought that was already the case) the SinkFunction in favour of
SinkV2.

Besides that, I have no other comments.

Best regards,

Martijn

On Wed, Jan 4, 2023 at 7:28 AM Jing Zhang  wrote:

> Hi Biao,
>
> Thanks for explanation.
>
> +1 for the proposal.
>
> Best,
> Jing Zhang
>
> Lijie Wang  于2023年1月4日周三 12:11写道:
>
> > Hi Biao,
> >
> > Thanks for the explanation of how SinkV2  knows the right subtask
> > attempt. I have no more questions, +1 for the proposal.
> >
> > Best,
> > Lijie
> >
> > Biao Liu  于2022年12月28日周三 17:22写道:
> >
> > > Thanks for all your feedback!
> > >
> > > To @Yuxia,
> > >
> > > > What the sink expect to do to isolate data produced by speculative
> > > > executions?  IIUC, if the taks failover, it also generate a new
> > attempt.
> > > > Does it make difference in isolating data produced?
> > >
> > >
> > > Yes there is something different from the task failover scenario. The
> > > attempt number is more necessary for speculative execution than
> failover.
> > > Because there can be only one subtask instance running at the same time
> > in
> > > the failover scenario.
> > >
> > > Let's take FileSystemOutputFormat as an example. For the failover
> > scenario,
> > > the temporary directory to store produced data can be something like
> > > "$root_dir/task-$taskNumber/". At the initialization phase, subtask
> > deletes
> > > and re-creates the temporary directory.
> > >
> > > However in the speculative execution scenario, it does not work because
> > > there might be several subtasks running at the same time. These
> subtasks
> > > might delete, re-create and write the same temporary directory at the
> > > same time. The correct temporary directory should be like
> > > "$root_dir/task-$taskNumber/attempt-$attemptNumber". So it's necessary
> to
> > > expose the attempt number to the Sink implementation to do the data
> > > isolation.
> > >
> > >
> > > To @Lijie,
> > >
> > > > I have a question about this: does SinkV2 need to do the same thing?
> > >
> > >
> > > Actually, yes.
> > >
> > > Should we/users do it in the committer? If yes, how does the commiter
> > know
> > > > which one is the right subtask attempt?
> > >
> > >
> > > Yes, we/users should do it in the committer.
> > >
> > > In the current design, the Committer of Sink V2 should get the "which
> one
> > > is the right subtask attempt" information from the "committable data''
> > > produced by SinkWriter. Let's take the FileSink as example, the
> > > "committable data" sent to the Committer contains the full path of the
> > > files produced by SinkWriter. Users could also pass the attempt number
> > > through "committable data" from SinkWriter to Committer.
> > >
> > > In the "Rejected Alternatives -> Introduce a way to clean leaked data
> of
> > > Sink V2" section of the FLIP document, we discussed some of the reasons
> > > that we didn't provide the API like OutputFormat.
> > >
> > > To @Jing Zhang
> > >
> > > I have a question about this: Speculative execution of Committer will
> be
> > > > disabled.
> > >
> > > I agree with your point and I saw the similar requirements to disable
> > > speculative
> > > > execution for specified operators.
> > >
> > > However the requirement is not supported currently. I think there
> > > should be some
> > > > place to describe how to support it.
> > >
> > >
> > > In this FLIP design, the speculative execution of Committer of Sink V2
> > will
> > > be disabled by Flink. It's not an optional operation. Users can not
> > change
> > > it.
> > > And as you said, "disable speculative execution for specified
> operators"
> > is
> > > not supported in the FLIP. Because it's a bit out of scope: "Sink
> > Supports
> > > Speculative Execution For Batch Job". I think it's better to start
> > another
> > > FLIP to discuss it. "Fine-grained control of enabling speculative
> > execution
> > > for operators" can be the title of that FLIP. And we can discuss there
> > how
> > > to enable or disable speculative execution for specified operators
> > > including Committer and pre/post-committer of Sink V2.
> > >
> > > What do you think?
> > >
> > > Thanks,
> > > Biao /'bɪ.aʊ/
> > >
> > >
> > >
> > > On Wed, 28 Dec 2022 at 11:30, Jing Zhang  wrote:
> > >
> > > > Hi Biao,
> > > >
> > > > Thanks for driving this FLIP. It's meaningful to support speculative
> > > > execution
> > > > of sinks is important.
> > > >
> > > > I have a question about this: Speculative execution of Committer will
> > be
> > > > disabled.
> > > >
> > > > I agree with your point and I saw the similar requirements to disable
> > > > speculative execution for specified operators.
> > > >
> > > > However the requirement is not supported currently. I think there
> > should
> > > be
> > > > some place to 

Re: [ANNOUNCE] New Apache Flink Committer - Lincoln Lee

2023-01-11 Thread Martijn Visser
Congratulations Lincoln, happy to have you on board!

Best regards, Martijn


On Wed, Jan 11, 2023 at 1:49 PM Dong Lin  wrote:

> Congratulations, Lincoln!
>
> Cheers,
> Dong
>
> On Tue, Jan 10, 2023 at 11:52 AM Jark Wu  wrote:
>
> > Hi everyone,
> >
> > On behalf of the PMC, I'm very happy to announce Lincoln Lee as a new
> Flink
> > committer.
> >
> > Lincoln Lee has been a long-term Flink contributor since 2017. He mainly
> > works on Flink
> > SQL parts and drives several important FLIPs, e.g., FLIP-232 (Retry Async
> > I/O), FLIP-234 (
> > Retryable Lookup Join), FLIP-260 (TableFunction Finish). Besides, He also
> > contributed
> > much to Streaming Semantics, including the non-determinism problem and
> the
> > message
> > ordering problem.
> >
> > Please join me in congratulating Lincoln for becoming a Flink committer!
> >
> > Cheers,
> > Jark Wu
> >
>


Re: [VOTE] FLIP-274: Introduce metric group for OperatorCoordinator

2023-01-11 Thread Martijn Visser
Hi Jark and Dong,

I fully understand your concerns in this case. I also think that this
situation is an exception. This discussion started just before the holiday
season in Europe started. The request to Chesnay if he had more comments
was sent just a couple of days before Christmas.

>From my perspective, given how the discussion started in the discussion
thread and the context that was provided, I would have sent an email that
if there are no more comments in the next 72 hours, you would open a vote
thread. Especially if someone raised a concern first. That has happened a
lot on other discussion threads as well, even when there were no more open
discussion topics.

While I fully understand the disappointment from your point of view, the
other way around it feels disappointing that this was brought to a vote. So
let's use the disappointments from both ends to learn and to improve
overall. Something like:

* If someone raises concerns during a discussion, of course first try to
resolve all concerns.
* If that person no longer participates in the discussion, send a
last-call-for-discussions in the discussion thread for 72 hours and else
you will open a vote thread
* Follow the regular voting process

For this specific FLIP, I've briefly talked to Chesnay offline and I'm sure
he will participate soon to unblock it.

Best regards,

Martijn

On Wed, Jan 11, 2023 at 1:47 PM Dong Lin  wrote:

> Hi Martijn,
>
> A collaborative and active community is very important for any open-source
> project. Flink can succeed today because we have many experienced and
> passionate developers who collaborate together to develop Flink. It is
> important that developers can give constructive feedback and help each
> other be productive.
>
> I find it kind of surprising and disappointing that we have to wait for one
> particular developer for more than 30 days to get a reply before being
> allowed forward and making progress. It is hard to image what would happen
> if every committer can take 30+ days to reply to a FLIP and still expects
> the FLIP to wait for the reply. Flink community will likely be dead if this
> is the culture that Flink community uses to treat contributors.
>
> Could you explain how long we have to wait before making progress for this
> FLIP? And in the future, what would be the resonable timeframe to wait for
> a reply before we can open the voting thread?
>
> Thanks,
> Dong
>
>
> On Wed, Jan 11, 2023 at 4:17 PM Martijn Visser 
> wrote:
>
> > -1 (binding) currently: I don't think this should have gone to a vote yet
> > given that Chesnay deliberately mentioned that he would vote against it
> > as-is. The discussion should have been settled first.
> >
> > On Tue, Jan 10, 2023 at 10:51 AM Zhu Zhu  wrote:
> >
> > > +1 (binding)
> > >
> > > Thanks,
> > > Zhu
> > >
> > > Jark Wu  于2023年1月10日周二 17:43写道:
> > > >
> > > > +1 (binding)
> > > >
> > > > Best,
> > > > Jark
> > > >
> > > > > 2023年1月10日 12:02,Qingsheng Ren  写道:
> > > > >
> > > > > Thanks for the FLIP!
> > > > >
> > > > > +1 (binding)
> > > > >
> > > > > Best,
> > > > > Qingsheng
> > > > >
> > > > > On Wed, Jan 4, 2023 at 10:08 AM Hang Ruan 
> > > wrote:
> > > > >
> > > > >> Hi all,
> > > > >>
> > > > >> Thanks for all the feedback so far.
> > > > >> Based on the discussion[1], we have come to a consensus, so I
> would
> > > like to
> > > > >> start a vote on FLIP-274: Introduce metric group for
> > > > >> OperatorCoordinator[2].
> > > > >>
> > > > >> The vote will last for at least 72 hours (Jan 7th at 11:00 GMT)
> > unless
> > > > >> there is an objection or insufficient votes.
> > > > >>
> > > > >> [1]
> > https://lists.apache.org/thread/63m9w60rndqnrqvgb6qosvt2bcbww53k
> > > > >> [2]
> > > > >>
> > > > >>
> > >
> >
> https://cwiki.apache.org/confluence/display/FLINK/FLIP-274%3A+Introduce+metric+group+for+OperatorCoordinator
> > > > >>
> > > > >> Best,
> > > > >> Hang
> > > > >>
> > > >
> > >
> >
>


[jira] [Created] (FLINK-30637) In linux-aarch64 environment, using “is” judgment to match the window type of overwindow have returned incorrect matching results

2023-01-11 Thread Xin Chen (Jira)
Xin Chen created FLINK-30637:


 Summary: In linux-aarch64 environment, using “is” judgment to 
match the window type of overwindow have returned incorrect matching results
 Key: FLINK-30637
 URL: https://issues.apache.org/jira/browse/FLINK-30637
 Project: Flink
  Issue Type: Bug
  Components: API / Python
Affects Versions: 1.13.6
 Environment: Linux version 5.10.0-60.18.0.50.oe2203.aarch64

(abuild@obs-worker-002) (gcc_old (GCC) 10.3.1, GNU ld (GNU Binutils) 2.37)

#1 SMP Wed Mar 30 02:43:08 UTC 2022

 

pyflink-version:1.13.6
Reporter: Xin Chen
 Fix For: 1.13.6


In  linux-arch64 environment, “window_type is 
OverWindow.ROW_UNBOUNDED_FOLLOWING” in  in the 
PandasBatchOverWindowAggregateFunctionOperation class of the pyflink source 
code has returned the wrong result.

 



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


Re: [ANNOUNCE] New Apache Flink Committer - Lincoln Lee

2023-01-11 Thread Dong Lin
Congratulations, Lincoln!

Cheers,
Dong

On Tue, Jan 10, 2023 at 11:52 AM Jark Wu  wrote:

> Hi everyone,
>
> On behalf of the PMC, I'm very happy to announce Lincoln Lee as a new Flink
> committer.
>
> Lincoln Lee has been a long-term Flink contributor since 2017. He mainly
> works on Flink
> SQL parts and drives several important FLIPs, e.g., FLIP-232 (Retry Async
> I/O), FLIP-234 (
> Retryable Lookup Join), FLIP-260 (TableFunction Finish). Besides, He also
> contributed
> much to Streaming Semantics, including the non-determinism problem and the
> message
> ordering problem.
>
> Please join me in congratulating Lincoln for becoming a Flink committer!
>
> Cheers,
> Jark Wu
>


Re: [VOTE] FLIP-274: Introduce metric group for OperatorCoordinator

2023-01-11 Thread Dong Lin
Hi Martijn,

A collaborative and active community is very important for any open-source
project. Flink can succeed today because we have many experienced and
passionate developers who collaborate together to develop Flink. It is
important that developers can give constructive feedback and help each
other be productive.

I find it kind of surprising and disappointing that we have to wait for one
particular developer for more than 30 days to get a reply before being
allowed forward and making progress. It is hard to image what would happen
if every committer can take 30+ days to reply to a FLIP and still expects
the FLIP to wait for the reply. Flink community will likely be dead if this
is the culture that Flink community uses to treat contributors.

Could you explain how long we have to wait before making progress for this
FLIP? And in the future, what would be the resonable timeframe to wait for
a reply before we can open the voting thread?

Thanks,
Dong


On Wed, Jan 11, 2023 at 4:17 PM Martijn Visser 
wrote:

> -1 (binding) currently: I don't think this should have gone to a vote yet
> given that Chesnay deliberately mentioned that he would vote against it
> as-is. The discussion should have been settled first.
>
> On Tue, Jan 10, 2023 at 10:51 AM Zhu Zhu  wrote:
>
> > +1 (binding)
> >
> > Thanks,
> > Zhu
> >
> > Jark Wu  于2023年1月10日周二 17:43写道:
> > >
> > > +1 (binding)
> > >
> > > Best,
> > > Jark
> > >
> > > > 2023年1月10日 12:02,Qingsheng Ren  写道:
> > > >
> > > > Thanks for the FLIP!
> > > >
> > > > +1 (binding)
> > > >
> > > > Best,
> > > > Qingsheng
> > > >
> > > > On Wed, Jan 4, 2023 at 10:08 AM Hang Ruan 
> > wrote:
> > > >
> > > >> Hi all,
> > > >>
> > > >> Thanks for all the feedback so far.
> > > >> Based on the discussion[1], we have come to a consensus, so I would
> > like to
> > > >> start a vote on FLIP-274: Introduce metric group for
> > > >> OperatorCoordinator[2].
> > > >>
> > > >> The vote will last for at least 72 hours (Jan 7th at 11:00 GMT)
> unless
> > > >> there is an objection or insufficient votes.
> > > >>
> > > >> [1]
> https://lists.apache.org/thread/63m9w60rndqnrqvgb6qosvt2bcbww53k
> > > >> [2]
> > > >>
> > > >>
> >
> https://cwiki.apache.org/confluence/display/FLINK/FLIP-274%3A+Introduce+metric+group+for+OperatorCoordinator
> > > >>
> > > >> Best,
> > > >> Hang
> > > >>
> > >
> >
>


Re: [ANNOUNCE] New Apache Flink Committer - Lincoln Lee

2023-01-11 Thread Jun Qin
Congrats, Lincoln! Well deserved!

> On Jan 11, 2023, at 11:23 AM, Xintong Song  wrote:
> 
> Congratulations, Lincoln! Well deserved.
> 
> Best,
> 
> Xintong
> 
> 
> 
> On Wed, Jan 11, 2023 at 5:48 PM Matthias Pohl
>  wrote:
> 
>> Congratulations, Lincoln!
>> 
>> On Wed, Jan 11, 2023 at 10:17 AM Shuo Cheng  wrote:
>> 
>>> Congratulations, Lincoln!
>>> 
>>> On Tue, Jan 10, 2023 at 11:52 AM Jark Wu  wrote:
>>> 
 Hi everyone,
 
 On behalf of the PMC, I'm very happy to announce Lincoln Lee as a new
>>> Flink
 committer.
 
 Lincoln Lee has been a long-term Flink contributor since 2017. He
>> mainly
 works on Flink
 SQL parts and drives several important FLIPs, e.g., FLIP-232 (Retry
>> Async
 I/O), FLIP-234 (
 Retryable Lookup Join), FLIP-260 (TableFunction Finish). Besides, He
>> also
 contributed
 much to Streaming Semantics, including the non-determinism problem and
>>> the
 message
 ordering problem.
 
 Please join me in congratulating Lincoln for becoming a Flink
>> committer!
 
 Cheers,
 Jark Wu
 
>>> 
>> 



Re: [ANNOUNCE] New Apache Flink Committer - Lincoln Lee

2023-01-11 Thread Xintong Song
Congratulations, Lincoln! Well deserved.

Best,

Xintong



On Wed, Jan 11, 2023 at 5:48 PM Matthias Pohl
 wrote:

> Congratulations, Lincoln!
>
> On Wed, Jan 11, 2023 at 10:17 AM Shuo Cheng  wrote:
>
> > Congratulations, Lincoln!
> >
> > On Tue, Jan 10, 2023 at 11:52 AM Jark Wu  wrote:
> >
> > > Hi everyone,
> > >
> > > On behalf of the PMC, I'm very happy to announce Lincoln Lee as a new
> > Flink
> > > committer.
> > >
> > > Lincoln Lee has been a long-term Flink contributor since 2017. He
> mainly
> > > works on Flink
> > > SQL parts and drives several important FLIPs, e.g., FLIP-232 (Retry
> Async
> > > I/O), FLIP-234 (
> > > Retryable Lookup Join), FLIP-260 (TableFunction Finish). Besides, He
> also
> > > contributed
> > > much to Streaming Semantics, including the non-determinism problem and
> > the
> > > message
> > > ordering problem.
> > >
> > > Please join me in congratulating Lincoln for becoming a Flink
> committer!
> > >
> > > Cheers,
> > > Jark Wu
> > >
> >
>


[jira] [Created] (FLINK-30636) Typo fix: "to to" -> "to"

2023-01-11 Thread Gunnar Morling (Jira)
Gunnar Morling created FLINK-30636:
--

 Summary: Typo fix: "to to" -> "to"
 Key: FLINK-30636
 URL: https://issues.apache.org/jira/browse/FLINK-30636
 Project: Flink
  Issue Type: Technical Debt
Reporter: Gunnar Morling


There's a surprising number of occurrences of "to to" in JavaDoc and the like, 
where it actually should just be "to".



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


Re: [ANNOUNCE] New Apache Flink Committer - Lincoln Lee

2023-01-11 Thread Matthias Pohl
Congratulations, Lincoln!

On Wed, Jan 11, 2023 at 10:17 AM Shuo Cheng  wrote:

> Congratulations, Lincoln!
>
> On Tue, Jan 10, 2023 at 11:52 AM Jark Wu  wrote:
>
> > Hi everyone,
> >
> > On behalf of the PMC, I'm very happy to announce Lincoln Lee as a new
> Flink
> > committer.
> >
> > Lincoln Lee has been a long-term Flink contributor since 2017. He mainly
> > works on Flink
> > SQL parts and drives several important FLIPs, e.g., FLIP-232 (Retry Async
> > I/O), FLIP-234 (
> > Retryable Lookup Join), FLIP-260 (TableFunction Finish). Besides, He also
> > contributed
> > much to Streaming Semantics, including the non-determinism problem and
> the
> > message
> > ordering problem.
> >
> > Please join me in congratulating Lincoln for becoming a Flink committer!
> >
> > Cheers,
> > Jark Wu
> >
>


Re: [VOTE] FLIP-274: Introduce metric group for OperatorCoordinator

2023-01-11 Thread Jark Wu
Hi Martijn,

I think we should have a reasonable time window to wait for the discussion
response.
Many proposals will halt if we have to wait for every reply indefinitely.
Chesnay raised some concerns in the thread one month ago and no objections
or feedback after that.
Hang replied to all the questions and others also think the concerns were
addressed.
Hang also announced waiting for the final discussion feedback on 26 Dec
2022,
but no feedback was received from Chesnay before the vote started on 4 Jan
2023.
So I think the discussion has been settled.

I think we shouldn't block a FLIP indefinitely. Either having a clear
waiting time or continuing the development of the FLIP.
In the end, we still have a chance to revisit the FLIP before the 1.17
release if Chesnay comes back to the community.

Best,
Jark

On Wed, 11 Jan 2023 at 16:17, Martijn Visser 
wrote:

> -1 (binding) currently: I don't think this should have gone to a vote yet
> given that Chesnay deliberately mentioned that he would vote against it
> as-is. The discussion should have been settled first.
>
> On Tue, Jan 10, 2023 at 10:51 AM Zhu Zhu  wrote:
>
> > +1 (binding)
> >
> > Thanks,
> > Zhu
> >
> > Jark Wu  于2023年1月10日周二 17:43写道:
> > >
> > > +1 (binding)
> > >
> > > Best,
> > > Jark
> > >
> > > > 2023年1月10日 12:02,Qingsheng Ren  写道:
> > > >
> > > > Thanks for the FLIP!
> > > >
> > > > +1 (binding)
> > > >
> > > > Best,
> > > > Qingsheng
> > > >
> > > > On Wed, Jan 4, 2023 at 10:08 AM Hang Ruan 
> > wrote:
> > > >
> > > >> Hi all,
> > > >>
> > > >> Thanks for all the feedback so far.
> > > >> Based on the discussion[1], we have come to a consensus, so I would
> > like to
> > > >> start a vote on FLIP-274: Introduce metric group for
> > > >> OperatorCoordinator[2].
> > > >>
> > > >> The vote will last for at least 72 hours (Jan 7th at 11:00 GMT)
> unless
> > > >> there is an objection or insufficient votes.
> > > >>
> > > >> [1]
> https://lists.apache.org/thread/63m9w60rndqnrqvgb6qosvt2bcbww53k
> > > >> [2]
> > > >>
> > > >>
> >
> https://cwiki.apache.org/confluence/display/FLINK/FLIP-274%3A+Introduce+metric+group+for+OperatorCoordinator
> > > >>
> > > >> Best,
> > > >> Hang
> > > >>
> > >
> >
>


[jira] [Created] (FLINK-30635) Fail to recover for row-format when writing to OSS using FileSystemTable connector

2023-01-11 Thread luoyuxia (Jira)
luoyuxia created FLINK-30635:


 Summary: Fail to recover for row-format when writing to OSS using 
FileSystemTable connector
 Key: FLINK-30635
 URL: https://issues.apache.org/jira/browse/FLINK-30635
 Project: Flink
  Issue Type: Bug
  Components: Connectors / FileSystem
Reporter: luoyuxia






--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Created] (FLINK-30634) HiveTableSourceITCase.setupSourceDatabaseAndData fails with InvalidObjectException

2023-01-11 Thread Matthias Pohl (Jira)
Matthias Pohl created FLINK-30634:
-

 Summary: HiveTableSourceITCase.setupSourceDatabaseAndData fails 
with InvalidObjectException
 Key: FLINK-30634
 URL: https://issues.apache.org/jira/browse/FLINK-30634
 Project: Flink
  Issue Type: Bug
  Components: Connectors / Hive
Affects Versions: 1.16.0
Reporter: Matthias Pohl


We have multiple failures in HiveTableSourceITCase

https://dev.azure.com/apache-flink/apache-flink/_build/results?buildId=44692=logs=245e1f2e-ba5b-5570-d689-25ae21e5302f=d04c9862-880c-52f5-574b-a7a79fef8e0f=27933



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


Re: [ANNOUNCE] New Apache Flink Committer - Lincoln Lee

2023-01-11 Thread Shuo Cheng
Congratulations, Lincoln!

On Tue, Jan 10, 2023 at 11:52 AM Jark Wu  wrote:

> Hi everyone,
>
> On behalf of the PMC, I'm very happy to announce Lincoln Lee as a new Flink
> committer.
>
> Lincoln Lee has been a long-term Flink contributor since 2017. He mainly
> works on Flink
> SQL parts and drives several important FLIPs, e.g., FLIP-232 (Retry Async
> I/O), FLIP-234 (
> Retryable Lookup Join), FLIP-260 (TableFunction Finish). Besides, He also
> contributed
> much to Streaming Semantics, including the non-determinism problem and the
> message
> ordering problem.
>
> Please join me in congratulating Lincoln for becoming a Flink committer!
>
> Cheers,
> Jark Wu
>


[jira] [Created] (FLINK-30633) Update AWS SDKv2 to v2.19.14

2023-01-11 Thread Danny Cranmer (Jira)
Danny Cranmer created FLINK-30633:
-

 Summary: Update AWS SDKv2 to v2.19.14
 Key: FLINK-30633
 URL: https://issues.apache.org/jira/browse/FLINK-30633
 Project: Flink
  Issue Type: Improvement
  Components: Connectors / AWS, Connectors / DynamoDB, Connectors / 
Firehose, Connectors / Kinesis
Reporter: Danny Cranmer
 Fix For: 1.17.0, 1.16.1, 1.15.4, aws-connector-4.1.0, 
aws-connector-3.1.0


Update AWS SDK to latest version since it fixes a bug:

- https://issues.apache.org/jira/browse/FLINK-30304



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Created] (FLINK-30632) Introduce DataType for table store

2023-01-11 Thread Jingsong Lee (Jira)
Jingsong Lee created FLINK-30632:


 Summary: Introduce DataType for table store
 Key: FLINK-30632
 URL: https://issues.apache.org/jira/browse/FLINK-30632
 Project: Flink
  Issue Type: Sub-task
  Components: Table Store
Reporter: Jingsong Lee
Assignee: Jingsong Lee
 Fix For: table-store-0.4.0


Introduce table store own DataType to decouple Flink SQL LogicalType.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Created] (FLINK-30631) Limit the max number of subpartitons consumed by each downstream task

2023-01-11 Thread Lijie Wang (Jira)
Lijie Wang created FLINK-30631:
--

 Summary: Limit the max number of subpartitons consumed by each 
downstream task
 Key: FLINK-30631
 URL: https://issues.apache.org/jira/browse/FLINK-30631
 Project: Flink
  Issue Type: Sub-task
  Components: Runtime / Coordination
Reporter: Lijie Wang
 Fix For: 1.17.0


In the current implementation(FLINK-25035), when the upstream vertex 
parallelism is much greater than the downstream vertex parallelism, it may lead 
to a large number of channels in the downstream tasks(for example, A -> B, all 
to all edge, max parallelism is 1000. If parallelism of A is 1000, parallelism 
of B is decided to be 1, then the only subtask of B will consume 1000 * 1000 
subpartitions), resulting in a large overhead for processing channels.

In this ticket, we temporarily address this issue by limiting the max number of 
subpartitons consumed by each downstream task. The ultimate solution should be 
to support single channel consume multiple subpartitons.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Created] (FLINK-30630) Python: BatchModeDataStreamTests.test_keyed_map failed

2023-01-11 Thread Matthias Pohl (Jira)
Matthias Pohl created FLINK-30630:
-

 Summary: Python: BatchModeDataStreamTests.test_keyed_map failed
 Key: FLINK-30630
 URL: https://issues.apache.org/jira/browse/FLINK-30630
 Project: Flink
  Issue Type: Bug
  Components: API / Python
Affects Versions: 1.15.3
Reporter: Matthias Pohl


{{BatchModeDataStreamTests.test_keyed_map}] failed in 1.15:
https://dev.azure.com/apache-flink/apache-flink/_build/results?buildId=44691=logs=821b528f-1eed-5598-a3b4-7f748b13f261=6bb545dd-772d-5d8c-f258-f5085fba3295



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Created] (FLINK-30629) ClientHeartbeatTest.testJobRunningIfClientReportHeartbeat is unstable

2023-01-11 Thread Xintong Song (Jira)
Xintong Song created FLINK-30629:


 Summary: ClientHeartbeatTest.testJobRunningIfClientReportHeartbeat 
is unstable
 Key: FLINK-30629
 URL: https://issues.apache.org/jira/browse/FLINK-30629
 Project: Flink
  Issue Type: Bug
  Components: Client / Job Submission
Affects Versions: 1.17.0
Reporter: Xintong Song
 Fix For: 1.17.0


https://dev.azure.com/apache-flink/apache-flink/_build/results?buildId=44690=logs=77a9d8e1-d610-59b3-fc2a-4766541e0e33=125e07e7-8de0-5c6c-a541-a567415af3ef=10819

{code:java}
Jan 11 04:32:39 [ERROR] Tests run: 3, Failures: 0, Errors: 1, Skipped: 0, Time 
elapsed: 21.02 s <<< FAILURE! - in org.apache.flink.client.ClientHeartbeatTest
Jan 11 04:32:39 [ERROR] 
org.apache.flink.client.ClientHeartbeatTest.testJobRunningIfClientReportHeartbeat
  Time elapsed: 9.157 s  <<< ERROR!
Jan 11 04:32:39 java.lang.IllegalStateException: MiniCluster is not yet running 
or has already been shut down.
Jan 11 04:32:39 at 
org.apache.flink.util.Preconditions.checkState(Preconditions.java:193)
Jan 11 04:32:39 at 
org.apache.flink.runtime.minicluster.MiniCluster.getDispatcherGatewayFuture(MiniCluster.java:1044)
Jan 11 04:32:39 at 
org.apache.flink.runtime.minicluster.MiniCluster.runDispatcherCommand(MiniCluster.java:917)
Jan 11 04:32:39 at 
org.apache.flink.runtime.minicluster.MiniCluster.getJobStatus(MiniCluster.java:841)
Jan 11 04:32:39 at 
org.apache.flink.runtime.minicluster.MiniClusterJobClient.getJobStatus(MiniClusterJobClient.java:91)
Jan 11 04:32:39 at 
org.apache.flink.client.ClientHeartbeatTest.testJobRunningIfClientReportHeartbeat(ClientHeartbeatTest.java:79)
{code}




--
This message was sent by Atlassian Jira
(v8.20.10#820010)


Re: [VOTE] FLIP-274: Introduce metric group for OperatorCoordinator

2023-01-11 Thread Martijn Visser
-1 (binding) currently: I don't think this should have gone to a vote yet
given that Chesnay deliberately mentioned that he would vote against it
as-is. The discussion should have been settled first.

On Tue, Jan 10, 2023 at 10:51 AM Zhu Zhu  wrote:

> +1 (binding)
>
> Thanks,
> Zhu
>
> Jark Wu  于2023年1月10日周二 17:43写道:
> >
> > +1 (binding)
> >
> > Best,
> > Jark
> >
> > > 2023年1月10日 12:02,Qingsheng Ren  写道:
> > >
> > > Thanks for the FLIP!
> > >
> > > +1 (binding)
> > >
> > > Best,
> > > Qingsheng
> > >
> > > On Wed, Jan 4, 2023 at 10:08 AM Hang Ruan 
> wrote:
> > >
> > >> Hi all,
> > >>
> > >> Thanks for all the feedback so far.
> > >> Based on the discussion[1], we have come to a consensus, so I would
> like to
> > >> start a vote on FLIP-274: Introduce metric group for
> > >> OperatorCoordinator[2].
> > >>
> > >> The vote will last for at least 72 hours (Jan 7th at 11:00 GMT) unless
> > >> there is an objection or insufficient votes.
> > >>
> > >> [1] https://lists.apache.org/thread/63m9w60rndqnrqvgb6qosvt2bcbww53k
> > >> [2]
> > >>
> > >>
> https://cwiki.apache.org/confluence/display/FLINK/FLIP-274%3A+Introduce+metric+group+for+OperatorCoordinator
> > >>
> > >> Best,
> > >> Hang
> > >>
> >
>


Re: [DISCUSS] FLIP-274 : Introduce metric group for OperatorCoordinator

2023-01-11 Thread Martijn Visser
Hi Hang,

I'm a bit surprised that this has gone to a vote, given that Chesnay
deliberately mentioned that he would vote against it as-is. I would expect
that before going to a vote, he has had the opportunity to participate in
this discussion.

Best regards,

Martijn

On Tue, Jan 3, 2023 at 12:53 PM Jark Wu  wrote:

> Hi Dong,
>
> Regarding “SplitEnumeratorContext#metricGroup”, my only concern is that
> this is a core interface for the FLIP. It’s hard to tell how sources use
> metric group without mentioning this interface. Even if this is an existing
> API, I think it’s worth introducing the interface again and declaring that
> we will implement the interface instead of a no-op method in this FLIP.
>
> Anyway, this is a minor problem and shouldn’t block this FLIP. I’m +1 to
> start a vote.
>
> Best,
> Jark
>
>
> > 2023年1月3日 10:03,Hang Ruan  写道:
> >
> > Hi, Jark and Dong,
> >
> > Thanks for your comments. Sorry for my late reply.
> >
> > For suggestion 1, I plan to implement the SplitEnumeratorMetricGroup in
> > another issue, and it is not contained in this FLIP. I will add some
> > description about this part.
> > For suggestion 2, changes about OperatorCoordinator#metricGroup has
> already
> > been documented in the proposed change section.
> >
> > Best,
> > Hang
> >
> > Dong Lin  于2023年1月1日周日 09:45写道:
> >
> >> Let me chime-in and add comments regarding the public interface section.
> >> Please see my comments inline.
> >>
> >> On Thu, Dec 29, 2022 at 6:08 PM Jark Wu  wrote:
> >>
> >>> Hi Hang,
> >>>
> >>> Thanks for driving this discussion. I think this is a very useful
> feature
> >>> for connectors.
> >>>
> >>> The FLIP looks quite good to me, and I just have two suggestions.
> >>>
> >>> 1. In the "Public Interface" section, mention that the implementation
> >>> behavior of "SplitEnumeratorContext#metricGroup" is changed from
> >> returning
> >>> null to returning a concrete SplitEnumeratorMetricGroup instance. Even
> >>> though the API is already there, the behavior change can also be
> >> considered
> >>> a public change.
> >>>
> >>
> >> SplitEnumeratorContext#metricGroup is an interface and this FLIP does
> not
> >> seem to change its semantics/behavior. The FLIP does change the
> >> implementation/behavior of SourceCoordinatorContext#metricGroup, which
> is
> >> marked @Internal.
> >>
> >> Thus it might seem a bit weird to add in the public interface section
> >> saying that we change the interface SplitEnumeratorContext#metricGroup
> from
> >> returning null to non-null object.
> >>
> >> 2. Mention the newly added interface of
> "OperatorCoordinator#metricGroup"
> >>> in the "Proposed Changes" section or "Public Interface" section. As the
> >>> FLIP said, OperatorCoordinator is widely used in many connectors.
> Though
> >> it
> >>> is still an @Internal API, I think it is worth mentioning the change in
> >> the
> >>> FLIP.
> >>
> >>
> >> Since OperatorCoordinator is an internal API, it seems reasonable to
> >> explain it in the proposed change section. The FLIP seems to have
> >> documented this in point 5 of the proposed change section.
> >>
> >> BTW, if we think there are @internal classes that are important enough
> to
> >> be added in the public interface section, it might be useful to
> explicitly
> >> discuss this topic and document it in the "*What are the "public
> >> interfaces" of the project*" in this
> >> <
> >>
> https://cwiki.apache.org/confluence/display/FLINK/Flink+Improvement+Proposals
> >>>
> >> wiki.
> >>
> >>
> >>> Best,
> >>> Jark
> >>>
> >>>
> >>> On Mon, 26 Dec 2022 at 18:06, Hang Ruan 
> wrote:
> >>>
>  Hi, thanks for the feedback, Zhu Zhu and Qingsheng.
>  After combining everyone's comments, the main concerns and
> >> corresponding
>  adjustments are as follows.
> 
>  Q1: Common metrics are not quite useful.
>  numEventsIn and numEventsOut counters will be removed from the
>  OperatorCoordinatorMetricGroup. These common metrics do not provide
> >>> enough
>  information for users. The users are more willing to get the number of
>  events of the specified type instead of the total number. And this
> >> metric
>  is calculated differently. The implementation could register the
> metric
> >>> by
>  themselves.
> 
>  Q2: This FLIP is overly complicated.
>  This FLIP will become concise after these modifications.
>  OperatorCoordinatorMetricGroup has already been introduced into Flink
> >> by
>  FLIP-179<
> 
> 
> >>>
> >>
> https://cwiki.apache.org/confluence/display/FLINK/FLIP-179%3A+Expose+Standardized+Operator+Metrics
> > .
>  And
>  this FLIP will not change it. This FLIP only provides a new metric
> >> option
>  and a new metric group scope. The changes in proposed changes provide
> >> the
>  details about the modifications for the internal classes, which might
> >>> make
>  it look complicated.
> 
>  Thanks for all the comments again. If there are no further