Re: Flink job recovery after task manager failure

2022-03-03 Thread yidan zhao
dleStore{namespace='flink/default/checkpoints/523f9e48274186bb97c13e3c2213be0e'}. > > 2022-02-24 12:20:16,712 INFO > org.apache.flink.runtime.checkpoint.DefaultCompletedCheckpointStore > [] - All 1 checkpoints found are already downloaded. > > 2022-02-24 12:20:16,712 INFO > o

Re: Flink job recovery after task manager failure

2022-03-02 Thread Afek, Ifat (Nokia - IL/Kfar Sava)
- No master state to restore 2 Thanks, Ifat From: yidan zhao Date: Wednesday, 2 March 2022 at 4:08 To: "Afek, Ifat (Nokia - IL/Kfar Sava)" Cc: zhlonghong , "user@flink.apache.org" Subject: Re: Flink job recovery after task manager failure State backend can be set a

Re: Flink job recovery after task manager failure

2022-03-01 Thread yidan zhao
ers? Is > there another option? > > > > Thanks, > > Ifat > > > > *From: *Zhilong Hong > *Date: *Thursday, 24 February 2022 at 19:58 > *To: *"Afek, Ifat (Nokia - IL/Kfar Sava)" > *Cc: *"user@flink.apache.org" > *Subject: *Re: Flink job rec

Re: Flink job recovery after task manager failure

2022-03-01 Thread Afek, Ifat (Nokia - IL/Kfar Sava)
system be shared between the task managers and job managers? Is there another option? Thanks, Ifat From: Zhilong Hong Date: Thursday, 24 February 2022 at 19:58 To: "Afek, Ifat (Nokia - IL/Kfar Sava)" Cc: "user@flink.apache.org" Subject: Re: Flink job recovery after task

Re: Flink job recovery after task manager failure

2022-02-25 Thread Afek, Ifat (Nokia - IL/Kfar Sava)
Hi Zhilong, I will check the issues you raised. Thanks for your help, Ifat From: Zhilong Hong Date: Thursday, 24 February 2022 at 19:58 To: "Afek, Ifat (Nokia - IL/Kfar Sava)" Cc: "user@flink.apache.org" Subject: Re: Flink job recovery after task manager failure Hi, Afe

Re: Flink job recovery after task manager failure

2022-02-24 Thread Zhilong Hong
gt; > > > > Thanks, > > Ifat > > > > *From: *Zhilong Hong > *Date: *Wednesday, 23 February 2022 at 19:38 > *To: *"Afek, Ifat (Nokia - IL/Kfar Sava)" > *Cc: *"user@flink.apache.org" > *Subject: *Re: Flink job recovery after

Re: Flink job recovery after task manager failure

2022-02-24 Thread Afek, Ifat (Nokia - IL/Kfar Sava)
t; Subject: Re: Flink job recovery after task manager failure Hi, Afek! When a TaskManager is killed, JobManager will not be acknowledged until a heartbeat timeout happens. Currently, the default value of heartbeat.timeout is 50 seconds [1]. That's why it takes more than 30 seconds for Fl

Re: Flink job recovery after task manager failure

2022-02-23 Thread Zhilong Hong
Hi, Afek! When a TaskManager is killed, JobManager will not be acknowledged until a heartbeat timeout happens. Currently, the default value of heartbeat.timeout is 50 seconds [1]. That's why it takes more than 30 seconds for Flink to trigger a failover. If you'd like to shorten the time a