Hi Kumar,
Note that once the job restarts, the TMs will be up and running again, so
seeing them active after the problem isn’t much of an indicator.
Look in all of the Task Manager log files for any hints of what was happening
just before the time of the IOException.
The two minutes (120 secon
Hi all,
I’ve deployed a job on my Flink cluster which has 3 task managers and 2 job
managers.
Like clockwork, every two minutes the job restarts with the following error.
java.io.IOException: Connecting the channel failed: Connecting to remote task
manager + 'flink2-0.high.ue1.pre.aws.cloud.ab