Re: Troubleshooting java.io.IOException: Connecting the channel failed: Connecting to remote task manager has failed. This might indicate that the remote task manager has been lost.

2019-06-10 Thread Ken Krugler
Hi Kumar, Note that once the job restarts, the TMs will be up and running again, so seeing them active after the problem isn’t much of an indicator. Look in all of the Task Manager log files for any hints of what was happening just before the time of the IOException. The two minutes (120 secon
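One way to follow the advice about checking what each Task Manager logged just before the IOException is a small filter over the log files. The sketch below is only an illustration and rests on assumptions not stated in the thread: that every log entry begins with a `yyyy-MM-dd HH:mm:ss,SSS` timestamp (adjust the pattern if your logging configuration differs), and that a 120-second look-back window is useful. The function name `lines_before` and its parameters are hypothetical.

```python
import re
from datetime import datetime, timedelta

# Assumption: each log entry starts with "yyyy-MM-dd HH:mm:ss,SSS".
# Change this pattern if your log layout is different.
TS_RE = re.compile(r"^(\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2}),\d{3}")


def lines_before(lines, failure_time, window_seconds=120):
    """Return the log lines stamped within window_seconds before failure_time.

    Lines without a timestamp (for example stack-trace continuations) are
    kept or dropped together with the timestamped line they follow.
    """
    start = failure_time - timedelta(seconds=window_seconds)
    selected = []
    keep = False
    for line in lines:
        match = TS_RE.match(line)
        if match:
            stamp = datetime.strptime(match.group(1), "%Y-%m-%d %H:%M:%S")
            keep = start <= stamp <= failure_time
        if keep:
            selected.append(line.rstrip("\n"))
    return selected
```

To use it, open each Task Manager log in turn, pass the file object as `lines` along with the time of the IOException as a `datetime`, and compare what the different machines logged in that window.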

Troubleshooting java.io.IOException: Connecting the channel failed: Connecting to remote task manager has failed. This might indicate that the remote task manager has been lost.

2019-06-10 Thread Kumar Bolar, Harshith
Hi all, I’ve deployed a job on my Flink cluster which has 3 task managers and 2 job managers. Like clockwork, every two minutes the job restarts with the following error. java.io.IOException: Connecting the channel failed: Connecting to remote task manager + 'flink2-0.high.ue1.pre.aws.cloud.ab