Re: How does JobManager terminate dangling task manager

2021-05-13 Thread Xintong Song
Hi narasimha, For each TaskManager, there are two kinds of connections to the JobManager process. - One single connection to the ResourceManager, which allows RM to monitor the slots' availability and assign them to Flink jobs. - Connections to each JobMaster that the slots of this TM are

Re: How does JobManager terminate dangling task manager

2021-05-13 Thread Guowei Ma
Hi, In fact, not only JobManager(ResoruceManager) will kill TimeOut's TaskManager, but if TaskManager finds that it cannot connect to JobManager(ResourceManager), it will also exit by itself. You can look at the time period during which the HB timeout occurred and what happened in the log. Under

How does JobManager terminate dangling task manager

2021-05-12 Thread narasimha
Hi, Trying to understand how JobManager. kills TaskManager that didn't respond for heartbeat after a certain time. For example: If a network connection b/w JobManager and TaskManager is lost for some reasons, the JobManager will bring up another Taskmanager post hearbeat timeout. In such a