Hyunsik Choi created TAJO-1507:
----------------------------------
Summary: Resource leak when a worker does not response KILLED
message
Key: TAJO-1507
URL: https://issues.apache.org/jira/browse/TAJO-1507
Project: Tajo
Issue Type: Improvement
Components: resource manager, worker
Reporter: Hyunsik Choi
Fix For: 0.11.0
Terminology
* QM - query master
* TM - tajo master
The query kill mechanism is as follows:
* Client sends kill command to TM.
* TM forwards the kill command to QM.
* QM disseminates the kill command to all workers.
* Corresponding workers kill their tasks and respond with KILLED to QM.
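
The four steps above can be sketched as a small simulation. This is only an illustrative model, not Tajo code: the class names (Worker, QueryMaster, TajoMaster), the kill() methods, and the dictionary of acknowledgements are all hypothetical and stand in for the real RPC messages.

```python
from dataclasses import dataclass, field


@dataclass
class Worker:
    name: str
    running_tasks: set = field(default_factory=set)

    def kill(self, query_id):
        # Step 4: stop the tasks of this query and acknowledge with KILLED.
        self.running_tasks.discard(query_id)
        return "KILLED"


@dataclass
class QueryMaster:
    workers: list

    def kill(self, query_id):
        # Step 3: disseminate the kill command to all workers and
        # collect each worker's response.
        return {w.name: w.kill(query_id) for w in self.workers}


@dataclass
class TajoMaster:
    qm: QueryMaster

    def kill(self, query_id):
        # Step 2: forward the client's kill command to the QM.
        return self.qm.kill(query_id)


workers = [Worker("w1", {"q1"}), Worker("w2", {"q1"})]
tm = TajoMaster(QueryMaster(workers))
acks = tm.kill("q1")  # Step 1: the client sends the kill command to TM.
print(acks)  # {'w1': 'KILLED', 'w2': 'KILLED'}
```

In this happy path every worker answers, so QM ends up with one KILLED acknowledgement per worker.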
However, some workers cannot send the KILLED message to QM because of a node
failure, a temporary network problem, or a worker restart. In this case, TM
cannot reclaim the allocated resources, even if the worker later turns out to
be healthy.