[
https://issues.apache.org/jira/browse/HADOOP-5420?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12731554#action_12731554
]
Hemanth Yamijala commented on HADOOP-5420:
------------------------------------------
bq. This issue is marked as Fixed. Open a new JIRA with a description about the
issue and attach the patch.
Rajiv, the reason why there is no new jira is because this is a bug that occurs
only on the Yahoo! distribution and not on trunk.
To give an explanation, the patch HADOOP-5420-v20.patch introduced some code to
make sure the pid files written by the task-controller were owned by the
tasktracker process, as a security check. Inadvertently, in this patch, we
removed some code that changed the ownership of the pid file (which was written
as root) to be owned by the TT user. As a result, pid files were created as
root, but the new check introduced in the patch failed during kill because it
found the PID files were not owned by the TT user and hence treated them as
suspect. Hence tasks failed to be killed causing runaway processes on the
cluster.
The attached patch re-introduces the code that changes ownership of the pid
file to the TT user so that during killing the security check would pass and
processes would be killed.
> Support killing of process groups in LinuxTaskController binary
> ---------------------------------------------------------------
>
> Key: HADOOP-5420
> URL: https://issues.apache.org/jira/browse/HADOOP-5420
> Project: Hadoop Common
> Issue Type: Bug
> Affects Versions: 0.21.0
> Reporter: Sreekanth Ramakrishnan
> Assignee: Sreekanth Ramakrishnan
> Fix For: 0.21.0
>
> Attachments: 5420-fix-ydist.patch, hadoop-5420-1.patch,
> hadoop-5420-10.patch, hadoop-5420-11.patch, hadoop-5420-12.patch,
> hadoop-5420-2.patch, hadoop-5420-3.patch, hadoop-5420-4.patch,
> hadoop-5420-5.patch, hadoop-5420-6.patch, hadoop-5420-7.patch,
> hadoop-5420-8.patch, hadoop-5420-9.patch, HADOOP-5420-v20.patch,
> hadoop-5420.patch
>
>
> Support setsid based kill in LinuxTaskController.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.