[ 
https://issues.apache.org/jira/browse/HADOOP-2947?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vinod Kumar Vavilapalli updated HADOOP-2947:
--------------------------------------------

    Attachment: HADOOP-2947.2

Earlier patch kind of does a radical surgery on simpleCommand, though, as 
Hemanth suggested, this could be done without disturbing much of the current 
code. Attaching a new patch. This uses the current simpleCommand framework to 
redirect the stderr and stdout to files. Redirecting these on all daemons to 
<daemon-name.out> and <daemon-name.err> in the corresponding log directories, 
which when log-destination-uri is specified, will be archived along with hadoop 
logs onto dfs.

Tested stdout by doing a sigquit, tested stderr by forcing namenode to fail in 
intializing(for e.g removed logging jar from hadoop lib dir). Tested to make 
sure that streams are getting redirected to respective files as desired.

Added test cases also.

> [HOD] Hod should redirect stderr and stdout of Hadoop daemons to assist 
> debugging
> ---------------------------------------------------------------------------------
>
>                 Key: HADOOP-2947
>                 URL: https://issues.apache.org/jira/browse/HADOOP-2947
>             Project: Hadoop Core
>          Issue Type: Improvement
>          Components: contrib/hod
>    Affects Versions: 0.16.0
>            Reporter: Hemanth Yamijala
>            Assignee: Vinod Kumar Vavilapalli
>            Priority: Blocker
>             Fix For: 0.17.0
>
>         Attachments: HADOOP-2947, HADOOP-2947.2
>
>
> Copied from internal bug details from Koji:
> ==========================
> Sometimes JobTracker/TaskTracker starts consuming 99% cpu and stops 
> responding to 'jstack' call.  In those cases,
> usually it still responds to kill -QUIT signal which forces the jvm to dump 
> the stack to stdout.  
> Please have the stdout of JT/TT redirected to a file. 
> Adding stderr. 
> If thread has an uncaught exception, it prints out to stderr and dies.
> ==========================

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to