[ 
https://issues.apache.org/jira/browse/YARN-896?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13750288#comment-13750288
 ] 

Chris Riccomini commented on YARN-896:
--------------------------------------

bq. Because many of these systems roll their logs to avoid filling up disks we 
will probably need a protocol of some sort for the container to communicate 
with the Node Manager when logs are ready to be processed. 

Somewhat related, and a random thought: Samza is currently piping stdout and 
stderr to files when it sets its AM and container commands (... 1>logs/stdout 
2>logs/stderr). These files never get rolled, and they can get very big for 
long-lived processes (10s-100s of gigs). We encourage using a logging system 
(log4j) to handle this stuff, but there are cases where the recommended use is 
not followed (sigh :)).

What do you guys think about tweaking the NM to consume the STDOUT/STDERR 
streams from the ProcessBuilder when it executes AM/container commands, writing 
them to stdout/stderr files, and rolling them periodically (e.g. daily)?
                
> Roll up for long lived YARN
> ---------------------------
>
>                 Key: YARN-896
>                 URL: https://issues.apache.org/jira/browse/YARN-896
>             Project: Hadoop YARN
>          Issue Type: New Feature
>            Reporter: Robert Joseph Evans
>
> YARN is intended to be general purpose, but it is missing some features to be 
> able to truly support long lived applications and long lived containers.
> This ticket is intended to
>  # discuss what is needed to support long lived processes
>  # track the resulting JIRA.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to