[ 
https://issues.apache.org/jira/browse/YARN-3044?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14522790#comment-14522790
 ] 

Vinod Kumar Vavilapalli commented on YARN-3044:
-----------------------------------------------

Apologies for dropping off the "send info from RM vs from NM" discussion mid 
way through.

We all agree that sending information from NMs is *more scalable*.

The concern isn't really about information ownership. RM and NM both form the 
platform, so we can rely on NMs to publish information. But it's really about 
potential *loss of information* in many not-so rare cases like when container 
gets allocated but gets preempted or released by AM before it really starts.

As long as containers successfully start on NMs (which will be the vast 
majority assuming the cluster isn't bad), we can rely on NMs to post all sorts 
of information - allocation time, wait time, execution time, information like 
priority, host, port , resource-usage-over-time etc. We can just tunnel some of 
the RM-originated information through AMs to the NM.

The missing dots occur when a container's life-cycle ends either on the RM or 
the AM. We can take a dual pronged approach here? That or we make the 
RM-publisher itself a distributed push.

> [Event producers] Implement RM writing app lifecycle events to ATS
> ------------------------------------------------------------------
>
>                 Key: YARN-3044
>                 URL: https://issues.apache.org/jira/browse/YARN-3044
>             Project: Hadoop YARN
>          Issue Type: Sub-task
>          Components: timelineserver
>            Reporter: Sangjin Lee
>            Assignee: Naganarasimha G R
>         Attachments: YARN-3044-YARN-2928.004.patch, 
> YARN-3044.20150325-1.patch, YARN-3044.20150406-1.patch, 
> YARN-3044.20150416-1.patch
>
>
> Per design in YARN-2928, implement RM writing app lifecycle events to ATS.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to