[ 
https://issues.apache.org/jira/browse/YARN-3981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15448925#comment-15448925
 ] 

Rohith Sharma K S commented on YARN-3981:
-----------------------------------------

Hi [~sjlee0], thanks for the summary of the discussion.

bq. This client would launch a special YARN app under the covers whose app 
master and its associated timeline writer can serve as the proxy for timeline 
data the client may write. When this special timeline client shuts down, it 
would tear down the associated YARN app also.
I assume launching a special is just same as normal-app where in YARN 
frameworks take care of writing all the corresponding data to ATS2. Are you 
referring special app as Uber job? Even uber job, for each application there 
will be corresponding another job in the list. This doubles the storage are 
also. 

One another approach is
As part of NM daemon, start new service same as TimeLineWriterWebService. Idea 
is NM reports all these collector address to RM.   Introduce new API in 
clientRMservice to get collector address. Address is given by RM in random(This 
can be decided later). This address is used by timeline client. TimeLineClient  
exposes new constructor with an flowName. So system properties can be written 
at flow level.
These special entities are stored in separate table i.e with key 
*clusterid!flowName!entityType!entityId*. 

I would appreciate if folks give their suggestions/comments on the new approach.

> support timeline clients not associated with an application
> -----------------------------------------------------------
>
>                 Key: YARN-3981
>                 URL: https://issues.apache.org/jira/browse/YARN-3981
>             Project: Hadoop YARN
>          Issue Type: Sub-task
>          Components: timelineserver
>    Affects Versions: YARN-2928
>            Reporter: Sangjin Lee
>            Assignee: Rohith Sharma K S
>              Labels: YARN-5355
>
> In the current v.2 design, all timeline writes must belong in a 
> flow/application context (cluster + user + flow + flow run + application).
> But there are use cases that require writing data outside the context of an 
> application. One such example is a higher level client (e.g. tez client or 
> hive/oozie/cascading client) writing flow-level data that spans multiple 
> applications. We need to find a way to support them.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to