flow version in API and storage

Zhijie Shen (JIRA) Mon, 30 Mar 2015 15:16:03 -0700

    [ 
https://issues.apache.org/jira/browse/YARN-3391?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14387515#comment-14387515
 ]


Zhijie Shen commented on YARN-3391:
-----------------------------------

bq. do you know of any such config or context parameter that can be set so that 
we can pick up the flow name from there for all yarn applications?

I propose to use app id to generate flow name, such that every orphan app will 
be put in a unique default flow. Otherwise, if we use the job name, for 
example,  all the wordcout jobs will belong to one flow then by default.

bq. Using a run id of 1 will mean everything will fall into this bucket if no 
one sets the run id.

It won't, because each orphan app will have a unique default flow.

Compare to multiple runs under the same flow as the default, I think the more 
appropriate way is that each application should have a unique flow name. Under 
one flow, there's only one default version "1", and under the version, there's 
only one run 1.



> Clearly define flow ID/ flow run / flow version in API and storage
> ------------------------------------------------------------------
>
>                 Key: YARN-3391
>                 URL: https://issues.apache.org/jira/browse/YARN-3391
>             Project: Hadoop YARN
>          Issue Type: Sub-task
>          Components: timelineserver
>            Reporter: Zhijie Shen
>            Assignee: Zhijie Shen
>
> To continue the discussion in YARN-3040, let's figure out the best way to 
> describe the flow.
> Some key issues that we need to conclude on:
> - How do we include the flow version in the context so that it gets passed 
> into the collector and to the storage eventually?
> - Flow run id should be a number as opposed to a generic string?
> - Default behavior for the flow run id if it is missing (i.e. client did not 
> set it)
> - How do we handle flow attributes in case of nested levels of flows?



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (YARN-3391) Clearly define flow ID/ flow run / flow version in API and storage

Reply via email to