[ 
https://issues.apache.org/jira/browse/YARN-1390?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13819992#comment-13819992
 ] 

Hitesh Shah commented on YARN-1390:
-----------------------------------

Also, from an implementation point of view, I would assume this map would 
*not* be searchable. Free-form text or even a set of variable key-val pairs is 
expensive to search. Only defined fields such as applicationType (which would 
contain only a single value) should be searchable. 

bq. Representing applicationType as a set should suffice.

Representing it as a set is fine. However, how do you expect Oozie to pass 
source info to Pig, which in turn will pass it to MR? Are you assuming the 
source info is just a simple, well-defined string such as "Oozie", or would 
Oozie do something like "Oozie:workflowId=1234"? I think lineage is something 
that YARN does not need to know or understand at the moment. Better to support 
it via the free-form map instead of introducing a new field when we are not 
sure how we plan to use/handle/support it. 
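
To make the distinction concrete, here is a rough sketch of what I have in 
mind. It is not the YARN API: AppRecord, AppIndex, findByType and the "source" 
key are all hypothetical names used only for illustration. The set of 
application types is a defined field and is indexed, while the free-form map 
is stored and returned as-is and never searched.

```java
import java.util.*;

// Hypothetical record, not a YARN class: one defined field plus a free-form map.
final class AppRecord {
    final String id;
    final Set<String> applicationTypes; // defined field, indexed for search
    final Map<String, String> info;     // free-form map, opaque to the index

    AppRecord(String id, Set<String> applicationTypes, Map<String, String> info) {
        this.id = id;
        this.applicationTypes = Collections.unmodifiableSet(new HashSet<>(applicationTypes));
        this.info = Collections.unmodifiableMap(new HashMap<>(info));
    }
}

// Hypothetical index: only the defined field is searchable.
final class AppIndex {
    private final Map<String, List<AppRecord>> byType = new HashMap<>();

    void add(AppRecord app) {
        // Index each application type; the free-form map is never inspected.
        for (String type : app.applicationTypes) {
            byType.computeIfAbsent(type, k -> new ArrayList<>()).add(app);
        }
    }

    List<AppRecord> findByType(String type) {
        // A single map lookup, independent of what the free-form map contains.
        return byType.getOrDefault(type, Collections.emptyList());
    }
}
```

With this shape, Oozie could put something like "Oozie:workflowId=1234" under 
a key of its choosing in the map, and Pig and MR could pass the map along 
unchanged. A client that cares about lineage reads the value back from the 
records returned by a type query; YARN itself never has to parse it.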



> Provide a way to capture source of an application to be queried through REST 
> or Java Client APIs
> ------------------------------------------------------------------------------------------------
>
>                 Key: YARN-1390
>                 URL: https://issues.apache.org/jira/browse/YARN-1390
>             Project: Hadoop YARN
>          Issue Type: Improvement
>          Components: api
>    Affects Versions: 2.2.0
>            Reporter: Karthik Kambatla
>            Assignee: Karthik Kambatla
>
> In addition to other fields like application-type (added in YARN-563), it is 
> useful to have an applicationSource field to track the source of an 
> application. The application source can be useful in (1) fetching only those 
> applications a user is interested in, (2) potentially adding source-specific 
> optimizations in the future. 
> Examples of sources are: User-defined project names, Pig, Hive, Oozie, Sqoop 
> etc.



--
This message was sent by Atlassian JIRA
(v6.1#6144)
