[jira] [Commented] (OOZIE-1767) Java Action automatically record child job ids

Mona Chitnis (JIRA) Fri, 04 Apr 2014 15:09:24 -0700

    [ 
https://issues.apache.org/jira/browse/OOZIE-1767?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13960485#comment-13960485
 ]


Mona Chitnis commented on OOZIE-1767:
-------------------------------------

The idea behind not having this functionality was to discourage users from 
using the java action to spawn their own MR jobs, and to steer them toward the 
map-reduce action instead. I see that in your use case, with other frameworks 
like Crunch/Cascading, you'd need to run some custom logic before launching 
jobs, so the map-reduce action won't be enough.

We don't scrape logs for all action types. For the pig action and the 
map-reduce action, we use the PigStats and JobClient APIs respectively to get 
the actual job ids. The logic used by the hive action might actually be 
sub-optimal.
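
For illustration only, scraping job ids out of log output could look roughly 
like the Java sketch below. This is not Oozie's implementation: the class name 
ChildJobIdScraper, the method extractJobIds, and the sample log lines are all 
hypothetical, and the sketch assumes job ids follow the usual Hadoop 
job_<number>_<number> form.

```java
import java.util.ArrayList;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Set;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper, not part of Oozie: collects Hadoop job ids from log text.
class ChildJobIdScraper {

    // Assumption: job ids look like job_<digits>_<digits>,
    // e.g. job_201404041509_0001.
    private static final Pattern JOB_ID = Pattern.compile("\\bjob_\\d+_\\d+\\b");

    // Returns each distinct job id once, in order of first appearance.
    static List<String> extractJobIds(String logText) {
        Set<String> ids = new LinkedHashSet<>();
        Matcher matcher = JOB_ID.matcher(logText);
        while (matcher.find()) {
            ids.add(matcher.group());
        }
        return new ArrayList<>(ids);
    }

    public static void main(String[] args) {
        // Made-up log lines, for demonstration only.
        String log = "Running job: job_201404041509_0001\n"
                   + "Running job: job_201404041509_0002\n"
                   + "Job complete: job_201404041509_0001\n";
        System.out.println(extractJobIds(log));
    }
}
```

A sketch like this depends entirely on what the launched framework happens to 
print, which is one reason an API such as PigStats or JobClient is preferable 
where one is available.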


> Java Action automatically record child job ids
> ----------------------------------------------
>
>                 Key: OOZIE-1767
>                 URL: https://issues.apache.org/jira/browse/OOZIE-1767
>             Project: Oozie
>          Issue Type: Improvement
>          Components: action
>            Reporter: Micah Whitacre
>
> It is common for consumers to use Big Data frameworks like Crunch and 
> Cascading inside their larger processing workflows.  When run, these 
> frameworks launch anywhere from one to many MR jobs.  The common way to 
> launch them is through the Java action, as both frameworks have a specific 
> API for launching and executing the jobs.  Unfortunately, with the default 
> Java action those spawned processes are not tied to the Oozie action that 
> launched them.  It would be great if there were a way for jobs launched from 
> the Java action to be tied back to the Oozie action.



--
This message was sent by Atlassian JIRA
(v6.2#6252)
