[ 
https://issues.apache.org/jira/browse/OOZIE-1770?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14635603#comment-14635603
 ] 

Robert Kanter commented on OOZIE-1770:
--------------------------------------

Oozie currently requires all actions in a workflow to run as the same user, so 
that's not an issue.  As far as I know, we're not planning on changing this 
anytime soon either.

With all actions run as the same user, the difference between options 2 and 3 
is this:  In option 2 and the current design, the Oozie server monitors all 
running actions (it checks on them every 10 minutes unless it receives a 
callback to check sooner).  When an action finishes, the Oozie server does some 
stuff (resolves EL Functions, determines which action to go to, handles forks, 
decision nodes, etc) and then starts the next action in the workflow.  In 
option 3, that work would all be done by a separate entity (an AM), with the 
Oozie server simply checking the status of the workflow only (though I imagine 
Coordinators and Bundles would still live in the Oozie server).  

Option 3 does have some advantages over option 2, though it would obviously 
require a lot more re-working of Oozie's architecture than option 2.  Moving 
from a transactional DB to files in HDFS may be problematic.  I'm not sure we 
actually would need to do that.  Other than Derby, the other databases let you 
have multiple connections, so each of the AMs should be able to talk to the 
database; though I don't know if this is too much pressure on the DB.  And if 
we need locks between the AMs and the Oozie server, we already have the 
ZooKeeper-based locks.  If we do option 3, I think the actions should still be 
executed in separate containers, to maintain the stability of the Workflow AM 
and to help scale out fork nodes.

That said, as far as I know, Oozie scales pretty well.  Yahoo! can comment more 
on this.  And we already have Oozie HA, which lets you run multiple Oozie 
servers.  So I'm not sure we need option 3 over option 2.

It would be good to get some input from [~rohini], [~egashira], [~puru], 
[~shwethags], [~qwertymaniac], [~bowenzhangusa], and even [~kamrul] and 
[~tucu00].

> Create Oozie Application Master for YARN
> ----------------------------------------
>
>                 Key: OOZIE-1770
>                 URL: https://issues.apache.org/jira/browse/OOZIE-1770
>             Project: Oozie
>          Issue Type: New Feature
>            Reporter: Bowen Zhang
>            Assignee: Bowen Zhang
>         Attachments: oya-rm-screenshot.jpg, oya.patch
>
>
> After the first release of oozie on hadoop 2, it will be good if users can 
> set execution engine in oozie conf, be it YARN AM or traditional MR. We can 
> target this for post oozie 4.1 release.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to