Hi, here is a pretty nice try: https://github.com/klout/scoozie
It gives a chance to convert flow written with Sala DSL into xml after
parameters substitution and just right before execution.

*Usecase*
You want to run variable list of jobs
(sqoop/pig/spark/whatever_it_doesn_matter)

start -> action_A -> action_B -> ... -> action_Z -> end

Here is the Sqoozie snippet to do it:
https://gist.github.com/seregasheypak/2009c84b546b4f38316bfbfa180d8d25

Here is the result:
https://gist.github.com/seregasheypak/976d114898536bb37d206a075cd7471d

Pros:
- Type safe
- No xml mess
- Short code footprint
- Allows to generate DAG using outer properties. A kind of shifting to
"meta DAG" level.

Cons:
Needs plugin/tool/addon in front of oozie to:
- Generate xml
- submit generated xml to Ooozie
- definitely tool should take care of wflow dependencies.

What would you say? Is it worth doing?

Reply via email to