All, I am trying to create an Oozie xml file from a DAG data structure.
My data structure (and actions to run) would look like that: start----->A1,A2 A1----> A7 A2----->A7,A3,A4 A3----->A6 A7,A6,A4---->end Now if the actions output are only temporary, I would like to delete them asap, I name D the delete actions: start----->A1,A2 A1----> A7 A2----->A7,A3,A4 A3----->A6,[A3,A4,A7] A7----->D1,[A3,A4,A7] A4----->[A3,A4,A7] A3,A4,A7---->D2 A6 ----> D3 D1----->join_end D2------>join_end D3------>join_end join_end ----->end I have two questions on forks/joins: The documentation says that I need to have fork and join in pair, is that still true? Or is it only better, I just read a post from Virag the 19th of July: "To me, the nested forks option you are considering looks good. Its also better to have the join in pair." I would like that my workflow finishes even through one branch fails, let say that A7 fails, I would like that A6 and A4 proceed. For this type of behaviour can I link all my error transitions to join_end? It will not be possible if I have to pair forks and joins. Regards, Étienne
