I would like to modify the Oozie code to introduce a new scheduling pattern in Hadoop. I am new to Oozie. I read that there is a file called workflow.xml which has the actions that are to be performed by Hadoop. I want to introduce a new field to the job, something like a JOB_TYPE. For eg, if a job belongs to TYPE_1, then it should be replicated in all the worker nodes. If a job belongs to TYPE_2, then it should be replicated in only a fraction of nodes. Is it possible to modify the parser of Oozie which parses the workflow.xml? Please do help
-- Tina
