Hi,

With OOZIE-1311 and its subtasks,  the idea seems to move all the launcher 
classes like PigMain, HiveMain etc. to  their respective sharelibs.
So, now shared lib is a mandatory deployment step. Before shared lib was 
optional as users could bundle jars with their workflow application.
So always requiring shared lib seems to introduce 2 problems:

  1.  The current deployments which don't use action shared lib will fail. So, 
probably we should deprecate the current behavior.

2. The hadoop distributed cache mechanism will fail a job if the files in DC 
are updated on hdfs while the hadoop job is running. So, when Oozie is 
restarted and shared lib is uploaded to hdfs as part of
             deployment, hadoop  will fail the existing jobs for which the 
timestamp of  the file on hdfs doesn't match the timestamp of its copy in the 
job's DC.


Thanks,
Virag




Reply via email to