OOZIE-130: build workflow progress information in Oozie
-------------------------------------------------------

                 Key: OOZIE-547
                 URL: https://issues.apache.org/jira/browse/OOZIE-547
             Project: Oozie
          Issue Type: New Feature
            Reporter: Hadoop QA
            Assignee: Roman Shaposhnik


For a user, knowing the progress of her workflow is always desirable. This
ticket introduces that support in Oozie.
I know it's a hard problem. As an initial effort, I plan to start with simple
workflows that contain no decision nodes or fork/join nodes, i.e., chain-type
workflows. I plan to use the percentage of finished actions as the overall
workflow progress estimate.
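
As a rough illustration of the chain-workflow estimate described above, here is
a minimal sketch in Python. The function name and the "DONE" status label are
hypothetical and chosen only for this example; they are not Oozie's actual API
or status names.

```python
def chain_progress(action_statuses):
    """Return the percentage of finished actions in a chain workflow.

    action_statuses: list of status strings, one per action, in chain order.
    "DONE" is a hypothetical label for a finished action (an assumption,
    not an Oozie status name).
    """
    if not action_statuses:
        # An empty workflow has nothing to report; treat it as 0%.
        return 0.0
    finished = sum(1 for status in action_statuses if status == "DONE")
    return 100.0 * finished / len(action_statuses)


# A chain of four actions with two finished reports 50% progress.
print(chain_progress(["DONE", "DONE", "RUNNING", "PENDING"]))
```

Because a chain workflow has exactly one path, the number of actions is known
up front, which is what makes this simple ratio usable as a first estimate.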

Going forward, we can improve the estimate by:
1) handling general workflows that contain decision and fork/join nodes;
2) incorporating action-level progress into the workflow-level progress
estimate to make it more accurate. To be more specific:
In the case of "opaque" actions like pig/hive/jaql, where the status can only be
0% or 100% (or failure), we plug that value into the overall DAG progress of
0-100%. If a DAG had, say, 4 opaque actions, the progress would move in discrete
steps: 0, 25, 50, 75, 100%. For m/r actions, where the JobTracker gives values
between 0% and 100% for an action, the overall progress will be smoother. We can
do the same thing for pig/hive/jaql actions as well if they expose their own
progress info.
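
One possible way to combine action-level progress into a workflow-level number
is a plain average over all actions. The sketch below assumes equal weight per
action and a per-action progress value between 0 and 100; the function name is
hypothetical, and how the values would be fetched from the JobTracker is left
out.

```python
def workflow_progress(action_progress):
    """Average per-action progress (each 0-100) into one workflow estimate.

    Opaque actions contribute only 0 or 100; actions that expose their own
    progress contribute any value in between. Equal weighting per action is
    an assumption of this sketch.
    """
    if not action_progress:
        return 0.0
    # Clamp each value so a bad reading cannot push the total outside 0-100.
    clamped = [min(100.0, max(0.0, float(p))) for p in action_progress]
    return sum(clamped) / len(clamped)


# Four opaque actions, two finished: progress sits on a discrete step.
print(workflow_progress([100, 100, 0, 0]))

# Same DAG, but the second action is an m/r job reporting 40% done.
print(workflow_progress([100, 40, 0, 0]))
```

With only opaque actions the result can take just the values 0, 25, 50, 75 and
100 for a four-action DAG; any action that reports intermediate progress fills
in the gaps between those steps.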

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

        
