Hi All,
I am running into confusion with the SLA with oozie in Hue. I am seeing a
mismatch in the time that hue states that a workflow runs and complete
versus email notification on SLA misses. I have set up a specific workflow
with a coordinator that is scheduled to run everyday at 15:30 and 16:30 GMT.
I have enabled SLA notification in the coordinator setting the following
parameters:
Nominal Time: ${nominal_time}
Should Start: ${60 * MINUTES}
Duration: ${60 * MINUTES}
Alert Events: start_miss, duration_miss
For a specific run recently I got the SLA Start miss notification:
Status:
SLA Status - START_MISS
Job Status - WAITING
Job Details:
App Type - COORDINATOR_ACTION
Job ID - 0000232-160817212356434-oozie-oozi-C@16
SLA Details:
Nominal Time - Thu Aug 18 20:05:00 GMT 2016
Expected Start Time - Thu Aug 18 21:05:00 GMT 2016
Expected End Time - Thu Aug 18 20:35:00 GMT 2016
Expected Duration (in mins) - 60
Actual Duration (in mins) - -1
But when going into HUE and look at run 16 I see this:
LogsIdNameTypeStatusExternal IdStart TimeEnd TimeError CodeError Message
TransitionData
0001987-160817212356434-oozie-oozi-W@subworkflow-1ed4
<https://hue-mint.adconion.com/oozie/list_oozie_workflow_action/0001987-160817212356434-oozie-oozi-W%40subworkflow-1ed4/?coordinator_job_id=0000232-160817212356434-oozie-oozi-C>
subworkflow-1ed4 sub-workflow OK 0001988-160817212356434-oozie-oozi-W
<https://hue-mint.adconion.com/oozie/list_oozie_workflow/0001988-160817212356434-oozie-oozi-W/>
Fri,
26 Aug 2016 15:30:00 Fri, 26 Aug 2016 15:32:52 subworkflow-f644
0001987-160817212356434-oozie-oozi-W@subworkflow-f644
<https://hue-mint.adconion.com/oozie/list_oozie_workflow_action/0001987-160817212356434-oozie-oozi-W%40subworkflow-f644/?coordinator_job_id=0000232-160817212356434-oozie-oozi-C>
subworkflow-f644 sub-workflow OK 0001989-160817212356434-oozie-oozi-W
<https://hue-mint.adconion.com/oozie/list_oozie_workflow/0001989-160817212356434-oozie-oozi-W/>
Fri,
26 Aug 2016 15:32:52 Fri, 26 Aug 2016 15:34:27
The workflow actually started on time. Also I am not sure where the
nominal time in SLA is coming from, I would assume it would be around
15:30 instead
of 20:05. Not sure if I am completely missing something.
CDH Version: 5.7.2
Hue Version: 3.9