Maxim Khutornenko created AURORA-1242:
-----------------------------------------

             Summary: Consider new task states to track package fetching and 
task launching
                 Key: AURORA-1242
                 URL: https://issues.apache.org/jira/browse/AURORA-1242
             Project: Aurora
          Issue Type: Story
          Components: Executor, Scheduler
            Reporter: Maxim Khutornenko


There is currently little to no visibility into task launching activities. Once 
a task reaches {{RUNNING}} it's considered entering an application realm where 
the only way to dissect the warmup period is through examining thermos 
processes (if they exist). The warmup may take arbitrary long time to complete 
exacerbating the visibility problem even further. 

Another example is docker container pull (AURORA-1059) where a task stays in 
{{ASSIGNED}} until a docker pull completes. This skews our SLA metrics and 
risks aborting the task due to exceeding transient task timeout.

We should consider adding more task states to track package/container fetch and 
launching/warmup activities explicitly, e.g.:
{{ASSIGNED}} -> {{FETCHING}} ->{{LAUNCHING|STARTING}} -> {{RUNNING}}

The above would require modifying the schema to require explicit definition for 
package fetching.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to