Zameer Manji created AURORA-1933:
------------------------------------

             Summary: Scheduler can process rescind before offer
                 Key: AURORA-1933
                 URL: https://issues.apache.org/jira/browse/AURORA-1933
             Project: Aurora
          Issue Type: Bug
            Reporter: Zameer Manji
            Assignee: Zameer Manji


I observed the following in production:
{noformat}
Jun  6 00:31:32 compute1159-dca1 aurora-scheduler[23675]: I0606 00:31:32.510 
[Thread-77638, MesosCallbackHandler$MesosCallbackHandlerImpl:229] Offer 
rescinded: 81e04cbd-9bce-41cf-bd94-38c911f255e4-O142359552
Jun  6 00:31:32 compute1159-dca1 aurora-scheduler[23675]: I0606 00:31:32.903 
[SchedulerImpl-0, MesosCallbackHandler$MesosCallbackHandlerImpl:211] Received 
offer: 81e04cbd-9bce-41cf-bd94-38c911f255e4-O142359552
Jun  6 00:31:34 compute1159-dca1 aurora-scheduler[23675]: I0606 00:31:34.815 
[TaskGroupBatchWorker, VersionedSchedulerDriverService:123] Accepting offer 
81e04cbd-9bce-41cf-bd94-38c911f255e4-O142359552 with ops [LAUNCH]
{noformat}

Notice the rescind was processed before the offer was given. This means the 
offer is in the offer storage, but using it is invalid. It will cause whatever 
task launched with it to fail with {{Task launched with invalid offers: Offer 
81e04cbd-9bce-41cf-bd94-38c911f255e4-O142359552 is no longer valid}}



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

Reply via email to