Zameer Manji created AURORA-1933:
------------------------------------
Summary: Scheduler can process rescind before offer
Key: AURORA-1933
URL: https://issues.apache.org/jira/browse/AURORA-1933
Project: Aurora
Issue Type: Bug
Reporter: Zameer Manji
Assignee: Zameer Manji
I observed the following in production:
{noformat}
Jun 6 00:31:32 compute1159-dca1 aurora-scheduler[23675]: I0606 00:31:32.510
[Thread-77638, MesosCallbackHandler$MesosCallbackHandlerImpl:229] Offer
rescinded: 81e04cbd-9bce-41cf-bd94-38c911f255e4-O142359552
Jun 6 00:31:32 compute1159-dca1 aurora-scheduler[23675]: I0606 00:31:32.903
[SchedulerImpl-0, MesosCallbackHandler$MesosCallbackHandlerImpl:211] Received
offer: 81e04cbd-9bce-41cf-bd94-38c911f255e4-O142359552
Jun 6 00:31:34 compute1159-dca1 aurora-scheduler[23675]: I0606 00:31:34.815
[TaskGroupBatchWorker, VersionedSchedulerDriverService:123] Accepting offer
81e04cbd-9bce-41cf-bd94-38c911f255e4-O142359552 with ops [LAUNCH]
{noformat}
Notice the rescind was processed before the offer was given. This means the
offer is in the offer storage, but using it is invalid. It will cause whatever
task launched with it to fail with {{Task launched with invalid offers: Offer
81e04cbd-9bce-41cf-bd94-38c911f255e4-O142359552 is no longer valid}}
--
This message was sent by Atlassian JIRA
(v6.3.15#6346)