[
https://issues.apache.org/jira/browse/AURORA-1933?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Santhosh Kumar Shanmugham updated AURORA-1933:
----------------------------------------------
Fix Version/s: 0.18.0
> Scheduler can process rescind before offer
> ------------------------------------------
>
> Key: AURORA-1933
> URL: https://issues.apache.org/jira/browse/AURORA-1933
> Project: Aurora
> Issue Type: Bug
> Reporter: Zameer Manji
> Assignee: Zameer Manji
> Fix For: 0.18.0
>
>
> I observed the following in production:
> {noformat}
> Jun 6 00:31:32 compute1159-dca1 aurora-scheduler[23675]: I0606 00:31:32.510
> [Thread-77638, MesosCallbackHandler$MesosCallbackHandlerImpl:229] Offer
> rescinded: 81e04cbd-9bce-41cf-bd94-38c911f255e4-O142359552
> Jun 6 00:31:32 compute1159-dca1 aurora-scheduler[23675]: I0606 00:31:32.903
> [SchedulerImpl-0, MesosCallbackHandler$MesosCallbackHandlerImpl:211] Received
> offer: 81e04cbd-9bce-41cf-bd94-38c911f255e4-O142359552
> Jun 6 00:31:34 compute1159-dca1 aurora-scheduler[23675]: I0606 00:31:34.815
> [TaskGroupBatchWorker, VersionedSchedulerDriverService:123] Accepting offer
> 81e04cbd-9bce-41cf-bd94-38c911f255e4-O142359552 with ops [LAUNCH]
> {noformat}
> Notice the rescind was processed before the offer was given. This means the
> offer is in the offer storage, but using it is invalid. It will cause
> whatever task launched with it to fail with {{Task launched with invalid
> offers: Offer 81e04cbd-9bce-41cf-bd94-38c911f255e4-O142359552 is no longer
> valid}}
--
This message was sent by Atlassian JIRA
(v6.3.15#6346)