[ https://issues.apache.org/jira/browse/AURORA-1459?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Zameer Manji updated AURORA-1459: --------------------------------- Fix Version/s: 0.10.0 > DelayExecutor is flaky within scheduling loop > --------------------------------------------- > > Key: AURORA-1459 > URL: https://issues.apache.org/jira/browse/AURORA-1459 > Project: Aurora > Issue Type: Bug > Components: Scheduler > Reporter: Maxim Khutornenko > Assignee: Bill Farner > Fix For: 0.10.0 > > > TaskGroups now uses DelayExecutor introduced to gate async operations. The > problem though is that DelayExecutor queue is only flushed on DB transaction > completion (1). This means no scheduling can ever proceed unless there is > _some_ storage mutation activity. If/when there are no storage writes > scheduling effectively halts. > While it unlikely to happen in production, it is consistently reproducible > with e2e tests in vagrant on any subsequent run. > (1) - > https://github.com/apache/aurora/blob/06ddaadbcba4c66b8019815de6ca27d50a9df77d/src/main/java/org/apache/aurora/scheduler/storage/db/DbStorage.java#L175-L178 -- This message was sent by Atlassian JIRA (v6.3.4#6332)