James Peach created MESOS-9172:
----------------------------------
Summary: Fetcher deadlock with duplicated URIs.
Key: MESOS-9172
URL: https://issues.apache.org/jira/browse/MESOS-9172
Project: Mesos
Issue Type: Bug
Components: fetcher
Reporter: James Peach
Assignee: James Peach
If the fetcher cache is empty and you launch a task that contains duplicate
URIs, the fetcher deadlocks waiting for the futures in
{{FetcherProcess::_fetch}}.
What happens is that when the fetcher is setting up the initial batch of cache
lookup futures in {{FetcherProcess::fetch}}, the duplicate URIs cause cache
hits on the placeholder cache entries. This code is assuming that there is
already an operation in flight that will populate the cache entry. However, the
cache is currently empty; the placeholder entry was created by the duplicate
in the task's URIs.
When we await the futures in {{FetcherProcess::_fetch}}, we end up waiting for
the future that indicates the cache entry has been populated, but that will
never happen because we need to make progress on the current fetching batch in
order to populate the cache entry. At this point we are deadlocked.
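The self-wait described above can be sketched with a small standalone model.
This is not Mesos code: {{Action}}, {{Cache}}, {{planBatch}} and
{{waitsOnItself}} are hypothetical names invented for illustration, and the
model assumes that a batch starts its downloads only after every awaited
lookup has completed.

```cpp
#include <cassert>
#include <cstddef>
#include <map>
#include <string>
#include <vector>

// Hypothetical model, not the Mesos types: the outcome of one cache lookup.
enum class Action {
  Download,       // cache miss: a placeholder is created, this batch fetches
  AwaitExisting,  // hit on a placeholder: wait until someone populates it
  UseCached       // hit on a populated entry: nothing to wait for
};

// Maps a URI to whether its entry is populated (false means placeholder).
using Cache = std::map<std::string, bool>;

// Simplified lookup pass: the first sighting of a URI inserts a placeholder,
// and any later sighting of the same URI is treated as a cache hit.
std::vector<Action> planBatch(Cache& cache,
                              const std::vector<std::string>& uris)
{
  std::vector<Action> plan;
  for (const std::string& uri : uris) {
    auto it = cache.find(uri);
    if (it == cache.end()) {
      cache.emplace(uri, false);  // placeholder owned by this batch
      plan.push_back(Action::Download);
    } else if (it->second) {
      plan.push_back(Action::UseCached);
    } else {
      plan.push_back(Action::AwaitExisting);
    }
  }
  return plan;
}

// Returns true if the batch awaits a placeholder that only a download in
// this same batch could populate, i.e. the wait can never complete.
bool waitsOnItself(const std::vector<std::string>& uris,
                   const std::vector<Action>& plan)
{
  assert(uris.size() == plan.size());
  for (std::size_t i = 0; i < plan.size(); ++i) {
    if (plan[i] != Action::AwaitExisting) {
      continue;
    }
    for (std::size_t j = 0; j < plan.size(); ++j) {
      if (plan[j] == Action::Download && uris[j] == uris[i]) {
        return true;
      }
    }
  }
  return false;
}
```

With an empty cache and the same URI listed twice, {{planBatch}} yields a
download followed by a wait on the placeholder that the download just created,
so {{waitsOnItself}} reports true. With distinct URIs it reports false.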