Jason Lowe created YARN-426:
-------------------------------
Summary: Failure to download a public resource on a node prevents
further downloads of the resource from that node
Key: YARN-426
URL: https://issues.apache.org/jira/browse/YARN-426
Project: Hadoop YARN
Issue Type: Bug
Components: nodemanager
Affects Versions: 0.23.6, 2.0.3-alpha
Reporter: Jason Lowe
Assignee: Jason Lowe
Priority: Critical
If the NM encounters an error while downloading a public resource, it fails to
empty the list of request events corresponding to the resource request in
{{attempts}}. If the same public resource is subsequently requested on that
node, {{PublicLocalizer.addResource}} will skip the download since it will
mistakenly believe a download of that resource is already in progress. At that
point any container that requests the public resource will just hang in the
{{LOCALIZING}} state.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira