[
https://issues.apache.org/jira/browse/YARN-1575?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Jason Lowe updated YARN-1575:
-----------------------------
Attachment: YARN-1575.patch
YARN-1575.branch-0.23.patch
Attaching a blunt fix for the race condition: synchronize the queueing and the
update of {{pending}} together. This basically defeats the point of
{{pending}} being a ConcurrentHashMap, so I changed it to a synchronized map,
which is still needed because some unit tests access it asynchronously.
For 0.23 we already synchronize on {{attempts}}, so I piggy-backed the
synchronization on that variable.
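
For readers without the patch at hand, here is a minimal standalone sketch of the idea, not the patch itself. It assumes the race is that a download can complete, and be taken off the completion queue, before its Future has been recorded in {{pending}}. The class and method names (PendingSketch, addResource, takeCompleted) are hypothetical and do not come from ResourceLocalizationService.

```java
import java.util.Collections;
import java.util.HashMap;
import java.util.Map;
import java.util.concurrent.CompletionService;
import java.util.concurrent.ExecutorCompletionService;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;

// Hypothetical stand-in for the public localizer; not the real Hadoop class.
class PendingSketch {
    private final ExecutorService pool = Executors.newFixedThreadPool(4);
    private final CompletionService<String> queue =
        new ExecutorCompletionService<String>(pool);

    // A synchronized map locks on itself for every call, so the
    // synchronized (pending) block below and pending.remove() share one monitor.
    private final Map<Future<String>, String> pending =
        Collections.synchronizedMap(new HashMap<Future<String>, String>());

    // Queue the download and record it in pending as one atomic step.
    void addResource(String name) {
        synchronized (pending) {
            Future<String> f = queue.submit(() -> name); // stand-in for the download
            pending.put(f, name);
        }
    }

    // Take one completed download and look up which resource it was.
    // Returns null if the Future is unknown (the failure in this issue).
    String takeCompleted() {
        try {
            Future<String> done = queue.take();
            // Blocks while addResource holds the lock, so the entry is always there.
            return pending.remove(done);
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            throw new IllegalStateException(e);
        }
    }

    void shutdown() {
        pool.shutdown();
    }

    public static void main(String[] args) throws Exception {
        PendingSketch s = new PendingSketch();
        int n = 1000;
        // Queue resources from one thread while the main thread drains completions.
        Thread producer = new Thread(() -> {
            for (int i = 0; i < n; i++) {
                s.addResource("rsrc-" + i);
            }
        });
        producer.start();
        int unknown = 0;
        for (int i = 0; i < n; i++) {
            if (s.takeCompleted() == null) {
                unknown++;
            }
        }
        producer.join();
        s.shutdown();
        System.out.println("unknown resources: " + unknown);
    }
}
```

With the synchronized block in place, takeCompleted should never see an unknown Future. Removing the block reopens the window between submit and put, though whether it is actually hit depends on thread timing.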
> Public localizer crashes with "Localized unkown resource"
> ---------------------------------------------------------
>
> Key: YARN-1575
> URL: https://issues.apache.org/jira/browse/YARN-1575
> Project: Hadoop YARN
> Issue Type: Sub-task
> Components: nodemanager
> Affects Versions: 0.23.10, 2.2.0
> Reporter: Jason Lowe
> Priority: Critical
> Attachments: YARN-1575.branch-0.23.patch, YARN-1575.patch
>
>
> The public localizer can crash with the error:
> {noformat}
> 2014-01-08 14:11:43,212 [Thread-467] ERROR
> org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService:
> Localized unkonwn resource to java.util.concurrent.FutureTask@852e26
> 2014-01-08 14:11:43,212 [Thread-467] INFO
> org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService:
> Public cache exiting
> {noformat}