[ 
https://issues.apache.org/jira/browse/YARN-1575?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jason Lowe updated YARN-1575:
-----------------------------

    Attachment: YARN-1575.patch
                YARN-1575.branch-0.23.patch

Attaching a blunt way to solve the race condition, which is to synchronize the 
queueing and update of {{pending}}.  This basically defeats the point of 
{{pending}} being a ConcurrentHashMap, so I updated it to a synchronized map 
since some unit tests are accessing it asynchronously.

For 0.23 we already are synchronizing {{attempts}}, so I piggy-backed the 
synchronization on that variable.

> Public localizer crashes with "Localized unkown resource"
> ---------------------------------------------------------
>
>                 Key: YARN-1575
>                 URL: https://issues.apache.org/jira/browse/YARN-1575
>             Project: Hadoop YARN
>          Issue Type: Sub-task
>          Components: nodemanager
>    Affects Versions: 0.23.10, 2.2.0
>            Reporter: Jason Lowe
>            Priority: Critical
>         Attachments: YARN-1575.branch-0.23.patch, YARN-1575.patch
>
>
> The public localizer can crash with the error:
> {noformat}
> 2014-01-08 14:11:43,212 [Thread-467] ERROR 
> org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService:
>  Localized unkonwn resource to java.util.concurrent.FutureTask@852e26
> 2014-01-08 14:11:43,212 [Thread-467] INFO 
> org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService:
>  Public cache exiting
> {noformat}



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

Reply via email to