Tsuyoshi OZAWA updated YARN-2052:

    Attachment: YARN-2052.11.patch

Updated a patch to address the comments:
* Bumped up the version of FileSystemRMStateStore.
* Refactored  {{getAndIncrement}} of FileSystemStateStore/ZKRMStateStore to 
remove duplicated check of the epoch znode/file.
* Renamed RMEpoch.java to Epoch.java and RMEpochPBImpl.java to 
EpochPBImpl.java. For the consistency, updated the file/znode name of 
EPOCH_NODE from "RMEpochNode" to "EpochNode".

> ContainerId creation after work preserving restart is broken
> ------------------------------------------------------------
>                 Key: YARN-2052
>                 URL: https://issues.apache.org/jira/browse/YARN-2052
>             Project: Hadoop YARN
>          Issue Type: Sub-task
>          Components: resourcemanager
>            Reporter: Tsuyoshi OZAWA
>            Assignee: Tsuyoshi OZAWA
>         Attachments: YARN-2052.1.patch, YARN-2052.10.patch, 
> YARN-2052.11.patch, YARN-2052.2.patch, YARN-2052.3.patch, YARN-2052.4.patch, 
> YARN-2052.5.patch, YARN-2052.6.patch, YARN-2052.7.patch, YARN-2052.8.patch, 
> YARN-2052.9.patch, YARN-2052.9.patch
> Container ids are made unique by using the app identifier and appending a 
> monotonically increasing sequence number to it. Since container creation is a 
> high churn activity the RM does not store the sequence number per app. So 
> after restart it does not know what the new sequence number should be for new 
> allocations.

This message was sent by Atlassian JIRA

Reply via email to