[
https://issues.apache.org/jira/browse/YARN-7275?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
kartheek muthyala updated YARN-7275:
------------------------------------
Attachment: YARN-7275.005.patch
[~asuresh], can you please review the updated patch for YARN-7275?
It contains the following changes:
1. Introduced RECOVERY_COMPLETED, a ContainerScheduler event that triggers a
retry of queued recovered containers.
2. Introduced a RECOVER_CONTAINER_PAUSED event in ContainerEvents that moves
a container recovered as paused from SCHEDULED to PAUSED after we
initialize the ContainerImpl object for it.
3. Reused the existing event
ContainersLauncherEventType.RECOVER_CONTAINER_PAUSED to reacquire the paused
container instead of failing.
These changes are in addition to the earlier patch that you are aware of. They
only take care of recovered containers that were paused. Containers that had
moved to SCHEDULED but did not get a chance to reach RUNNING are recovered in
the REQUESTED state, so they will ultimately be retried from the
ContainerScheduler queue.
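To make the intended flow easier to follow, here is a rough, self-contained
sketch of the recovery behaviour described above. It is not code from the
patch: RecoverySketch, Scheduler, Container, RecoveredStatus, recover(),
onRecoveryCompleted() and the single "slots" limit are all hypothetical names
and simplifications. Only the state names and the two events come from the
description.

```java
import java.util.ArrayDeque;
import java.util.Queue;

/** Hypothetical, simplified model of the recovery flow; not NodeManager code. */
public class RecoverySketch {
    // Subset of container states mentioned in the comment.
    public enum State { REQUESTED, SCHEDULED, PAUSED, RUNNING }

    // Status read back from the state store (illustrative only).
    public enum RecoveredStatus { REQUESTED, PAUSED }

    public static final class Container {
        public final String id;
        public State state;
        Container(String id, State state) { this.id = id; this.state = state; }
    }

    public static final class Scheduler {
        private final Queue<Container> queued = new ArrayDeque<>();
        private final int slots;   // assumed capacity limit, for illustration
        private int running = 0;

        public Scheduler(int slots) { this.slots = slots; }

        /** Recreate a container from its recovered status. */
        public Container recover(String id, RecoveredStatus status) {
            if (status == RecoveredStatus.PAUSED) {
                // Stands in for RECOVER_CONTAINER_PAUSED: once the object is
                // initialized, move it SCHEDULED -> PAUSED instead of failing.
                Container c = new Container(id, State.SCHEDULED);
                c.state = State.PAUSED;
                return c;
            }
            // Never reached RUNNING: recovered as REQUESTED and queued.
            Container c = new Container(id, State.REQUESTED);
            queued.add(c);
            return c;
        }

        /** Stands in for RECOVERY_COMPLETED: try the queued containers. */
        public void onRecoveryCompleted() {
            int waiting = queued.size();
            for (int i = 0; i < waiting; i++) {
                Container c = queued.poll();
                if (running < slots) {
                    c.state = State.RUNNING;
                    running++;
                } else {
                    // No capacity yet: stays queued in SCHEDULED.
                    c.state = State.SCHEDULED;
                    queued.add(c);
                }
            }
        }
    }

    public static void main(String[] args) {
        Scheduler scheduler = new Scheduler(1);
        Container paused = scheduler.recover("c1", RecoveredStatus.PAUSED);
        Container first = scheduler.recover("c2", RecoveredStatus.REQUESTED);
        Container second = scheduler.recover("c3", RecoveredStatus.REQUESTED);
        scheduler.onRecoveryCompleted();
        System.out.println(paused.id + " " + paused.state);
        System.out.println(first.id + " " + first.state);
        System.out.println(second.id + " " + second.state);
    }
}
```

With one free slot in this toy model, the paused container stays PAUSED, the
first queued container starts, and the second waits in the queue. The real
scheduler's resource accounting is more involved than a slot counter.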
Let me know if the changes seem okay.
> NM Statestore cleanup for Container updates
> -------------------------------------------
>
> Key: YARN-7275
> URL: https://issues.apache.org/jira/browse/YARN-7275
> Project: Hadoop YARN
> Issue Type: Sub-task
> Reporter: Arun Suresh
> Assignee: kartheek muthyala
> Priority: Blocker
> Attachments: YARN-7275.001.patch, YARN-7275.002.patch,
> YARN-7275.003.patch, YARN-7275.004.patch, YARN-7275.005.patch
>
>
> Currently, only resource updates are recorded in the NM state store; we need
> to add ExecutionType updates as well.