[
https://issues.apache.org/jira/browse/YARN-3576?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14528689#comment-14528689
]
Rohith commented on YARN-3576:
------------------------------
What job and how many containers are running? If AM is killing the container
with 143 exit code indicates containers are gracefully finished when the RM is
the process of transitioning. Does all the the containers running on the
NodeManager killed or few killed?
> In Log - Container getting killed by AM even when work preserving is enabled
> -----------------------------------------------------------------------------
>
> Key: YARN-3576
> URL: https://issues.apache.org/jira/browse/YARN-3576
> Project: Hadoop YARN
> Issue Type: Bug
> Environment: SUSE11 SP3
> 3 nodes cluster
> Reporter: Anushri
> Priority: Minor
>
> RM in HA mode
> NM running on one node
> work preserving enabled
> RM in HA mode one NM running work preserving is enabled An application is
> submitted and RM switch over happens. In the NM log it is found that AM kills
> some of the containers and those containers have exit code as 143. but in the
> container logs , logs are found for the same container.
> Problem :
> if work preserving is enabled why is it killing and cleaning the container?
> and if the container is getting killed , why is its log present in container
> logs?
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)