Rohith Sharma K S created YARN-8346:
---------------------------------------

             Summary: Upgrading to 3.1 kills running containers with error 
"Opportunistic container queue is full"
                 Key: YARN-8346
                 URL: https://issues.apache.org/jira/browse/YARN-8346
             Project: Hadoop YARN
          Issue Type: Bug
            Reporter: Rohith Sharma K S


During a rolling upgrade from 2.8.4 to the 3.1 release, all running containers 
are killed and a second attempt is launched for the application. The 
diagnostics message given as the reason for killing the containers is 
"Opportunistic container queue is full".

In the NM log, I see the following message after the container is recovered:
{noformat}
2018-05-23 17:18:50,655 INFO org.apache.hadoop.yarn.server.nodemanager.containermanager.scheduler.ContainerScheduler: Opportunistic container [container_e06_1527075664705_0001_01_000001] will not be queued at the NMsince max queue length [0] has been reached
{noformat}
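
For illustration only, the log line above is consistent with a queue-length check like the one sketched below. This is a simplified model, not the actual ContainerScheduler code: the class QueueLimitSketch and the method tryQueue are hypothetical names. It assumes the limit comes from yarn.nodemanager.opportunistic-containers-max-queue-length, which I believe defaults to 0, so any container treated as opportunistic would be rejected immediately.

```java
import java.util.ArrayDeque;
import java.util.Queue;

// Hypothetical, simplified model of a bounded opportunistic-container queue.
// It does not reproduce the real NodeManager scheduler logic.
class QueueLimitSketch {
    // Assumed to mirror yarn.nodemanager.opportunistic-containers-max-queue-length.
    private final int maxQueueLength;
    private final Queue<String> queued = new ArrayDeque<>();

    QueueLimitSketch(int maxQueueLength) {
        this.maxQueueLength = maxQueueLength;
    }

    // Returns true if the container was queued, false if the queue is full.
    boolean tryQueue(String containerId) {
        if (queued.size() >= maxQueueLength) {
            // With a limit of 0 this branch is always taken.
            return false;
        }
        queued.add(containerId);
        return true;
    }

    public static void main(String[] args) {
        QueueLimitSketch nm = new QueueLimitSketch(0);
        boolean accepted = nm.tryQueue("container_e06_1527075664705_0001_01_000001");
        System.out.println("queued = " + accepted);
    }
}
```

With a limit of 0, tryQueue never accepts a container, which matches the "max queue length [0] has been reached" message. Why the recovered container is handled as opportunistic in the first place is not explained by this sketch.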

The following steps were executed for the rolling upgrade:
# Install a 2.8.4 cluster and launch an MR job with distributed cache enabled.
# Stop 2.8.4 RM. Start 3.1.0 RM with same configuration.
# Stop 2.8.4 NM batch by batch. Start 3.1.0 NM batch by batch. 



