[ 
https://issues.apache.org/jira/browse/YARN-10684?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sushanta Sen updated YARN-10684:
--------------------------------
    Description: 
Preconditions:
 # Secure Hadoop 3.1.1 - 3 Nodes cluster is installed
 # Set the below parameters  in RM yarn-site.xml ::<property>
 <name>yarn.resourcemanager.opportunistic-container-allocation.enabled</name>
 <value>true</value>
 </property>
 # Set this in NM[s]yarn-site.xml ::: <property>
 <name>yarn.nodemanager.opportunistic-containers-max-queue-length</name>
 <value>30</value>
 </property>

 
 Test Steps:

Job Command : :

Job Command :: Job Command : : yarn 
org.apache.hadoop.yarn.applications.distributedshell.Client jar 
HDFS/hadoop/share/hadoop/yarn/hadoop-yarn-applications-distributedshell-3.1.1*.jar
 -shell_command sleep -shell_args 20 -num_containers 20 -container_type 
OPPORTUNISTIC -*promote_opportunistic_after_start*

Actual Result: Distributed Shell Yarn Job Failed almost all times with below 
Diagnostics message

*[ Failed Reason : Application Failure: desired = 10, completed = 10, allocated 
= 10, failed = 2, diagnostics = [2021-02-10 00:00:27.640]Container Killed to 
make room for Guaranteed Container.]*

Expected Result: DS job should be successful with argument 
"promote_opportunistic_after_start"  **  ** 

  was:
Preconditions:
 # Secure Hadoop 3.1.1 - 3 Nodes cluster is installed
 # Set the below parameters  in RM yarn-site.xml ::<property>
 <name>yarn.resourcemanager.opportunistic-container-allocation.enabled</name>
 <value>true</value>
 </property>
 # Set this in NM[s]yarn-site.xml ::: <property>
 <name>yarn.nodemanager.opportunistic-containers-max-queue-length</name>
 <value>30</value>
 </property>

 
 Test Steps:

Job Command : :

Job Command :: yarn org.apache.hadoop.yarn.applications.distributedshell.Client 
-jar 
HDFS/hadoop/share/hadoop/yarn/hadoop-yarn-applications-distributedshell-3.1.1-hw-ei-310001-SNAPSHOT.jar
 -shell_command sleep -shell_args 20 -num_containers 10 -container_type 
OPPORTUNISTIC -*promote_opportunistic_after_start*

Actual Result: Distributed Shell Yarn Job Failed almost all times with below 
Diagnostics message

*[ Failed Reason : Application Failure: desired = 10, completed = 10, allocated 
= 10, failed = 2, diagnostics = [2021-02-10 00:00:27.640]Container Killed to 
make room for Guaranteed Container.]*

Expected Result: DS job should be successful with argument 
"promote_opportunistic_after_start" * ** *


> YARN: Opportunistic Container :: Distributed YARN Job has Failed when tried 
> adding flag -promote_opportunistic_after_start 
> ---------------------------------------------------------------------------------------------------------------------------
>
>                 Key: YARN-10684
>                 URL: https://issues.apache.org/jira/browse/YARN-10684
>             Project: Hadoop YARN
>          Issue Type: Bug
>          Components: distributed-scheduling
>    Affects Versions: 3.1.1
>            Reporter: Sushanta Sen
>            Priority: Major
>
> Preconditions:
>  # Secure Hadoop 3.1.1 - 3 Nodes cluster is installed
>  # Set the below parameters  in RM yarn-site.xml ::<property>
>  <name>yarn.resourcemanager.opportunistic-container-allocation.enabled</name>
>  <value>true</value>
>  </property>
>  # Set this in NM[s]yarn-site.xml ::: <property>
>  <name>yarn.nodemanager.opportunistic-containers-max-queue-length</name>
>  <value>30</value>
>  </property>
>  
>  Test Steps:
> Job Command : :
> Job Command :: Job Command : : yarn 
> org.apache.hadoop.yarn.applications.distributedshell.Client jar 
> HDFS/hadoop/share/hadoop/yarn/hadoop-yarn-applications-distributedshell-3.1.1*.jar
>  -shell_command sleep -shell_args 20 -num_containers 20 -container_type 
> OPPORTUNISTIC -*promote_opportunistic_after_start*
> Actual Result: Distributed Shell Yarn Job Failed almost all times with below 
> Diagnostics message
> *[ Failed Reason : Application Failure: desired = 10, completed = 10, 
> allocated = 10, failed = 2, diagnostics = [2021-02-10 00:00:27.640]Container 
> Killed to make room for Guaranteed Container.]*
> Expected Result: DS job should be successful with argument 
> "promote_opportunistic_after_start"  **  ** 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to