[
https://issues.apache.org/jira/browse/YARN-10684?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sushanta Sen updated YARN-10684:
--------------------------------
Description:
Preconditions:
# A secure Hadoop 3.1.1 cluster with 3 nodes is installed.
# Set the following parameter in the RM yarn-site.xml:
<property>
<name>yarn.resourcemanager.opportunistic-container-allocation.enabled</name>
<value>true</value>
</property>
# Set the following parameter in the NM(s) yarn-site.xml:
<property>
<name>yarn.nodemanager.opportunistic-containers-max-queue-length</name>
<value>30</value>
</property>
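
The preconditions can be checked before running the job. The following is a minimal sketch, not part of the original report: the helper names (read_properties, missing_or_wrong) and the SAMPLE text are hypothetical, and it assumes the yarn-site.xml content has already been read into a string. On a real cluster the RM and NM files would each be checked against their own property.

```python
import xml.etree.ElementTree as ET

# Property names and values taken from the preconditions in this report.
REQUIRED = {
    "yarn.resourcemanager.opportunistic-container-allocation.enabled": "true",
    "yarn.nodemanager.opportunistic-containers-max-queue-length": "30",
}


def read_properties(xml_text):
    """Return {name: value} for every <property> in a Hadoop-style config."""
    root = ET.fromstring(xml_text)
    props = {}
    for prop in root.iter("property"):
        name = prop.findtext("name")
        if name is not None:
            props[name.strip()] = (prop.findtext("value") or "").strip()
    return props


def missing_or_wrong(props, required=REQUIRED):
    """Return the required names whose value is absent or different."""
    return sorted(n for n, v in required.items() if props.get(n) != v)


# Hypothetical sample: only the RM property is present.
SAMPLE = """<configuration>
  <property>
    <name>yarn.resourcemanager.opportunistic-container-allocation.enabled</name>
    <value>true</value>
  </property>
</configuration>"""

print(missing_or_wrong(read_properties(SAMPLE)))
# prints ['yarn.nodemanager.opportunistic-containers-max-queue-length']
```

An empty list means every required property in the given mapping is set as described.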
Test Steps:
Job Command:
yarn org.apache.hadoop.yarn.applications.distributedshell.Client -jar
HDFS/hadoop/share/hadoop/yarn/hadoop-yarn-applications-distributedshell-3.1.1*.jar
-shell_command sleep -shell_args 20 -num_containers 20 -container_type
OPPORTUNISTIC -promote_opportunistic_after_start
Actual Result: The Distributed Shell YARN job fails almost every time with the
following diagnostics message:
*[ Failed Reason : Application Failure: desired = 10, completed = 10, allocated
= 10, failed = 2, diagnostics = [2021-02-10 00:00:27.640]Container Killed to
make room for Guaranteed Container.]*
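
When the failure is intermittent, it can help to pull the counters out of the diagnostics message across repeated runs. The following is a minimal sketch under stated assumptions: the function names are hypothetical, and the regular expression assumes the "name = N" layout seen in the message above rather than any documented format.

```python
import re

# Diagnostics text copied from the report above.
DIAG = (
    "Application Failure: desired = 10, completed = 10, allocated = 10, "
    "failed = 2, diagnostics = [2021-02-10 00:00:27.640]Container Killed to "
    "make room for Guaranteed Container."
)


def parse_counters(message):
    """Extract the integer 'name = N' counters from a failure message."""
    return {k: int(v) for k, v in re.findall(r"(\w+) = (\d+)", message)}


def killed_for_guaranteed(message):
    """True if a container was killed to make room for a Guaranteed one."""
    return "Container Killed to make room for Guaranteed Container" in message


print(parse_counters(DIAG))
# prints {'desired': 10, 'completed': 10, 'allocated': 10, 'failed': 2}
print(killed_for_guaranteed(DIAG))
# prints True
```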
Expected Result: The DS job should succeed when run with the
"promote_opportunistic_after_start" argument.
was:
Preconditions:
# A secure Hadoop 3.1.1 cluster with 3 nodes is installed.
# Set the following parameter in the RM yarn-site.xml:
<property>
<name>yarn.resourcemanager.opportunistic-container-allocation.enabled</name>
<value>true</value>
</property>
# Set the following parameter in the NM(s) yarn-site.xml:
<property>
<name>yarn.nodemanager.opportunistic-containers-max-queue-length</name>
<value>30</value>
</property>
Test Steps:
Job Command:
yarn org.apache.hadoop.yarn.applications.distributedshell.Client -jar
HDFS/hadoop/share/hadoop/yarn/hadoop-yarn-applications-distributedshell-3.1.1-hw-ei-310001-SNAPSHOT.jar
-shell_command sleep -shell_args 20 -num_containers 10 -container_type
OPPORTUNISTIC -promote_opportunistic_after_start
Actual Result: The Distributed Shell YARN job fails almost every time with the
following diagnostics message:
*[ Failed Reason : Application Failure: desired = 10, completed = 10, allocated
= 10, failed = 2, diagnostics = [2021-02-10 00:00:27.640]Container Killed to
make room for Guaranteed Container.]*
Expected Result: The DS job should succeed when run with the
"promote_opportunistic_after_start" argument.
> YARN: Opportunistic Container :: Distributed YARN Job has Failed when tried
> adding flag -promote_opportunistic_after_start
> ---------------------------------------------------------------------------------------------------------------------------
>
> Key: YARN-10684
> URL: https://issues.apache.org/jira/browse/YARN-10684
> Project: Hadoop YARN
> Issue Type: Bug
> Components: distributed-scheduling
> Affects Versions: 3.1.1
> Reporter: Sushanta Sen
> Priority: Major