Kishor Patil created SPARK-17511:
------------------------------------
Summary: Dynamic allocation race condition: Containers getting
marked failed while releasing
Key: SPARK-17511
URL: https://issues.apache.org/jira/browse/SPARK-17511
Project: Spark
Issue Type: Bug
Components: YARN
Affects Versions: 2.0.0, 2.0.1, 2.1.0
Reporter: Kishor Patil
While trying to reach launch multiple containers in pool, if running executors
count reaches or goes beyond the target running executors, the container is
released and marked failed. This can cause many jobs to be marked failed
causing overall job failure.
I will have a patch up soon after completing testing.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]