Kishor Patil created SPARK-17511:
------------------------------------

             Summary: Dynamic allocation race condition: Containers getting 
marked failed while releasing
                 Key: SPARK-17511
                 URL: https://issues.apache.org/jira/browse/SPARK-17511
             Project: Spark
          Issue Type: Bug
          Components: YARN
    Affects Versions: 2.0.0, 2.0.1, 2.1.0
            Reporter: Kishor Patil


While trying to reach launch multiple containers in pool, if running executors 
count reaches or goes beyond the target running executors, the container is 
released and marked failed. This can cause many jobs to be marked failed 
causing overall job failure.

I will have a patch up soon after completing testing.





--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to