squito commented on issue #26433: [SPARK-29771][K8S] Add configure to limit 
executor failures
URL: https://github.com/apache/spark/pull/26433#issuecomment-553153970
 
 
   on how this change relates to blacklisting: yes, the distinction proposed 
here makes sense, blacklisting does not cover this case.  Blacklisting is 
intended for when there is a problem with some subset of resources (eg. a bad 
disk on some nodes), not for complete application failure like this case.
   
   There is a related issue which is under blacklisting, which isnt' currently 
handled: if containers fail to start on only *some* nodes, then spark should 
avoid using those nodes https://issues.apache.org/jira/browse/SPARK-23485

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
[email protected]


With regards,
Apache Git Services

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to