GitHub user squito commented on the issue:

https://github.com/apache/spark/pull/21068

A couple more high-level thoughts:

1) Do we want to have an event posted about the node getting blacklisted? I think it would be useful, but then there needs to be a message from the YarnAllocator back to the driver about the blacklisting.

2) I was thinking about how this interacts with [SPARK-13669](https://issues.apache.org/jira/browse/SPARK-13669). At first I thought this made that entirely unnecessary, but I guess that is not true: SPARK-13669 is still useful if the external shuffle service goes down *after* the executor is started.
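
To make point 1 a bit more concrete, here is a rough, language-neutral sketch of the flow I have in mind, written in Python only for brevity. Everything in it is hypothetical: `NodeBlacklisted`, `Driver`, `Allocator`, and the `max_failures` threshold are illustrative stand-ins, not existing Spark classes or configs, and the real change would live in the Scala YARN code.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Set


@dataclass(frozen=True)
class NodeBlacklisted:
    """Hypothetical allocator -> driver message (not a real Spark class)."""
    host: str
    reason: str


class Driver:
    """Stand-in for the driver side: re-posts allocator messages as events."""

    def __init__(self) -> None:
        self.listeners: List[Callable[[NodeBlacklisted], None]] = []

    def receive(self, msg: NodeBlacklisted) -> None:
        # Forward the message to every registered listener (UI, event log, ...).
        for listener in self.listeners:
            listener(msg)


class Allocator:
    """Stand-in for the allocator: blacklists a host, then tells the driver."""

    def __init__(self, driver: Driver, max_failures: int = 2) -> None:
        self.driver = driver
        self.max_failures = max_failures  # assumed threshold, made up for the sketch
        self.failures: Dict[str, int] = {}
        self.blacklisted: Set[str] = set()

    def on_allocation_failure(self, host: str) -> None:
        self.failures[host] = self.failures.get(host, 0) + 1
        crossed = self.failures[host] >= self.max_failures
        if crossed and host not in self.blacklisted:
            self.blacklisted.add(host)
            # Notify the driver exactly once per blacklisted host.
            reason = f"{self.failures[host]} allocation failures"
            self.driver.receive(NodeBlacklisted(host, reason))
```

Used like this, a listener sees a single event for the host even if failures keep arriving afterwards:

```python
events = []
driver = Driver()
driver.listeners.append(events.append)
allocator = Allocator(driver, max_failures=2)
for _ in range(3):
    allocator.on_allocation_failure("node-1")
```

The point of the sketch is only that the blacklisting decision and the event posting happen in different places, so some message has to cross that boundary.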