Github user kayousterhout commented on a diff in the pull request:
https://github.com/apache/spark/pull/20640#discussion_r169500415
--- Diff:
resource-managers/mesos/src/main/scala/org/apache/spark/scheduler/cluster/mesos/MesosCoarseGrainedSchedulerBackend.scala
---
@@ -571,7 +568,7 @@ private[spark] class MesosCoarseGrainedSchedulerBackend(
cpus + totalCoresAcquired <= maxCores &&
mem <= offerMem &&
numExecutors < executorLimit &&
- slaves.get(slaveId).map(_.taskFailures).getOrElse(0) <
MAX_SLAVE_FAILURES &&
+ !scheduler.nodeBlacklist().contains(slaveId) &&
--- End diff --
In other places it looks like the hostname is used in the blacklist - why
does this check against the slaveId instead of the offerHostname?
---
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]