GitHub user kayousterhout commented on a diff in the pull request:
https://github.com/apache/spark/pull/14079#discussion_r72537255
--- Diff: core/src/main/scala/org/apache/spark/scheduler/TaskSetManager.scala ---
@@ -83,8 +85,15 @@ private[spark] class TaskSetManager(
val copiesRunning = new Array[Int](numTasks)
val successful = new Array[Boolean](numTasks)
private val numFailures = new Array[Int](numTasks)
- // key is taskId (aka TaskInfo.index), value is a Map of executor id to when it failed
- private val failedExecutors = new HashMap[Int, HashMap[String, Long]]()
+ val execToFailures: HashMap[String, FailureStatus] = new HashMap()
+ /**
+ * Map from node to all executors on it with failures. Needed because we want to know about
+ * executors on a node even after they have died.
+ */
+ private val nodesToExecsWithFailures: HashMap[String, HashSet[String]] = new HashMap()
+ private val nodeBlacklistedTasks: HashMap[String, HashSet[Int]] = new HashMap()
--- End diff --
Rename this to nodeToBlacklistedTasks.