Github user mccheah commented on a diff in the pull request:
https://github.com/apache/spark/pull/8007#discussion_r38568718
--- Diff: yarn/src/main/scala/org/apache/spark/deploy/yarn/ApplicationMaster.scala ---
@@ -590,6 +605,18 @@ private[spark] class ApplicationMaster(
case None => logWarning("Container allocator is not ready to kill executors yet.")
}
context.reply(true)
+
+ case GetExecutorLossReason(eid) =>
+ Option(allocator) match {
+ case Some(a) =>
+ pendingLossReasonRequests.synchronized {
+ pendingLossReasonRequests
--- End diff --
We can probably remove the race by checking whether the executor exists in
the executor ID map before adding to pendingLossReasonRequests. We need to be
careful about synchronization, however.
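
A rough sketch of what I mean, not the PR's actual code. Everything here
except the name pendingLossReasonRequests is hypothetical (the class,
executorIds, enqueueIfKnown, executorLost, and the String => Unit reply type
are all stand-ins). The idea is that one lock guards both the ID set and the
pending map, so the existence check and the enqueue cannot interleave with
executor removal:

```scala
import scala.collection.mutable

// Hypothetical stand-in for the allocator state; only the name
// pendingLossReasonRequests is taken from the diff.
final class LossReasonRequests {
  type Reply = String => Unit

  // A single lock guards both collections, so "check then enqueue" is atomic
  // with respect to executor removal.
  private val lock = new Object
  private val executorIds = mutable.HashSet.empty[String]
  private val pendingLossReasonRequests =
    mutable.HashMap.empty[String, mutable.ArrayBuffer[Reply]]

  def registerExecutor(eid: String): Unit = lock.synchronized {
    executorIds += eid
  }

  // Returns true if the request was queued. Returns false if the executor is
  // not in the ID set (already removed or never registered), in which case
  // the caller has to answer the request some other way.
  def enqueueIfKnown(eid: String, reply: Reply): Boolean = lock.synchronized {
    if (executorIds.contains(eid)) {
      pendingLossReasonRequests
        .getOrElseUpdate(eid, mutable.ArrayBuffer.empty[Reply]) += reply
      true
    } else {
      false
    }
  }

  // Removes the executor and answers every request queued for it.
  def executorLost(eid: String, reason: String): Unit = {
    val waiting = lock.synchronized {
      executorIds -= eid
      pendingLossReasonRequests
        .remove(eid)
        .getOrElse(mutable.ArrayBuffer.empty[Reply])
    }
    // Run the callbacks outside the lock so it is not held during replies.
    waiting.foreach(reply => reply(reason))
  }
}
```

With this shape a request that arrives after the executor has been removed is
rejected immediately rather than sitting in the pending map with nobody left
to answer it. How the rejected case should be answered is left open here.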