[
https://issues.apache.org/jira/browse/SPARK-3413?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14341732#comment-14341732
]
Sean Owen commented on SPARK-3413:
----------------------------------
This looks like it might be stale. It's also not a great deal of info to go on.
The driver should be rescheduling tasks that fail or whose executors fail,
right? Is there more info? can this still be reproduced?
> Spark Blocked due to Executor lost in FIFO MODE
> -----------------------------------------------
>
> Key: SPARK-3413
> URL: https://issues.apache.org/jira/browse/SPARK-3413
> Project: Spark
> Issue Type: Bug
> Components: Spark Core
> Affects Versions: 0.9.2
> Reporter: Patrick Liu
>
> I run spark on yarn.
> Spark scheduler is running in FIFO mode.
> I have 80 worker instances setup. However, as time passes, some worker will
> be lost. (Killed by JVM when OOM, etc).
> But some tasks will still run in those executors.
> Obviously the task will never finished.
> Then the stage will not finish. So the later stages will be blocked.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]