Github user tgravescs commented on the pull request:
https://github.com/apache/spark/pull/12735#issuecomment-215436163
Actually, after looking a bit more, Spark does fail fast if the shuffle
service isn't there: very soon after startup the BlockManager registers
with the shuffle service, so if the service didn't come up, the executors
should fail quickly. Is this what you were seeing?
This doesn't seem so bad to me; at least it isn't wasting a bunch of work.
Yes, new executors could get scheduled there, but they should fail very
quickly without wasting work.
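
To illustrate the fail-fast behavior described above, here is a minimal sketch
in Python. It is not Spark's actual code: the names
`register_with_shuffle_service`, `start_executor`, and
`ShuffleServiceUnavailable`, along with the retry count and delay, are
hypothetical and chosen only for illustration. The point is that registration
happens at startup, before any task runs, so a missing service stops the
executor before it has done any work.

```python
import time


class ShuffleServiceUnavailable(Exception):
    """Raised when the (hypothetical) shuffle service cannot be reached."""


def register_with_shuffle_service(connect, max_attempts=3, delay_secs=0.0):
    """Call connect() up to max_attempts times; re-raise on the last failure.

    The retry count and delay are assumptions, not Spark's settings.
    """
    for attempt in range(1, max_attempts + 1):
        try:
            return connect()
        except ShuffleServiceUnavailable:
            if attempt == max_attempts:
                raise  # give up: the executor should not start
            time.sleep(delay_secs)


def start_executor(connect):
    """Register first, and only then report the executor as ready for tasks."""
    register_with_shuffle_service(connect)
    return "ready"
```

If `connect` keeps raising `ShuffleServiceUnavailable`, `start_executor`
propagates the error after the last attempt and never returns "ready", so no
tasks are accepted on an executor whose shuffle service is missing.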