Github user mridulm commented on the issue:
https://github.com/apache/spark/pull/20998
Adding isRunning can cause a single 'bad' node (from task pov - not
necessarily only bad hardware: just that task fails on node) can keep tasks to
fail repeatedly causing app to exit.
Particularly with blacklist'ing, I am not very sure how the interactions
will play out .. @squito might have more comments.
In general, this is not a benign change imo and can have non trivial side
effects.
In the specific usecase of only two machines, it is an unfortunate side
effect.
---
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]