attilapiros commented on a change in pull request #26343: [SPARK-29683][YARN] Job will fail due to executor failures all available nodes are blacklisted URL: https://github.com/apache/spark/pull/26343#discussion_r383238115
########## File path: docs/running-on-yarn.md ########## @@ -484,6 +484,15 @@ To use a custom metrics.properties for the application master and executors, upd Comma-separated list of YARN node names which are excluded from resource allocation. </td> </tr> +<tr> + <td><code>spark.blacklist.waiting.millis</code></td> + <td>60000</td> + <td> + A waiting time in millis for more yarn resource when avaliable nodes are all in blacklist. + This helps to avoid job failing directly in some scenes. For example, HA YARN resourcemanager Review comment: Nit: remove the space after "resourcemanager" at the end of the line ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [email protected] With regards, Apache Git Services --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
