Xingbo Jiang created SPARK-37151: ------------------------------------ Summary: Avoid executor state sync attempt fail continuously in a short timeframe Key: SPARK-37151 URL: https://issues.apache.org/jira/browse/SPARK-37151 Project: Spark Issue Type: Improvement Components: Spark Core Affects Versions: 3.2.0 Reporter: Xingbo Jiang Assignee: Xingbo Jiang
An executor would retry sending the ExecutorStateChanged message when the previous attempt failed. This would not be an issue when the attempt failed with TimeoutException. But if the connection between the executor and the Master is broken, the attempt would fail immediately, leading to the retry attempt also fail, and quickly reaches the max attempt limitation. -- This message was sent by Atlassian Jira (v8.3.4#803005) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org