sunchao commented on a change in pull request #35109:
URL: https://github.com/apache/spark/pull/35109#discussion_r779207954
##########
File path:
resource-managers/kubernetes/core/src/main/scala/org/apache/spark/deploy/k8s/Config.scala
##########
@@ -147,26 +147,28 @@ private[spark] object Config extends Logging {
.createWithDefault(0)
object ExecutorRollPolicy extends Enumeration {
- val ID, ADD_TIME, TOTAL_GC_TIME, TOTAL_DURATION, AVERAGE_DURATION, FAILED_TASKS = Value
+ val ID, ADD_TIME, TOTAL_GC_TIME, TOTAL_DURATION, AVERAGE_DURATION, FAILED_TASKS, OUTLIER = Value
}
val EXECUTOR_ROLL_POLICY =
ConfigBuilder("spark.kubernetes.executor.rollPolicy")
- .doc("Executor roll policy: Valid values are ID, ADD_TIME, TOTAL_GC_TIME (default), " +
- "TOTAL_DURATION, and FAILED_TASKS. " +
+ .doc("Executor roll policy: Valid values are ID, ADD_TIME, TOTAL_GC_TIME, " +
+ "TOTAL_DURATION, FAILED_TASKS, and OUTLIER (default). " +
"When executor roll happens, Spark uses this policy to choose " +
"an executor and decommission it. The built-in policies are based on executor summary." +
"ID policy chooses an executor with the smallest executor ID. " +
"ADD_TIME policy chooses an executor with the smallest add-time. " +
"TOTAL_GC_TIME policy chooses an executor with the biggest total task GC time. " +
"TOTAL_DURATION policy chooses an executor with the biggest total task time. " +
"AVERAGE_DURATION policy chooses an executor with the biggest average task time. " +
- "FAILED_TASKS policy chooses an executor with the most number of failed tasks.")
+ "FAILED_TASKS policy chooses an executor with the most number of failed tasks. " +
+ "OUTLIER policy chooses an executor with outstanding statistics if exists. " +
Review comment:
nit: maybe explain a bit what "outstanding statistics" means?
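For readers of the thread, one concrete reading of "outstanding statistics" (per the `outliers` helper quoted below in this review): an executor's metric is outstanding when it exceeds the mean by more than two standard deviations. A minimal standalone sketch; the object and method names are illustrative, not the PR's code:

```scala
// Illustrative sketch of the 2-sigma rule behind the OUTLIER policy.
// A value is "outstanding" for a metric when it exceeds the mean of all
// executors' values by more than two (population) standard deviations.
object TwoSigmaSketch {
  def isOutstanding(values: Seq[Float], v: Float): Boolean = {
    val mean = values.sum / values.size
    val variance = values.map(x => (x - mean) * (x - mean)).sum / values.size
    (v - mean) > 2 * math.sqrt(variance)
  }

  def main(args: Array[String]): Unit = {
    // Nine quiet executors and one with a very large metric value.
    val metric = Seq.fill(9)(10f) :+ 100f   // mean = 19, sd = 27
    assert(isOutstanding(metric, 100f))     // 100 - 19 = 81 > 2 * 27 = 54
    assert(!isOutstanding(metric, 10f))
    println("ok")
  }
}
```

Note that with very few executors a single extreme value can never clear the 2-sigma bar (a lone point's deviation is bounded by (n-1)/sqrt(n) standard deviations), which is consistent with the policy's fallback behavior.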
##########
File path:
resource-managers/kubernetes/core/src/main/scala/org/apache/spark/scheduler/cluster/k8s/ExecutorRollPlugin.scala
##########
@@ -116,7 +117,39 @@ class ExecutorRollDriverPlugin extends DriverPlugin with Logging {
listWithoutDriver.sortBy(e => e.totalDuration.toFloat / Math.max(1, e.totalTasks)).reverse
case ExecutorRollPolicy.FAILED_TASKS =>
listWithoutDriver.sortBy(_.failedTasks).reverse
+ case ExecutorRollPolicy.OUTLIER =>
+ // We build multiple outlier lists and concat in the following importance order to find
+ // outliers in various perspective:
+ // AVERAGE_DURATION > TOTAL_DURATION > TOTAL_GC_TIME > FAILED_TASKS
+ // Since we will choose only first item, the duplication is okay. If there is no outlier,
+ // We fallback to TOTAL_DURATION policy.
+ outliers(listWithoutDriver.filter(_.totalTasks > 0), e => e.totalDuration / e.totalTasks) ++
+ outliers(listWithoutDriver, e => e.totalDuration) ++
+ outliers(listWithoutDriver, e => e.totalGCTime) ++
+ outliers(listWithoutDriver, e => e.failedTasks) ++
+ listWithoutDriver.sortBy(_.totalDuration).reverse
}
sortedList.headOption.map(_.id)
}
+
+ /**
+ * Return executors whose metrics is outstanding, '(value - mean) > 2-sigma'. This is a
Review comment:
It'd be nice if we had some reference for this calculation.
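The prioritized concatenation and fallback described in the quoted comment can be exercised with a self-contained sketch. `Exec` below is a hypothetical stand-in for `v1.ExecutorSummary`, only two metrics are modeled, and the priority order is abbreviated:

```scala
// Sketch of "concat outlier lists in importance order, fall back to
// TOTAL_DURATION". Only the head of the combined list is used, so
// duplicates across the sub-lists are harmless.
object RollOrderSketch {
  case class Exec(id: String, totalDuration: Long, totalGCTime: Long)

  // 2-sigma outlier filter, mirroring the shape of the PR's helper.
  def outliers(list: Seq[Exec], get: Exec => Float): Seq[Exec] = {
    if (list.isEmpty) list
    else {
      val values = list.map(get)
      val mean = values.sum / list.size
      val sd = math.sqrt(values.map(v => (v - mean) * (v - mean)).sum / list.size)
      list.filter(e => (get(e) - mean) > 2 * sd)
    }
  }

  def choose(list: Seq[Exec]): Option[String] =
    (outliers(list, _.totalGCTime.toFloat) ++
      outliers(list, _.totalDuration.toFloat) ++
      list.sortBy(_.totalDuration).reverse).headOption.map(_.id)

  def main(args: Array[String]): Unit = {
    // "b" is a 2-sigma outlier on GC time, so it is chosen even though
    // "e9" has the largest total duration.
    val execs = (1 to 9).map(i => Exec(s"e$i", 100L * i, 10L)) :+ Exec("b", 50L, 100L)
    assert(choose(execs) == Some("b"))

    // With no outlier on any metric, the fallback picks the largest duration.
    val plain = (1 to 5).map(i => Exec(s"e$i", 100L * i, 10L))
    assert(choose(plain) == Some("e5"))
    println("ok")
  }
}
```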
##########
File path:
resource-managers/kubernetes/core/src/main/scala/org/apache/spark/scheduler/cluster/k8s/ExecutorRollPlugin.scala
##########
@@ -116,7 +117,39 @@ class ExecutorRollDriverPlugin extends DriverPlugin with Logging {
listWithoutDriver.sortBy(e => e.totalDuration.toFloat / Math.max(1, e.totalTasks)).reverse
case ExecutorRollPolicy.FAILED_TASKS =>
listWithoutDriver.sortBy(_.failedTasks).reverse
+ case ExecutorRollPolicy.OUTLIER =>
+ // We build multiple outlier lists and concat in the following importance order to find
+ // outliers in various perspective:
+ // AVERAGE_DURATION > TOTAL_DURATION > TOTAL_GC_TIME > FAILED_TASKS
+ // Since we will choose only first item, the duplication is okay. If there is no outlier,
+ // We fallback to TOTAL_DURATION policy.
+ outliers(listWithoutDriver.filter(_.totalTasks > 0), e => e.totalDuration / e.totalTasks) ++
+ outliers(listWithoutDriver, e => e.totalDuration) ++
+ outliers(listWithoutDriver, e => e.totalGCTime) ++
+ outliers(listWithoutDriver, e => e.failedTasks) ++
+ listWithoutDriver.sortBy(_.totalDuration).reverse
}
sortedList.headOption.map(_.id)
}
+
+ /**
+ * Return executors whose metrics is outstanding, '(value - mean) > 2-sigma'. This is a
+ * best-effort approach because the snapshot of ExecutorSummary is not a normal distribution.
+ * In case of normal distribution, this is known to be 2.5 percent.
+ */
+ private def outliers(
+ list: Seq[v1.ExecutorSummary],
+ get: v1.ExecutorSummary => Float): Seq[v1.ExecutorSummary] = {
+ if (list.isEmpty) {
+ list
+ } else {
+ val size = list.size
+ val mean = list.map(get).sum / size
+ val sd = sqrt(list.map(e => (get(e) - mean) * (get(e) - mean)).sum / size)
Review comment:
nit: maybe we can optimize this a bit and only call `get(e)` once?
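One possible shape for this optimization, sketched outside the PR: pair each executor with its extracted value up front, so `get` runs exactly once per element across the mean, the deviation sum, and the final filter. `Exec` is a hypothetical stand-in for `v1.ExecutorSummary`:

```scala
object OutliersOnce {
  // Hypothetical stand-in for v1.ExecutorSummary; only the metric matters here.
  case class Exec(id: String, metric: Float)

  // Same 2-sigma rule as the patch, but `get` is evaluated once per element
  // by materializing (executor, value) pairs before computing mean and sd.
  def outliers(list: Seq[Exec], get: Exec => Float): Seq[Exec] = {
    if (list.isEmpty) list
    else {
      val withValues = list.map(e => (e, get(e)))
      val mean = withValues.map(_._2).sum / list.size
      val sd = math.sqrt(
        withValues.map { case (_, v) => (v - mean) * (v - mean) }.sum / list.size)
      withValues.collect { case (e, v) if (v - mean) > 2 * sd => e }
    }
  }

  def main(args: Array[String]): Unit = {
    var calls = 0
    val execs = (1 to 9).map(i => Exec(s"e$i", 10f)) :+ Exec("hot", 100f)
    val result = outliers(execs, e => { calls += 1; e.metric })
    assert(result.map(_.id) == Seq("hot"))
    assert(calls == execs.size)   // one extraction per executor
    println("ok")
  }
}
```

Besides saving calls, this also guards against `get` being a non-trivial computation (e.g. a division per element, as in the AVERAGE_DURATION case).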
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]