This is an automated email from the ASF dual-hosted git repository.
chengpan pushed a commit to branch master
in repository https://gitbox.apache.org/repos/asf/kyuubi.git
The following commit(s) were added to refs/heads/master by this push:
new 2d64255874 [KYUUBI #6720] K8s pod OOM Killed should be identified as
Application failed state
2d64255874 is described below
commit 2d642558742385524f1c39b661c3aabd9090ea70
Author: madlnu <[email protected]>
AuthorDate: Wed Oct 2 19:12:43 2024 +0800
[KYUUBI #6720] K8s pod OOM Killed should be identified as Application
failed state
# :mag: Description
## Issue References ๐
This pull request fixes #6720
## Describe Your Solution ๐ง
If pod goes into OOMKilled state, application should be marked as KILLED,
which is eventually identified as isFailed
## Types of changes :bookmark:
- [ ] Bugfix (non-breaking change which fixes an issue)
- [ ] New feature (non-breaking change which adds functionality)
- [ ] Breaking change (fix or feature that would cause existing
functionality to change)
## Test Plan ๐งช
Tested locally, was able to launch new session
<img width="922" alt="kyuubi_new_session"
src="https://github.com/user-attachments/assets/b003c86f-484d-40c5-b173-847374a45b1d">
---
**Be nice. Be informative.**
Closes #6721 from Madhukar525722/OOM.
Closes #6720
cd0bdf633 [madlnu] [KYUUBI #6720] K8s pod OOM Killed should be identified
as Application failed state
Authored-by: madlnu <[email protected]>
Signed-off-by: Cheng Pan <[email protected]>
---
.../org/apache/kyuubi/engine/KubernetesApplicationOperation.scala | 3 ++-
1 file changed, 2 insertions(+), 1 deletion(-)
diff --git
a/kyuubi-server/src/main/scala/org/apache/kyuubi/engine/KubernetesApplicationOperation.scala
b/kyuubi-server/src/main/scala/org/apache/kyuubi/engine/KubernetesApplicationOperation.scala
index 41b4d44817..74dc398f05 100644
---
a/kyuubi-server/src/main/scala/org/apache/kyuubi/engine/KubernetesApplicationOperation.scala
+++
b/kyuubi-server/src/main/scala/org/apache/kyuubi/engine/KubernetesApplicationOperation.scala
@@ -34,7 +34,7 @@ import org.apache.kyuubi.config.KyuubiConf
import org.apache.kyuubi.config.KyuubiConf.{KubernetesApplicationStateSource,
KubernetesCleanupDriverPodStrategy}
import
org.apache.kyuubi.config.KyuubiConf.KubernetesApplicationStateSource.KubernetesApplicationStateSource
import
org.apache.kyuubi.config.KyuubiConf.KubernetesCleanupDriverPodStrategy.{ALL,
COMPLETED, NONE}
-import org.apache.kyuubi.engine.ApplicationState.{isTerminated,
ApplicationState, FAILED, FINISHED, NOT_FOUND, PENDING, RUNNING, UNKNOWN}
+import org.apache.kyuubi.engine.ApplicationState.{isTerminated,
ApplicationState, FAILED, FINISHED, KILLED, NOT_FOUND, PENDING, RUNNING,
UNKNOWN}
import org.apache.kyuubi.operation.OperationState
import org.apache.kyuubi.server.KyuubiServer
import org.apache.kyuubi.session.KyuubiSessionManager
@@ -535,6 +535,7 @@ object KubernetesApplicationOperation extends Logging {
case "Running" => RUNNING
case "Succeeded" => FINISHED
case "Failed" | "Error" => FAILED
+ case "OOMKilled" => KILLED
case "Unknown" => UNKNOWN
case _ =>
warn(s"The spark driver pod state: $podState is not supported, " +