Ranju created SPARK-34389:
-----------------------------
Summary: Spark job on Kubernetes scheduled For Zero or less than
minimum number of executors and Wait indefinitely under resource starvation
Key: SPARK-34389
URL: https://issues.apache.org/jira/browse/SPARK-34389
Project: Spark
Issue Type: Bug
Components: Kubernetes
Affects Versions: 3.0.1
Reporter: Ranju
In case Cluster does not have sufficient resource (CPU/ Memory ) for minimum
number of executors , the executors goes in Pending State for indefinite time
until the resource gets free.
Suppose, Cluster Configurations are:
total Memory=204Gi
used Memory=200Gi
free memory= 4Gi
SPARK.EXECUTOR.MEMORY=10G
SPARK.DYNAMICALLOCTION.MINEXECUTORS=4
SPARK.DYNAMICALLOCATION.MAXEXECUTORS=8
Rather, the job should be cancelled if requested number of minimum executors
are not availableĀ at that point of time because of resource unavailability.
Currently it is doing partial scheduling or no scheduling and waiting
indefinitely. And the job got stuck.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]