Dejiu Lu created SPARK-55888:
--------------------------------
Summary: Parallel pod creation for Kubernetes executor allocation
Key: SPARK-55888
URL: https://issues.apache.org/jira/browse/SPARK-55888
Project: Spark
Issue Type: Improvement
Components: Kubernetes
Affects Versions: 4.1.1
Reporter: Dejiu Lu
Right now, `ExecutorPodsAllocator` creates pods one by one — it sends a create
request to the K8s API, waits for the response, then moves on to the next one.
This works fine for small jobs, but becomes a real bottleneck when you need
thousands of executors.
In our production environment, it takes 3412s to request 10,000 executors in
Kubernetes mode, but only 34s in YARN mode. We achieved a significant speedup
by changing the executor request from serial to parallel.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]