Filip Darmanovic created SPARK-55397:
----------------------------------------

             Summary: The waitToReusePersistentVolumeClaim feature assumes only 
one PVC per executor
                 Key: SPARK-55397
                 URL: https://issues.apache.org/jira/browse/SPARK-55397
             Project: Spark
          Issue Type: Improvement
          Components: Kubernetes
    Affects Versions: 3.5.8, 4.1.0, 3.5.2
            Reporter: Filip Darmanovic


Spark supports attaching multiple disks to a worker, which is very useful for 
increasing throughput, however, if the PVC re-use feature is enabled and the 
podAllocOnPVC flag is on, meaning that the driver should wait to re-use PVCs, 
the max number of PVCs is limited to the max number of workers.

So what happens in practice is that if you ask for 4 workers with 4 PVCs each, 
Spark will assign 4 PVCs to the first worker and then stop spawning new workers.



I think we should just keep track of the number of PVCs spawned so far and wait 
for the ones that are not bound.

I'm willing to work on this PR if I can get help with integration tests.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to