Petri created SPARK-37537:
-----------------------------
Summary: Spark 3.2.0 driver pod does not mount checkpoint
filesystem from Kubernetes PVC
Key: SPARK-37537
URL: https://issues.apache.org/jira/browse/SPARK-37537
Project: Spark
Issue Type: Bug
Components: Spark Submit
Affects Versions: 3.2.0
Reporter: Petri
I have Spark 3.2.0 driver executing in Kubernetes pod in client mode and
following configs has been defined in spark-submit:
{code:java}
--deploy-mode client
--conf
spark.kubernetes.driver.volumes.persistentVolumeClaim.glustervol.mount.path=/mnt/distributedDisk
--conf
spark.kubernetes.driver.volumes.persistentVolumeClaim.glustervol.readOnly=false
--conf
spark.kubernetes.driver.volumes.persistentVolumeClaim.glustervol.options.claimName=lolastreamingapp-conf
spark.kubernetes.executor.volumes.persistentVolumeClaim.glustervol.mount.path=/mnt/distributedDisk
--conf
spark.kubernetes.executor.volumes.persistentVolumeClaim.glustervol.readOnly=false
--conf
spark.kubernetes.executor.volumes.persistentVolumeClaim.glustervol.options.claimName=lolastreamingapp
{code}
I face a problem when starting the driver pod that it cannot access the
filesystem mounted from GlusterFS PVC. I can see that driver pod has not
mounted the PVC when describing the pod. I can also see that PVC is not mounted
when describing the PVC.
This has been working with Spark version 2.4.x, but not with Spark 3.2.0.
Only notable change we have between using Spark version 2.4.x and 3.2.0 is that
in 2.4.x we used deploy-mode cluster and in 3.2.0 we use deploy-mode client.
Because the filesystem used for checkpointing is not mounted properly, we get
following kind of error in our application:
{code:java}
java.io.FileNotFoundException: File
/mnt/distributedDisk/SE/LolaStreamingApp/1.0.0/1468589949 does not exist
at
org.apache.hadoop.fs.RawLocalFileSystem.deprecatedGetFileStatus(RawLocalFileSystem.java:779)
~[hadoop-client-api-3.3.1.jar:?]
at
org.apache.hadoop.fs.RawLocalFileSystem.getFileLinkStatusInternal(RawLocalFileSystem.java:1100)
~[hadoop-client-api-3.3.1.jar:?]
at
org.apache.hadoop.fs.RawLocalFileSystem.getFileStatus(RawLocalFileSystem.java:769)
~[hadoop-client-api-3.3.1.jar:?]
at
org.apache.hadoop.fs.FilterFileSystem.getFileStatus(FilterFileSystem.java:462)
~[hadoop-client-api-3.3.1.jar:?]
at
org.apache.spark.streaming.StreamingContext.checkpoint(StreamingContext.scala:240)
~[spark-streaming_2.12-3.2.0.jar:3.2.0]
at
org.apache.spark.streaming.api.java.JavaStreamingContext.checkpoint(JavaStreamingContext.scala:509)
~[spark-streaming_2.12-3.2.0.jar:3.2.0] {code}
--
This message was sent by Atlassian Jira
(v8.20.1#820001)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]