Nicholas Chammas created SPARK-33017:
----------------------------------------
Summary: PySpark Context should have getCheckpointDir() method
Key: SPARK-33017
URL: https://issues.apache.org/jira/browse/SPARK-33017
Project: Spark
Issue Type: Improvement
Components: PySpark
Affects Versions: 3.1.0
Reporter: Nicholas Chammas
To match the Scala API, PySpark should offer a direct way to get the checkpoint
dir.
{code:scala}
scala> spark.sparkContext.setCheckpointDir("/tmp/spark/checkpoint")
scala> spark.sparkContext.getCheckpointDir
res3: Option[String] =
Some(file:/tmp/spark/checkpoint/34ebe699-bc83-4c5d-bfa2-50451296cf87)
{code}
Currently, the only was to do that from PySpark is via the underlying Java
context:
{code:python}
>>> spark.sparkContext.setCheckpointDir('/tmp/spark/checkpoint/')
>>> sc._jsc.sc().getCheckpointDir().get()
'file:/tmp/spark/checkpoint/ebf0fab5-edbc-42c2-938f-65d5e599cf54'
{code}
--
This message was sent by Atlassian Jira
(v8.3.4#803005)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]