Alex Khakhlyuk created SPARK-55071:
--------------------------------------

             Summary: Make spark.addArtifact work with Windows paths
                 Key: SPARK-55071
                 URL: https://issues.apache.org/jira/browse/SPARK-55071
             Project: Spark
          Issue Type: Bug
          Components: Connect, PySpark
    Affects Versions: 4.1.1
            Reporter: Alex Khakhlyuk


Currently, `spark.addArtifact` in pyspark connect does not support absolute 
Windows paths.

E.g. this code

```

spark.addArtifact("C:\\Users\\alex.khakhlyuk\\hey.py", pyfile=True)

```

will result in the following error

```

PySparkRuntimeError: [UNSUPPORTED_OPERATION] c scheme is not supported.

```

This error is caused by `urlparse` function in 
[artifact.py.|https://github.com/apache/spark/blob/ac13473fff64919e8e7756e3a42ce3a68627dd73/python/pyspark/sql/connect/client/artifact.py#L188]
 It incorrectly interprets local Windows path, e.g. `C:\path\to\file` as a URI 
with 'C' scheme and throws an error because this URI scheme is not known and 
not supported.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to