Hyukjin Kwon created SPARK-33530:
------------------------------------

             Summary: Support --archives option natively
                 Key: SPARK-33530
                 URL: https://issues.apache.org/jira/browse/SPARK-33530
             Project: Spark
          Issue Type: Improvement
          Components: Spark Core
    Affects Versions: 3.1.0
            Reporter: Hyukjin Kwon


Currently, {{spark-submit --archives}} and the {{spark.yarn.dist.archives}}
configuration are supported only in YARN mode:

{code}
spark-submit --help
...
 Spark on YARN only:
  --queue QUEUE_NAME          The YARN queue to submit to (Default: "default").
  --archives ARCHIVES         Comma separated list of archives to be extracted into the
                              working directory of each executor.
{code}

This option is critical for PySpark, because it is the mechanism for shipping
other packages together with an application; see also
https://hyukjin-spark.readthedocs.io/en/stable/user_guide/python_packaging.html#using-zipped-virtual-environment.

Because this feature is missing outside YARN, PySpark cannot use a packed conda
environment to ship other packages together on the other cluster managers.
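
For illustration, here is a minimal sketch of the invocation this option enables
today on YARN, and that the issue proposes to support natively elsewhere. The
helper names ({{build_submit_command}}, {{executor_python}}) and the file names
({{app.py}}, {{pyspark_conda_env.tar.gz}}) are hypothetical. The sketch assumes
the archive was produced beforehand with a tool such as conda-pack. It only
builds and prints the command; it does not run it.

```python
import shlex


def build_submit_command(app, archive, alias="environment", master="yarn"):
    """Return the spark-submit argument list that ships a packed environment.

    The "#alias" suffix names the directory the archive is extracted into
    inside each executor's working directory.
    """
    return [
        "spark-submit",
        "--master", master,
        "--archives", f"{archive}#{alias}",
        app,
    ]


def executor_python(alias="environment"):
    """Path of the Python interpreter inside the extracted archive."""
    return f"./{alias}/bin/python"


if __name__ == "__main__":
    # Hypothetical application and archive names, for illustration only.
    cmd = build_submit_command("app.py", "pyspark_conda_env.tar.gz")
    env_prefix = "PYSPARK_PYTHON=" + shlex.quote(executor_python())
    print(env_prefix, shlex.join(cmd))
```

Setting {{PYSPARK_PYTHON}} to the interpreter inside the extracted directory is
what makes the executors pick up the shipped packages. Without {{--archives}}
support on a given cluster manager, that directory is never created there.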



--
This message was sent by Atlassian Jira
(v8.3.4#803005)
