Sandy Ryza created SPARK-52853:
----------------------------------

             Summary: Prevent setting Spark confs on the session within 
Pipelines Python definition files
                 Key: SPARK-52853
                 URL: https://issues.apache.org/jira/browse/SPARK-52853
             Project: Spark
          Issue Type: Improvement
          Components: Declarative Pipelines
    Affects Versions: 4.1.0
            Reporter: Sandy Ryza


Setting spark session confs is an imperative construct that can cause friction 
and unexpected behavior from within a pipeline declaration. E.g. it makes 
pipeline behavior sensitive to the order that Python files are imported in, 
which can be unpredictable. There are already existing mechanisms for setting 
Spark confs for pipelines:
 * The configuration setting in the pipeline spec
 * The `spark_conf` arguments on various Python decorators

Raising an error when someone tries to invoke to set a conf in this situation 
would avoid this unpredictable behavior.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org

Reply via email to