[ https://issues.apache.org/jira/browse/SPARK-52853?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Sandy Ryza reassigned SPARK-52853:
----------------------------------

    Assignee: Jacky Wang

> Prevent setting Spark confs on the session within Pipelines Python definition files
> -----------------------------------------------------------------------------------
>
>                 Key: SPARK-52853
>                 URL: https://issues.apache.org/jira/browse/SPARK-52853
>             Project: Spark
>          Issue Type: Sub-task
>          Components: Declarative Pipelines
>    Affects Versions: 4.1.0
>            Reporter: Sandy Ryza
>            Assignee: Jacky Wang
>            Priority: Major
>              Labels: pull-request-available
>
> Setting Spark session confs is an imperative construct that can cause
> friction and unexpected behavior from within a pipeline declaration. For
> example, it makes pipeline behavior sensitive to the order in which Python
> files are imported, which can be unpredictable. There are already
> mechanisms for setting Spark confs for pipelines:
> * The configuration setting in the pipeline spec
> * The `spark_conf` arguments on various Python decorators
> Raising an error when someone tries to set a conf in this situation
> would avoid this unpredictable behavior.
>
> The two ways to set confs from Python are:
> * spark.conf.set("key", "value")
> * spark.sql("SET key=value")
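
For illustration only, here is a rough sketch of how the two conf-setting paths named in the issue could be intercepted. It is not the change proposed for Spark. The names `GuardedSession`, `GuardedConf`, and `ConfSetNotAllowedError` are hypothetical, and the wrapped session is assumed to be any object exposing `conf` and `sql`. The SQL check is a simple pattern match on `SET key=value` and would not catch every statement that changes configuration.

```python
import re


class ConfSetNotAllowedError(RuntimeError):
    """Hypothetical error type for this sketch; not a real Spark exception."""


# Matches statements of the form "SET key=value" (case-insensitive).
# A bare "SET" or "SET key" only reads confs, so those are not matched.
_SET_ASSIGNMENT = re.compile(r"^\s*SET\s+[^=\s][^=]*=", re.IGNORECASE)

_MESSAGE = (
    "Setting Spark confs is not allowed inside a pipeline definition file; "
    "use the pipeline spec configuration or a decorator's spark_conf argument."
)


class GuardedConf:
    """Wraps a conf object: reads pass through, writes raise."""

    def __init__(self, conf):
        self._conf = conf

    def get(self, key, default=None):
        return self._conf.get(key, default)

    def set(self, key, value):
        raise ConfSetNotAllowedError(_MESSAGE)


class GuardedSession:
    """Wraps a session-like object and blocks both conf-setting paths."""

    def __init__(self, session):
        self._session = session
        self.conf = GuardedConf(session.conf)

    def sql(self, query):
        # Reject SQL that assigns a conf; forward everything else unchanged.
        if _SET_ASSIGNMENT.match(query):
            raise ConfSetNotAllowedError(_MESSAGE)
        return self._session.sql(query)
```

With a wrapper like this, both `conf.set("key", "value")` and `sql("SET key=value")` would raise, while conf reads and ordinary queries would still reach the underlying session.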