HyukjinKwon commented on pull request #30735: URL: https://github.com/apache/spark/pull/30735#issuecomment-744681441
Oh, actually `PYSPARK_PYTHON` (or `spark.pyspark.python`) is being passed though from driver to executor side via: 1. https://github.com/apache/spark/blob/e2cdfcebd9b39a1104b34d8eafafbcdc6acf5d3e/python/pyspark/context.py#L230 2. https://github.com/apache/spark/blob/3959f0d9879fa7fa9e8f2e8ed8c8b12003d21788/python/pyspark/sql/udf.py#L35 3. https://github.com/apache/spark/blob/485145326a9c97ede260b0e267ee116f182cfd56/core/src/main/scala/org/apache/spark/api/python/PythonRDD.scala#L64-L65 4. https://github.com/apache/spark/blob/3959f0d9879fa7fa9e8f2e8ed8c8b12003d21788/core/src/main/scala/org/apache/spark/api/python/PythonRunner.scala#L92 5. https://github.com/apache/spark/blob/3959f0d9879fa7fa9e8f2e8ed8c8b12003d21788/core/src/main/scala/org/apache/spark/api/python/PythonRunner.scala#L144 I also added a test to make sure this works in K8S here: https://github.com/apache/spark/pull/30735/files#diff-78ba045f393bcf6ffaa3dfe85bc7682cacf0bef69d447a2346e201279cc0bc5bR179-R197 ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
