GitHub user BryanCutler opened a pull request:
https://github.com/apache/spark/pull/15028
[SPARK-17336][PYSPARK] Fix appending multiple times to PYTHONPATH from
spark-config.sh
## What changes were proposed in this pull request?
During startup of Spark standalone, the script file spark-config.sh appends
to the PYTHONPATH and can be sourced many times, causing duplicates in the
path. This change adds a env flag that is set once the PYTHONPATH is appended
so it will happen once.
## How was this patch tested?
Manually started standalone master/worker and verified PYTHONPATH has no
duplicate entries.
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/BryanCutler/spark
fix-duplicate-pythonpath-SPARK-17336
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/spark/pull/15028.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #15028
----
commit 180a17f2ba0ac35ec7794d1dd681ec6396da803d
Author: Bryan Cutler <[email protected]>
Date: 2016-09-09T17:32:43Z
fix appending multiple times to PYTHONPATH when spark-config is sourced
more than once
----
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at [email protected] or file a JIRA ticket
with INFRA.
---
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]