[GitHub] [zeppelin] dbanda opened a new pull request #3838: add jars to sys.path as done in pyspark shell

GitBox Thu, 02 Jul 2020 12:35:00 -0700


dbanda opened a new pull request #3838:
URL: https://github.com/apache/zeppelin/pull/3838



   ### What is this PR for?
   A few sentences describing the overall goals of the pull request's commits.
   First time? Check out the contributing guide - 
https://zeppelin.apache.org/contribution/contributions.html
   
   Some jars include python code. These must be added to sys.path to support 
importing those packages in python.
   
   pyspark include these jars to the path at context initialization. From the 
spark repo see:
   
   * core/src/main/scala/org/apache/spark/deploy/SparkSubmit.scala
   * python/pyspark/context.py
   * python/pyspark/shell.py
   
   In pyspark, the jars passed with "–packages" are passed onto 
"spark.submit.pyFiles" (in prepareSubmitEnvironment function) and then added to 
sys.path by the context initialization.
   
   ### What type of PR is it?
   [Bug Fix | Improvement | Feature | Documentation | Hot Fix | Refactoring]
   Bug Fix
   
   ### Todos
   * [ ] - Task
   
   ### What is the Jira issue?
   * Open an issue on Jira https://issues.apache.org/jira/browse/ZEPPELIN/
   * Put link here, and add [ZEPPELIN-*Jira number*] in PR title, eg. 
[ZEPPELIN-533]
   
   https://issues.apache.org/jira/browse/ZEPPELIN-4934
   
   ### How should this be tested?
   * First time? Setup Travis CI as described on 
https://zeppelin.apache.org/contribution/contributions.html#continuous-integration
   * Strongly recommended: add automated unit tests for any new or changed 
behavior
   * Outline any manual steps to test the PR here.
   
   ### Screenshots (if appropriate)
   
   ### Questions:
   * Does the licenses files need update?
   * Is there breaking changes for older versions?
   * Does this needs documentation?
   


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
[email protected]

[GitHub] [zeppelin] dbanda opened a new pull request #3838: add jars to sys.path as done in pyspark shell

Reply via email to