Christian Kadner created BAHIR-35:
-------------------------------------

             Summary: Include Python code in the binary jars for use with 
"--packages ..."
                 Key: BAHIR-35
                 URL: https://issues.apache.org/jira/browse/BAHIR-35
             Project: Bahir
          Issue Type: Task
          Components: Build
    Affects Versions: 2.0.0
            Reporter: Christian Kadner


Currently, to make use the PySpark code (i.e streaming-mqtt/python) a user will 
have to download the jar from Maven central or clone the code from GitHub and 
then have to find individual *.py files, create a zip and add that to the 
{{spark-submit}} command with the {{--py-files}} option, or, add them to the 
{{PYTHONPATH}} when running locally.

If we include the Python code in the binary build (to the jar that gets 
uploaded to Maven central), then users need not do any acrobatics besides using 
the {{--packages ...}} option.

An example where the Python code is part of the binary jar is the 
[GraphFrames|https://spark-packages.org/package/graphframes/graphframes] 
package.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to