[ 
https://issues.apache.org/jira/browse/SPARK-6883?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15864309#comment-15864309
 ] 

holdenk commented on SPARK-6883:
--------------------------------

Let's consider re-opening this for discussion - do we maybe want to just depend 
on cloudpickle now that we have a pip installable PySpark we could simple add 
it to the `install_requires` and add a `requirements.txt` file for people who 
prefer to install PySpark the old school way.

What do [~joshrosen] & [~davies] think about this?

> Fork pyspark's cloudpickle as a separate dependency
> ---------------------------------------------------
>
>                 Key: SPARK-6883
>                 URL: https://issues.apache.org/jira/browse/SPARK-6883
>             Project: Spark
>          Issue Type: Improvement
>          Components: PySpark
>            Reporter: Kyle Kelley
>              Labels: fork
>
> IPython, pyspark, picloud/multyvac/cloudpipe all rely on cloudpickle from 
> various sources (cloud, pyspark, and multyvac correspondingly). It would be 
> great to have this as a separately maintained project that can:
> * Work with Python3
> * Add tests!
> * Use higher order pickling (when on Python3)
> * Be installed with pip
> We're starting this off at the PyCon sprints under 
> https://github.com/cloudpipe/cloudpickle. We'd like to coordinate with 
> PySpark to make it work across all the above mentioned projects.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to