[
https://issues.apache.org/jira/browse/SPARK-6883?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15864309#comment-15864309
]
holdenk commented on SPARK-6883:
--------------------------------
Let's consider re-opening this for discussion. Do we maybe want to just depend
on cloudpickle? Now that we have a pip-installable PySpark, we could simply add
it to the `install_requires` and add a `requirements.txt` file for people who
prefer to install PySpark the old-school way.
What do [~joshrosen] & [~davies] think about this?
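
For illustration only, a minimal sketch of what that could look like. The
names `INSTALL_REQUIRES`, `SETUP_KWARGS`, and `requirements_txt` are
hypothetical, the requirement is left unpinned as an assumption, and this is
not PySpark's actual `setup.py`:

```python
# Hypothetical sketch: declare cloudpickle once and reuse the list for both
# the pip metadata and a generated requirements.txt. Not PySpark's real setup.py.

# Assumption: no version pin; a real change would likely pin or bound this.
INSTALL_REQUIRES = ["cloudpickle"]

# Keyword arguments that a setup.py would pass to setuptools.setup(**SETUP_KWARGS).
SETUP_KWARGS = {
    "name": "pyspark",
    "install_requires": INSTALL_REQUIRES,
}


def requirements_txt(requirements):
    """Render the body of a requirements.txt file, one requirement per line."""
    return "\n".join(requirements) + "\n"


if __name__ == "__main__":
    # Print the file body for people installing without pip's metadata.
    print(requirements_txt(INSTALL_REQUIRES), end="")
```

Keeping a single list means the pip install path and the `requirements.txt`
path cannot drift apart.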
> Fork pyspark's cloudpickle as a separate dependency
> ---------------------------------------------------
>
> Key: SPARK-6883
> URL: https://issues.apache.org/jira/browse/SPARK-6883
> Project: Spark
> Issue Type: Improvement
> Components: PySpark
> Reporter: Kyle Kelley
> Labels: fork
>
> IPython, pyspark, and picloud/multyvac/cloudpipe all rely on cloudpickle from
> various sources (cloud, pyspark, and multyvac, respectively). It would be
> great to have this as a separately maintained project that can:
> * Work with Python3
> * Add tests!
> * Use higher order pickling (when on Python3)
> * Be installed with pip
> We're starting this off at the PyCon sprints under
> https://github.com/cloudpipe/cloudpickle. We'd like to coordinate with
> PySpark to make it work across all the above mentioned projects.