[ 
https://issues.apache.org/jira/browse/SPARK-22216?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Li Jin updated SPARK-22216:
---------------------------
    Description: This is an umbrella ticket tracking the general effort to 
improve performance and interoperability between PySpark and Pandas. The core 
idea is to use Apache Arrow as the serialization format to reduce the overhead between 
PySpark and Pandas.  (was: This is an umbrella ticket tracking the general 
effect of improving performance and interoperability between PySpark and 
Pandas. The core idea is to Apache Arrow as serialization format to reduce the 
overhead between PySpark and Pandas.)

> Improving PySpark/Pandas interoperability
> -----------------------------------------
>
>                 Key: SPARK-22216
>                 URL: https://issues.apache.org/jira/browse/SPARK-22216
>             Project: Spark
>          Issue Type: Epic
>          Components: PySpark
>    Affects Versions: 2.2.0
>            Reporter: Li Jin
>            Assignee: Li Jin
>            Priority: Major
>
> This is an umbrella ticket tracking the general effort to improve performance 
> and interoperability between PySpark and Pandas. The core idea is to use Apache 
> Arrow as the serialization format to reduce the overhead between PySpark and 
> Pandas.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

