Hyukjin Kwon created SPARK-22979:
------------------------------------
Summary: Avoid per-record type dispatch in Python data conversion
(EvaluatePython.fromJava)
Key: SPARK-22979
URL: https://issues.apache.org/jira/browse/SPARK-22979
Project: Spark
Issue Type: Improvement
Components: PySpark, SQL
Affects Versions: 2.3.0
Reporter: Hyukjin Kwon
Seems we are type dispatching between Java objects (from Pyrolite) to Spark's
internal data format.
See
https://github.com/apache/spark/blob/3f958a99921d149fb9fdf7ba7e78957afdad1405/sql/core/src/main/scala/org/apache/spark/sql/execution/python/EvaluatePython.scala#L89-L162
Looks we can make converters each for each type and then reuse it.
--
This message was sent by Atlassian JIRA
(v6.4.14#64029)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]