[
https://issues.apache.org/jira/browse/SPARK-21187?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16474664#comment-16474664
]
Bryan Cutler commented on SPARK-21187:
--------------------------------------
Hi [~ewohlstadter], thanks for the interest! The Map type needs some work to
be done in Arrow to be fully supported, and then it can implemented for Spark.
We are making this a requirement for Arrow 1.0, if not before then. As for the
interval type, I don't believe it is an external type for Spark SQL so it
wasn't planned.
> Complete support for remaining Spark data types in Arrow Converters
> -------------------------------------------------------------------
>
> Key: SPARK-21187
> URL: https://issues.apache.org/jira/browse/SPARK-21187
> Project: Spark
> Issue Type: Umbrella
> Components: PySpark, SQL
> Affects Versions: 2.3.0
> Reporter: Bryan Cutler
> Assignee: Bryan Cutler
> Priority: Major
>
> This is to track adding the remaining type support in Arrow Converters.
> Currently, only primitive data types are supported. '
> Remaining types:
> * -*Date*-
> * -*Timestamp*-
> * *Complex*: Struct, -Array-, Arrays of Date/Timestamps, Map
> * -*Decimal*-
> * *Binary* - in pyspark
> Some things to do before closing this out:
> * -Look to upgrading to Arrow 0.7 for better Decimal support (can now write
> values as BigDecimal)-
> * -Need to add some user docs-
> * -Make sure Python tests are thorough-
> * Check into complex type support mentioned in comments by [~leif], should
> we support mulit-indexing?
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]