[
https://issues.apache.org/jira/browse/ARROW-840?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16861515#comment-16861515
]
Joris Van den Bossche commented on ARROW-840:
---------------------------------------------
So the first bullet point (enabling "defining extension types in Python)
requires to implement a C++ PythonExtensionType that can translate python
function callbacks to the actual ExtensionType methods?
I looked into that a bit some time ago, and I think it is above my current C++
skill level (at least to start it). [~pitrou] is that something that you might
want to look at?
Once the basics are there, I am very much interested to further help with this
and do further work to enable pandas ExtensionArray interaction.
> [Python] Provide Python API for creating user-defined data types that can
> survive Arrow IPC
> -------------------------------------------------------------------------------------------
>
> Key: ARROW-840
> URL: https://issues.apache.org/jira/browse/ARROW-840
> Project: Apache Arrow
> Issue Type: New Feature
> Components: Python
> Reporter: Wes McKinney
> Priority: Major
> Fix For: 0.14.0
>
>
> The user will provide:
> * Data type subclass that can indicate the physical storage type
> * "get state" and "set state" functions for serializing custom metadata to
> bytes
> * An optional function for "boxing" scalar values from the physical array
> storage
> Internally, this will build on an analogous C++ API for defining user data
> types
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)