[ 
https://issues.apache.org/jira/browse/ARROW-840?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16861428#comment-16861428
 ] 

Wes McKinney commented on ARROW-840:
------------------------------------

[~jorisvandenbossche] [~pitrou] what do you think is the path ahead for this 
project? There are two distinct areas of work:

* Defining extension types in Python and exposing types already defined in 
C++ to Python
* Bridging between extension types (which may be C++ or Python-defined) and 
pandas

ARROW-840 covers the former but not the latter. It would be nice to have 
this feature available in 0.14.0, but given today's date (June 11) I'm 
concerned that we may miss the release window.

> [Python] Provide Python API for creating user-defined data types that can 
> survive Arrow IPC
> -------------------------------------------------------------------------------------------
>
>                 Key: ARROW-840
>                 URL: https://issues.apache.org/jira/browse/ARROW-840
>             Project: Apache Arrow
>          Issue Type: New Feature
>          Components: Python
>            Reporter: Wes McKinney
>            Priority: Major
>             Fix For: 0.14.0
>
>
> The user will provide:
> * Data type subclass that can indicate the physical storage type
> * "get state" and "set state" functions for serializing custom metadata to 
> bytes
> * An optional function for "boxing" scalar values from the physical array 
> storage
> Internally, this will build on an analogous C++ API for defining user data 
> types
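
As a rough illustration of the three pieces listed in the description above, 
here is a minimal pure-Python sketch. It is not the pyarrow API: the names 
ExtensionTypeSketch, UuidType, get_state, set_state and box are hypothetical, 
the storage type is just a string label, and JSON is only one assumed choice 
for encoding the metadata bytes.

```python
import json
import uuid


class ExtensionTypeSketch:
    """Hypothetical base class: a logical type layered on a physical storage type."""

    def __init__(self, storage_type):
        # Label for the physical storage type (a plain string in this sketch).
        self.storage_type = storage_type

    def get_state(self):
        """'Get state': serialize custom metadata to bytes."""
        return b""

    @classmethod
    def set_state(cls, storage_type, serialized):
        """'Set state': rebuild the type from its storage type and metadata bytes."""
        raise NotImplementedError

    def box(self, raw):
        """Optional 'boxing': wrap a raw storage value in a richer scalar."""
        return raw


class UuidType(ExtensionTypeSketch):
    """Hypothetical user-defined type storing UUIDs as 16-byte binary values."""

    STORAGE = "fixed_size_binary[16]"

    def __init__(self, version=4):
        super().__init__(self.STORAGE)
        self.version = version

    def get_state(self):
        # Assumption: metadata is encoded as UTF-8 JSON.
        return json.dumps({"version": self.version}).encode("utf-8")

    @classmethod
    def set_state(cls, storage_type, serialized):
        if storage_type != cls.STORAGE:
            raise ValueError("unexpected storage type: %r" % (storage_type,))
        return cls(**json.loads(serialized.decode("utf-8")))

    def box(self, raw):
        # Turn the 16 raw bytes from storage into a uuid.UUID scalar.
        return uuid.UUID(bytes=raw)


# Round trip: only the storage type and the metadata bytes need to travel.
original = UuidType(version=4)
payload = (original.storage_type, original.get_state())
restored = UuidType.set_state(*payload)
value = restored.box(bytes(range(16)))
```

The round trip at the end mirrors what "surviving IPC" would require in this 
sketch: the receiving side reconstructs the type from the storage type plus 
the serialized metadata alone, and boxing is applied only when individual 
values are read back.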



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
