[ 
https://issues.apache.org/jira/browse/ARROW-5931?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17394070#comment-17394070
 ] 

Antoine Pitrou edited comment on ARROW-5931 at 8/5/21, 2:38 PM:
----------------------------------------------------------------

I think it would be rather weird to store a sequence of {{PyObject*}} as an 
extension array. The underlying storage data would have no well-defined meaning 
as far as Arrow is concerned. Extra care would also be needed when deallocating 
the array.

I think we should reserve extension types for free-standing data, not handles 
or references to non-Arrow resources.


was (Author: pitrou):
I think it would be rather weird to store a sequence of {{PyObject*}} as an 
extension array. The underlying storage data would have absolutely no 
well-defined meaning. Extra care would also be needed when deallocating the 
array.

I think we should reserve extension types for free-standing data, not handles 
or references to non-Arrow resources.

> [C++] Extend extension types facility to provide for serialization and 
> deserialization in IPC roundtrips
> --------------------------------------------------------------------------------------------------------
>
>                 Key: ARROW-5931
>                 URL: https://issues.apache.org/jira/browse/ARROW-5931
>             Project: Apache Arrow
>          Issue Type: Improvement
>          Components: C++
>            Reporter: Wes McKinney
>            Priority: Major
>
> A use case here is when an array needs to reference some external data. For 
> example, suppose that we wanted to implement an array that references a 
> sequence of Python objects as {{PyObject*}}. Obviously, a {{PyObject*}} must 
> be managed by the Python interpreter.
> For a vector of some {{T*}} to be sent through the IPC machinery, it must be 
> embedded in some Arrow type on the wire. For example, the memory resident 
> version of {{PyObject*}} might be 8-bytes per value (1 pointer per value) 
> while being serialized to the binary IPC protocol, such {{PyObject*}} values 
> must be serialized into an Arrow Binary type.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to