[ 
https://issues.apache.org/jira/browse/ARROW-1614?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17210489#comment-17210489
 ] 

Bryan Cutler commented on ARROW-1614:
-------------------------------------

I just wanted to let you all know that I have been working on a similar tensor 
extension type. I currently have a pandas extension type for tensors that 
converts to and from an Arrow extension type (Python/PyArrow only for now) and 
supports zero-copy conversion with numpy.ndarray. It's part of the project [Text 
Extensions for Pandas|https://github.com/CODAIT/text-extensions-for-pandas], 
where we use it for NLP feature vectors, but it's really general purpose. You 
can check it out at

[https://github.com/CODAIT/text-extensions-for-pandas/blob/master/text_extensions_for_pandas/array/tensor.py]
 
[https://github.com/CODAIT/text-extensions-for-pandas/blob/master/text_extensions_for_pandas/array/arrow_conversion.py]
Or, if you like, install the package via {{pip install text-extensions-for-pandas}} (it's currently in alpha).

We would love to help out with this effort and contribute what we have to 
Arrow, if it fits the bill!

> [C++] Add a Tensor logical value type with constant dimensions, implemented 
> using ExtensionType
> -----------------------------------------------------------------------------------------------
>
>                 Key: ARROW-1614
>                 URL: https://issues.apache.org/jira/browse/ARROW-1614
>             Project: Apache Arrow
>          Issue Type: New Feature
>          Components: C++, Format
>            Reporter: Wes McKinney
>            Priority: Major
>
> In an Arrow table, we would like to add support for a column whose cells 
> each contain a tensor value, with all tensors having the same dimensions. 
> These would be stored as binary values, plus some metadata to store type 
> and shape/strides.
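
The storage idea in the issue description can be illustrated with NumPy alone. This is only a sketch under assumptions: pack_tensor_column and unpack_tensor are hypothetical helpers, the metadata is encoded as JSON purely for readability, and only dtype and shape are recorded because every tensor is written in C-contiguous order (so strides are implied).

```python
import json

import numpy as np


def pack_tensor_column(tensors):
    """Turn same-shaped tensors into binary values plus shared metadata."""
    first = np.asarray(tensors[0])
    metadata = json.dumps({"dtype": first.dtype.str, "shape": list(first.shape)})
    values = []
    for tensor in tensors:
        tensor = np.asarray(tensor)
        if tensor.shape != first.shape or tensor.dtype != first.dtype:
            raise ValueError("all tensors in the column must share dtype and shape")
        # tobytes() writes the elements in C order, one binary value per cell.
        values.append(tensor.tobytes())
    return values, metadata


def unpack_tensor(value, metadata):
    """Rebuild one tensor from its binary value without copying the bytes."""
    meta = json.loads(metadata)
    flat = np.frombuffer(value, dtype=np.dtype(meta["dtype"]))
    return flat.reshape(meta["shape"])
```

Because np.frombuffer wraps the existing bytes, the restored array is a read-only view of the stored value rather than a copy.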



--
This message was sent by Atlassian Jira
(v8.3.4#803005)
