[
https://issues.apache.org/jira/browse/ARROW-1614?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17100822#comment-17100822
]
Joris Van den Bossche commented on ARROW-1614:
----------------------------------------------
[~chrish42] for experimenting, you could also first implement it in Python (to
have a prototype, to experiment with and discuss how to store metadata, etc,
that might be easier to play with).
For Python, we have docs here:
https://arrow.apache.org/docs/python/extending_types.html#defining-extension-types-user-defined-types
(and there are also examples in the pandas codebase (for intervals and period
dtypes):
https://github.com/pandas-dev/pandas/blob/master/pandas/core/arrays/_arrow_utils.py,
or also in the pyarrow tests:
https://github.com/apache/arrow/blob/master/python/pyarrow/tests/test_extension_type.py)
For implementing it in C++, probably best source are the test extension types:
https://github.com/apache/arrow/blob/master/cpp/src/arrow/testing/extension_type.h
and
https://github.com/apache/arrow/blob/master/cpp/src/arrow/extension_type_test.cc
> [C++] Add a Tensor logical value type with constant dimensions, implemented
> using ExtensionType
> -----------------------------------------------------------------------------------------------
>
> Key: ARROW-1614
> URL: https://issues.apache.org/jira/browse/ARROW-1614
> Project: Apache Arrow
> Issue Type: New Feature
> Components: C++, Format
> Reporter: Wes McKinney
> Priority: Major
>
> In an Arrow table, conceivably a column could have values cells each
> containing a tensor value of some size (a binary value plus some metadata to
> store type and shape/strides)
--
This message was sent by Atlassian Jira
(v8.3.4#803005)