[
https://issues.apache.org/jira/browse/ARROW-17216?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17575459#comment-17575459
]
Aldrin Montana commented on ARROW-17216:
----------------------------------------
Okay, revisiting this: I commented on ARROW-17257 about how ARROW-8991 is
related. For now, _KeyColumnArray_ is essentially a flattened array. In
ARROW-8991, I will have code for flattening arrays so that they can be hashed.
I think once that is done, perhaps it can be used to support this.
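
To illustrate what "flattened" means for a list column: Arrow's list layout
stores one contiguous values buffer plus an offsets buffer that marks where
each row starts and ends. The following is a minimal pure-Python sketch of
that idea, not the Arrow C++ code and not the ARROW-8991 implementation. The
function names {{flatten_list_column}} and {{hash_rows}} are hypothetical, and
nulls are ignored for brevity.

```python
from typing import List, Sequence, Tuple


def flatten_list_column(
    rows: Sequence[Sequence[float]],
) -> Tuple[List[int], List[float]]:
    """Flatten a column of lists into (offsets, values).

    Row i occupies values[offsets[i]:offsets[i + 1]], which mirrors the
    offsets-plus-values layout Arrow uses for list arrays.
    """
    offsets = [0]
    values: List[float] = []
    for row in rows:
        values.extend(row)
        offsets.append(len(values))
    return offsets, values


def hash_rows(offsets: Sequence[int], values: Sequence[float]) -> List[int]:
    """Compute one hash per row directly from the flattened buffers."""
    return [
        hash(tuple(values[offsets[i]:offsets[i + 1]]))
        for i in range(len(offsets) - 1)
    ]


offsets, values = flatten_list_column([[1.0, 2.0], [], [3.0]])
row_hashes = hash_rows(offsets, values)
```

Once a list column is expressed as flat buffers like these, each row can be
hashed or copied as a plain slice, which is roughly the property a join needs
from its columns.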
> [C++] Support joining tables with non-key fields as list
> --------------------------------------------------------
>
> Key: ARROW-17216
> URL: https://issues.apache.org/jira/browse/ARROW-17216
> Project: Apache Arrow
> Issue Type: Improvement
> Components: C++
> Reporter: Jayjeet Chakraborty
> Priority: Major
> Labels: query-engine
>
> I am trying to join 2 Arrow tables where some columns are of {{list<float>}}
> data type. Note that my join columns/keys are primitive data types and some
> of my non-join columns are of {{list<float>}}. But PyArrow {{join()}}
> cannot join such a table, although pandas can. It says
> {{ArrowInvalid: Data type list<item: float> is not supported in join non-key
> field}}
> when I execute this piece of code
> {{joined_table = table_1.join(table_2, ['k1', 'k2', 'k3'])}}
> A
> [Stack Overflow|https://stackoverflow.com/questions/73071105/listitem-float-not-supported-in-join-non-key-field]
> response pointed out that Arrow currently cannot handle non-fixed types for
> joins. Can this be fixed? Or is this intentional?
--
This message was sent by Atlassian Jira
(v8.20.10#820010)