Jayjeet Chakraborty created ARROW-17216:
-------------------------------------------

             Summary: Support joining tables with non-key fields as list
                 Key: ARROW-17216
                 URL: https://issues.apache.org/jira/browse/ARROW-17216
             Project: Apache Arrow
          Issue Type: Improvement
          Components: C++
            Reporter: Jayjeet Chakraborty


I am trying to join 2 Arrow tables where some columns are of {{list<float>}} 
data type. Note that my join columns/keys are primitive data types and some my 
non-join columns/keys are of {{{}list<float>{}}}. But, PyArrow {{join()}} 
cannot join such as table, although pandas can. It says

{{{{ArrowInvalid: Data type list<item: float> is not supported in join non-key 
field}}}}

when I execute this piece of code

{{joined_table = table_1.join(table_2, ['k1', 'k2', 'k3'])}}

A 
[stackoverflow|https://stackoverflow.com/questions/73071105/listitem-float-not-supported-in-join-non-key-field]
 response pointed out that Arrow currently cannot handle non-fixed types for 
joins. Can this be fixed ? Or is this intentional ?



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to