Joris Van den Bossche created ARROW-14286:
---------------------------------------------

             Summary: [Python][Parquet] Allow to select columns of a list field 
without requiring the list component names
                 Key: ARROW-14286
                 URL: https://issues.apache.org/jira/browse/ARROW-14286
             Project: Apache Arrow
          Issue Type: Improvement
          Components: Python
            Reporter: Joris Van den Bossche


Subtask for ARROW-14196.

Currently, if you have a list column whose elements are themselves nested 
(e.g. a list of structs), selecting a subset of that list column's fields 
requires something like {{columns=["columnA.list.item.subfield"]}}. The 
"list.item" part is superfluous, since a list always contains a single child. 
Ideally, we would allow specifying this as {{columns=["columnA.subfield"]}}. 

This also avoids relying on the exact name of the list item (item vs element), 
for which the default differs between Parquet and Arrow.
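
To make the proposed matching concrete, here is a minimal pure-Python sketch. 
It is not pyarrow code and not the intended implementation: the helper names 
{{strip_list_components}} and {{resolve_columns}} are hypothetical, and the 
leaf paths are hard-coded rather than read from a real Parquet schema. It also 
assumes that no struct field is literally named "list" with a child named 
"item" or "element", which a real implementation would have to check against 
the schema instead of the names.

```python
# Hypothetical helpers for illustration only; they are not part of pyarrow.
LIST_ITEM_NAMES = {"item", "element"}  # both spellings of the list child


def strip_list_components(leaf_path):
    """Drop each "list.<item-name>" pair from a dotted Parquet leaf path."""
    parts = leaf_path.split(".")
    kept = []
    i = 0
    while i < len(parts):
        is_list_wrapper = (
            i > 0  # the first part is always the column name itself
            and parts[i] == "list"
            and i + 1 < len(parts)
            and parts[i + 1] in LIST_ITEM_NAMES
        )
        if is_list_wrapper:
            i += 2  # skip the list wrapper and its single child
        else:
            kept.append(parts[i])
            i += 1
    return ".".join(kept)


def resolve_columns(requested, leaf_paths):
    """Return the full leaf paths selected by the requested column names."""
    selected = []
    for leaf in leaf_paths:
        short = strip_list_components(leaf)
        for name in requested:
            # Accept the exact field or anything nested below it, spelled
            # either with or without the list components.
            if any(p == name or p.startswith(name + ".") for p in (leaf, short)):
                selected.append(leaf)
                break
    return selected


leaves = [
    "columnA.list.item.subfield",
    "columnA.list.item.other",
    "columnB",
]
print(resolve_columns(["columnA.subfield"], leaves))
```

With this kind of matching, {{"columnA.subfield"}} selects the same leaf 
whether the file spells the list child as "item" or "element", and the longer 
{{"columnA.list.item.subfield"}} spelling keeps working.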



