Hi,

I have a parquet dataset containing "order" structs each of which has a
list of "item" structs.  I would like to read a subset of the item structs.
e.g.

order_id: int64

...other fields...

items: list<item: struct<item_id: int64, price: int64, ...other fields...>>


# is this/will this be possible?

dataset.to_table(columns=["order_id", "items.item_id", items.price"])


I guess they'd be lists of scalars rather than a list of structs with fewer
fields?

I couldn't see any reference to *lists* in
https://github.com/apache/arrow/pull/11466.

Is this possible or planned?  Is there another way to achieve this?

Thanks in advance,

Tim

Reply via email to