Nick DiQuattro created ARROW-9676:
-------------------------------------
Summary: Import structs as lists
Key: ARROW-9676
URL: https://issues.apache.org/jira/browse/ARROW-9676
Project: Apache Arrow
Issue Type: Wish
Components: R
Affects Versions: 1.0.0
Environment: Amazon Linux, 32gb of ram
Reporter: Nick DiQuattro
When trying to collect data from a dataset based on parquet files with nested
structs (column is a struct with 2 structs nested) of moderate size (1Mish
rows), R crashes. If I add a filter to reduce the number of rows, the data is
parsed. If I select out the struct column, it works great (up to 21M rows). My
hunch is the structs resulting in data.frame columns may be the issue. I am
curious if there's a way to have arrow import structs as lists instead of
data.frames. Thanks for the direction to hereĀ [~neilr8133]!
--
This message was sent by Atlassian Jira
(v8.3.4#803005)