[
https://issues.apache.org/jira/browse/ARROW-1599?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17658632#comment-17658632
]
Rok Mihevc commented on ARROW-1599:
-----------------------------------
This issue has been migrated to [issue
#17612|https://github.com/apache/arrow/issues/17612] on GitHub. Please see the
[migration documentation|https://github.com/apache/arrow/issues/14542] for
further details.
> [C++][Parquet] Unable to read Parquet files with list inside struct
> -------------------------------------------------------------------
>
> Key: ARROW-1599
> URL: https://issues.apache.org/jira/browse/ARROW-1599
> Project: Apache Arrow
> Issue Type: Bug
> Components: C++, Python
> Affects Versions: 0.7.0
> Environment: Ubuntu
> Reporter: Jovann Kung
> Assignee: Micah Kornfield
> Priority: Major
> Labels: parquet
>
> Is PyArrow currently unable to read Parquet files in which a column holds a
> vector (here, list fields nested inside a struct)? The schema of one such file is:
> {{<pyarrow._parquet.ParquetSchema object at 0x7f2d42493c88>
> mbc: FLOAT
> deltae: FLOAT
> labels: FLOAT
> features.type: INT32 INT_8
> features.size: INT32
> features.indices.list.element: INT32
> features.values.list.element: DOUBLE}}
> Calling either pq.read_table() or pq.ParquetDataset('/path/to/parquet').read()
> raises the following error:
> ArrowNotImplementedError: Currently only nesting with Lists is supported.
> From the error, I assume this may be implemented in a future release?
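
To make the shape of the schema above easier to see, here is a minimal sketch in plain Python (no PyArrow calls) that folds the dotted leaf-column paths from the report into a nested tree and flags the list fields that sit inside a parent group. The helper names build_tree and lists_inside_structs are hypothetical and exist only for illustration. The detection is a heuristic that assumes the usual "<field>.list.element" list layout and does not try to handle lists nested inside other lists.

```python
# Leaf column paths and physical types, copied from the schema in the report.
LEAF_COLUMNS = {
    "mbc": "FLOAT",
    "deltae": "FLOAT",
    "labels": "FLOAT",
    "features.type": "INT32 INT_8",
    "features.size": "INT32",
    "features.indices.list.element": "INT32",
    "features.values.list.element": "DOUBLE",
}


def build_tree(leaf_columns):
    """Fold dotted leaf paths into a nested dict; leaves hold the type string."""
    tree = {}
    for path, physical_type in leaf_columns.items():
        parts = path.split(".")
        node = tree
        for part in parts[:-1]:
            node = node.setdefault(part, {})
        node[parts[-1]] = physical_type
    return tree


def lists_inside_structs(leaf_columns):
    """Return the list fields that have at least one enclosing group.

    Heuristic (an assumption, not PyArrow's logic): a path ending in
    'list.element' is a list field, and any path component before the
    list field's own name is treated as an enclosing struct.
    """
    found = []
    for path in leaf_columns:
        parts = path.split(".")
        # parts = [<enclosing groups...>, <list field>, "list", "element"]
        if parts[-2:] == ["list", "element"] and len(parts) > 3:
            found.append(".".join(parts[:-2]))
    return found


if __name__ == "__main__":
    print(lists_inside_structs(LEAF_COLUMNS))
    # -> ['features.indices', 'features.values']
```

Under these assumptions, the two flagged fields, features.indices and features.values, are the list-inside-struct columns named in the issue title; the top-level FLOAT columns are not flagged.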
--
This message was sent by Atlassian Jira
(v8.20.10#820010)