[
https://issues.apache.org/jira/browse/ARROW-8138?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17062935#comment-17062935
]
Feng Tian commented on ARROW-8138:
----------------------------------
I attached a quick repro – bug.parquet is a data file with multiple row groups,
each row is a int, float pair. bug.cpp should repro.
As a side notes – I generally follow the cpp examples, but seems none of the
parquet examples cover the case of multiple rowgroups.
> parquet::arrow::FileReader cannot read multiple RowGroup
> --------------------------------------------------------
>
> Key: ARROW-8138
> URL: https://issues.apache.org/jira/browse/ARROW-8138
> Project: Apache Arrow
> Issue Type: Bug
> Components: C++
> Affects Versions: 0.16.0
> Environment: Centos 7
> Reporter: Feng Tian
> Priority: Major
> Attachments: bug.cpp, bug.parquet
>
>
> When use parquet::arrow::FileReader to read parquet file consisting multiple
> row groups,
> {code:c++}
> reader->RowGroup(i)->Column(c)->Read
> {code}
> It will repeated read data of the first rowgroup.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)