[
https://issues.apache.org/jira/browse/PARQUET-2066?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Gang Wu resolved PARQUET-2066.
------------------------------
Assignee: Gang Wu
Resolution: Fixed
This is fixed: [GH-34086: [C++][Parquet] Fix writing num_rows to data page v2
by wgtmac · Pull Request #34096 · apache/arrow
(github.com)|https://github.com/apache/arrow/pull/34096]
> [C++][Parquet] num_rows is incorrect for nested types
> -----------------------------------------------------
>
> Key: PARQUET-2066
> URL: https://issues.apache.org/jira/browse/PARQUET-2066
> Project: Parquet
> Issue Type: Bug
> Components: parquet-cpp
> Reporter: Jorge Leitão
> Assignee: Gang Wu
> Priority: Major
>
> Data pages v2 have:
> * num_rows
> * num_values
> we write num_rows equal to the num_values. However, they represent different
> aspects.
> Given a list such as "[[0, 1], None, [2, None, 3]]", num_rows = 3 and
> num_values = 6. We currently report 6 in both instances.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]