[ 
https://issues.apache.org/jira/browse/PARQUET-2066?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Gang Wu resolved PARQUET-2066.
------------------------------
      Assignee: Gang Wu
    Resolution: Fixed

This is fixed: [GH-34086: [C++][Parquet] Fix writing num_rows to data page v2 
by wgtmac · Pull Request #34096 · apache/arrow 
(github.com)|https://github.com/apache/arrow/pull/34096]

> [C++][Parquet] num_rows is incorrect for nested types
> -----------------------------------------------------
>
>                 Key: PARQUET-2066
>                 URL: https://issues.apache.org/jira/browse/PARQUET-2066
>             Project: Parquet
>          Issue Type: Bug
>          Components: parquet-cpp
>            Reporter: Jorge Leitão
>            Assignee: Gang Wu
>            Priority: Major
>
> Data pages v2 have:
> * num_rows
> * num_values
> we write num_rows equal to the num_values. However, they represent different 
> aspects.
> Given a list such as "[[0, 1], None, [2, None, 3]]", num_rows = 3 and 
> num_values = 6. We currently report 6 in both instances.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to