[ 
https://issues.apache.org/jira/browse/PARQUET-895?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Wes McKinney reassigned PARQUET-895:
------------------------------------

    Assignee: Marc Vertes

> Reading of nested columns is broken
> -----------------------------------
>
>                 Key: PARQUET-895
>                 URL: https://issues.apache.org/jira/browse/PARQUET-895
>             Project: Parquet
>          Issue Type: Bug
>          Components: parquet-cpp
>            Reporter: Marc Vertes
>            Assignee: Marc Vertes
>
> The problem occurs when reading a nested column with repeated values,
> especially when that column has many more levels than the total number of rows.
> Quoting @peshopetrov, who filed a GitHub pull request identifying the problem
> and proposing a fix:
> The value count for nested repeated columns is incorrectly read from the row
> group's metadata. That is correct when there are no nested repeated fields,
> but not in general. Instead, the num_values from the column's metadata
> should be used.
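
The mismatch described above can be illustrated with a small, self-contained sketch. It does not use the parquet-cpp API; the function name shred_repeated and the sample data are hypothetical, and the sketch assumes a single top-level repeated integer field (maximum repetition level 1) where an empty list still produces one level entry.

```python
from typing import List, Tuple


def shred_repeated(rows: List[List[int]]) -> Tuple[List[int], List[int]]:
    """Produce repetition and definition levels for a repeated int column.

    Hypothetical helper for illustration only: each row is a list of ints.
    The first entry of a row gets repetition level 0, later entries get 1.
    An empty row still emits one entry (definition level 0, no value).
    """
    rep_levels: List[int] = []
    def_levels: List[int] = []
    for row in rows:
        if not row:
            # Empty list: one level entry marks the row, but no value is defined.
            rep_levels.append(0)
            def_levels.append(0)
            continue
        for i, _ in enumerate(row):
            rep_levels.append(0 if i == 0 else 1)
            def_levels.append(1)
    return rep_levels, def_levels


rows = [[1, 2, 3], [], [4, 5]]
rep_levels, def_levels = shred_repeated(rows)

num_rows = len(rows)          # what the row group's metadata would report
num_values = len(rep_levels)  # level entries stored in the column chunk

# A reader that sizes its read by num_rows stops after 3 level entries and
# therefore sees only the first row; sizing by num_values covers all 6.
print(num_rows, num_values)
```

In this sketch the row count is 3 while the column holds 6 level entries, so a reader that uses the row count as the number of levels to read would stop partway through the column.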



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)
