Micah Kornfield created PARQUET-2067:
----------------------------------------
Summary: [C++] null_count and num_nulls incorrect for repeated
columns
Key: PARQUET-2067
URL: https://issues.apache.org/jira/browse/PARQUET-2067
Project: Parquet
Issue Type: Bug
Components: parquet-cpp
Reporter: Micah Kornfield
Currently only nulls at the leaf are accounted for in the null count statstics.
For nested lists this is incorrect because null lists have zero elements and
don't show up in the leaf.
Example from mailing list discussion
[[0, 1], None, [2, None, 3]]
should have a null count of 2 (it currently reports as 1).
--
This message was sent by Atlassian Jira
(v8.3.4#803005)