Li created PARQUET-792:
--------------------------
Summary: Skip the storage of repetition level and definition level
for all-null column
Key: PARQUET-792
URL: https://issues.apache.org/jira/browse/PARQUET-792
Project: Parquet
Issue Type: Improvement
Components: parquet-mr
Reporter: Li
Priority: Minor
I have a very sparse protobuf message in my project, with thousands of fields.
In practice, most of these fields contain only null values within any given page.
Even so, the repetition and definition levels for those columns take up a lot of storage space.
Could Parquet skip storing the repetition and definition levels for such all-null columns
to save storage space?
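
To illustrate the request, here is a minimal sketch in Python, not parquet-mr code. It assumes a flat optional column, so the maximum definition level is 1 and a null is recorded as definition level 0. The function names and the run-length encoding are hypothetical simplifications; Parquet's real level encoding is an RLE/bit-packing hybrid and is not reproduced here.

```python
from typing import List, Optional, Sequence, Tuple

# Assumption: a flat, optional (nullable) column, so the max definition level is 1.
MAX_DEFINITION_LEVEL = 1


def definition_levels(values: Sequence[Optional[object]]) -> List[int]:
    """Return one definition level per value: 1 if present, 0 if null."""
    return [0 if v is None else MAX_DEFINITION_LEVEL for v in values]


def is_all_null_page(levels: Sequence[int]) -> bool:
    """A non-empty page is all-null when no level reaches the maximum."""
    return len(levels) > 0 and all(lv < MAX_DEFINITION_LEVEL for lv in levels)


def run_length_encode(levels: Sequence[int]) -> List[Tuple[int, int]]:
    """Simplified run-length encoding as (level, run_length) pairs.

    This is only an illustration, not Parquet's actual level encoding.
    """
    runs: List[Tuple[int, int]] = []
    for level in levels:
        if runs and runs[-1][0] == level:
            # Extend the current run.
            runs[-1] = (level, runs[-1][1] + 1)
        else:
            # Start a new run.
            runs.append((level, 1))
    return runs


# Hypothetical page of 1000 values, all null.
page = [None] * 1000
levels = definition_levels(page)
all_null = is_all_null_page(levels)
runs = run_length_encode(levels)
```

For the all-null page above, every definition level is 0, so the levels carry no information beyond the value count. A writer that detected this case could, in principle, omit the level data for that page, which is what this issue asks for.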