Boris Molodenkov created PARQUET-1119:
-----------------------------------------

             Summary: Reduce memory footprint for nulls caching
                 Key: PARQUET-1119
                 URL: https://issues.apache.org/jira/browse/PARQUET-1119
             Project: Parquet
          Issue Type: Improvement
          Components: parquet-mr
    Affects Versions: 1.9.0
            Reporter: Boris Molodenkov


Scenario:
There is a schema with many optional groups, e.g.
{noformat}
message example {
  required binary id (UTF8);
  optional group a1 (LIST) {
    repeated int64 array;
  }
  optional group a2 (LIST) {
    repeated int64 array;
  }
...
  optional group aN (LIST) {
    repeated int64 array;
  }
}
{noformat}

Many records are written with none of the optional groups set.
In this case groupNullCache will contain many elements, all of which are zeros.
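
One possible direction, sketched below, is to store the cached values as runs instead of one element per null. This is only an illustration and not parquet-mr code: RunLengthLevelCache and LevelConsumer are hypothetical names, and the sketch assumes the cache holds one int per cached null, which the description above implies but does not spell out. With all-zero input the storage stays at a single run regardless of how many records are written.

```java
// Hypothetical sketch: NOT part of the parquet-mr API.
// Stores a sequence of int levels as (value, count) runs, so a long
// sequence of identical values (e.g. all zeros) needs only one run.
final class RunLengthLevelCache {
  private int[] values = new int[4]; // level stored for each run
  private int[] counts = new int[4]; // number of repeats in each run
  private int runs = 0;              // number of runs currently in use
  private long size = 0;             // total number of levels added

  // Appends one level, extending the last run when the value repeats.
  void add(int level) {
    if (runs > 0 && values[runs - 1] == level) {
      counts[runs - 1]++;
    } else {
      if (runs == values.length) {
        grow();
      }
      values[runs] = level;
      counts[runs] = 1;
      runs++;
    }
    size++;
  }

  // Doubles the capacity of both arrays, keeping the existing runs.
  private void grow() {
    int[] newValues = new int[values.length * 2];
    int[] newCounts = new int[counts.length * 2];
    System.arraycopy(values, 0, newValues, 0, runs);
    System.arraycopy(counts, 0, newCounts, 0, runs);
    values = newValues;
    counts = newCounts;
  }

  long size() {
    return size;
  }

  int runCount() {
    return runs;
  }

  // Callback used when replaying the cached levels.
  interface LevelConsumer {
    void accept(int level);
  }

  // Replays every cached level in insertion order.
  void forEach(LevelConsumer consumer) {
    for (int i = 0; i < runs; i++) {
      for (int j = 0; j < counts[i]; j++) {
        consumer.accept(values[i]);
      }
    }
  }

  // Empties the cache but keeps the allocated arrays for reuse.
  void clear() {
    runs = 0;
    size = 0;
  }
}
```

In the scenario above, every add(0) call extends the same run, so the footprint is constant instead of growing with the number of records. Replaying with forEach yields the same sequence a plain list of ints would.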



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)
