Boris Molodenkov created PARQUET-1119:
-----------------------------------------
Summary: Reduce memory footprint for nulls caching
Key: PARQUET-1119
URL: https://issues.apache.org/jira/browse/PARQUET-1119
Project: Parquet
Issue Type: Improvement
Components: parquet-mr
Affects Versions: 1.9.0
Reporter: Boris Molodenkov
Scenario:
There is a schema with many optional groups, e.g.
{noformat}
message example {
  required binary id (UTF8);
  optional group a1 (LIST) {
    repeated int64 array;
  }
  optional group a2 (LIST) {
    repeated int64 array;
  }
  ...
  optional group aN (LIST) {
    repeated int64 array;
  }
}
{noformat}
Many records are written in which none of the optional groups are set.
In this case groupNullCache will contain many elements, all of which are zeros.
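
One possible direction, shown only as a rough sketch and not as the actual parquet-mr code or a proposed patch: if the cache holds one int per skipped group occurrence, long runs of identical values could be stored as (value, count) pairs instead of one entry per value. The class name RunLengthIntCache and all of its methods are hypothetical and do not exist in parquet-mr.

```java
import java.util.ArrayList;
import java.util.List;

/**
 * Hypothetical run-length cache for int values (not part of parquet-mr).
 * Consecutive equal values are collapsed into a single {value, count} run,
 * so a long sequence of zeros costs one small array instead of one entry each.
 */
final class RunLengthIntCache {
    // Each element is a two-slot array: {value, count}.
    private final List<int[]> runs = new ArrayList<>();
    private int size = 0;

    /** Appends a value, extending the last run when the value repeats. */
    void add(int value) {
        if (!runs.isEmpty()) {
            int[] last = runs.get(runs.size() - 1);
            if (last[0] == value) {
                last[1]++;
                size++;
                return;
            }
        }
        runs.add(new int[] {value, 1});
        size++;
    }

    /** Number of logical elements added since the last clear. */
    int size() {
        return size;
    }

    /** Number of runs actually stored in memory. */
    int runCount() {
        return runs.size();
    }

    /** Returns the logical element at the given index by walking the runs. */
    int get(int index) {
        if (index < 0 || index >= size) {
            throw new IndexOutOfBoundsException("index " + index + ", size " + size);
        }
        int remaining = index;
        for (int[] run : runs) {
            if (remaining < run[1]) {
                return run[0];
            }
            remaining -= run[1];
        }
        throw new AssertionError("unreachable: size and runs out of sync");
    }

    /** Drops all cached values, e.g. after they have been flushed. */
    void clear() {
        runs.clear();
        size = 0;
    }
}
```

With the scenario above, adding 100000 zeros for one group would leave a single run in memory rather than 100000 list entries. The trade-off in this sketch is that get walks the runs linearly, which is cheap only while the number of runs stays small.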