Daniel Becker created IMPALA-13988:
--------------------------------------
Summary: Take Parquet page size into account when estimating
memory for HDFS WRITER
Key: IMPALA-13988
URL: https://issues.apache.org/jira/browse/IMPALA-13988
Project: IMPALA
Issue Type: Improvement
Components: Frontend
Reporter: Daniel Becker
The Iceberg table properties 'write.parquet.page-size-bytes' and
'write.parquet.dict-size-bytes' allow setting the size of Parquet pages. These
page sizes are not taken into account when estimating the memory of the Hdfs
writers, so if they are set to a large value, the query may fail with a
MemoryLimitExceeded error.
Note that before IMPALA-13963, we always incorrectly reserved a default-sized
buffer, so we didn't trigger the memory limit if we didn't actually write more
than the default page size, but if we did, Impala crashed.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)