Jinpeng Zhou created PARQUET-2321:
-------------------------------------
Summary: allow customized buffer size when creating
ArrowInputStream for a column PageReader
Key: PARQUET-2321
URL: https://issues.apache.org/jira/browse/PARQUET-2321
Project: Parquet
Issue Type: Improvement
Components: parquet-cpp
Reporter: Jinpeng Zhou
Fix For: cpp-13.0.0
When buffered stream is enabled, all column chunks, regardless of their actual
sizes, currently share the same buffer size, which is stored in the shared
read properties
(https://github.com/apache/arrow/blob/main/cpp/src/parquet/file_reader.cc#L213).
Given a limited memory budget, one may want to customize the buffer size for
different column chunks based on their actual sizes, so that smaller chunks
consume less of the memory budget for their buffers.
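
As a rough illustration of the idea (not the proposed patch), the sketch below
picks a buffer size per column chunk by capping it at the shared default. The
helper names ChooseBufferSize and TotalBufferBytes are hypothetical and are not
part of the Parquet C++ API; the chunk sizes are assumed to be known from the
file metadata.

```cpp
#include <algorithm>
#include <cassert>
#include <cstdint>
#include <vector>

// Hypothetical helper, not part of the Parquet C++ API: choose a buffer size
// for one column chunk so that it never exceeds the shared default.
inline int64_t ChooseBufferSize(int64_t chunk_bytes, int64_t default_buffer_size) {
  assert(default_buffer_size > 0);
  // Keep at least 1 byte so an empty or unknown-size chunk still gets a
  // valid, non-zero buffer size.
  return std::max<int64_t>(1, std::min(chunk_bytes, default_buffer_size));
}

// Hypothetical helper: total buffer memory needed if every listed chunk is
// buffered at the same time using the per-chunk sizes chosen above.
inline int64_t TotalBufferBytes(const std::vector<int64_t>& chunk_sizes,
                                int64_t default_buffer_size) {
  int64_t total = 0;
  for (int64_t size : chunk_sizes) {
    total += ChooseBufferSize(size, default_buffer_size);
  }
  return total;
}
```

For example, with an assumed shared size of 16384 bytes and chunks of 100,
1048576, and 4096 bytes, the per-chunk choice needs 20580 bytes in total,
whereas giving every chunk the shared size would need 49152 bytes.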