[
https://issues.apache.org/jira/browse/PARQUET-2321?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
ASF GitHub Bot updated PARQUET-2321:
------------------------------------
Labels: pull-request-available (was: )
> allow customized buffer size when creating ArrowInputStream for a column
> PageReader
> -----------------------------------------------------------------------------------
>
> Key: PARQUET-2321
> URL: https://issues.apache.org/jira/browse/PARQUET-2321
> Project: Parquet
> Issue Type: Improvement
> Components: parquet-cpp
> Reporter: Jinpeng Zhou
> Priority: Minor
> Labels: pull-request-available
> Fix For: cpp-13.0.0
>
> Time Spent: 10m
> Remaining Estimate: 0h
>
> When buffered stream is enabled, all column chunks, regardless of their
> actual sizes, are currently sharing the same buffer size which is stored in
> the shared [read
> properties]([https://github.com/apache/arrow/blob/main/cpp/src/parquet/file_reader.cc#L213).]
>
> Given a limited memory budget, one may want to customize buffer size for
> different column chunks based on their actual size, i.e., smaller chunks will
> use consume less memory budget for its buffer.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)