[
https://issues.apache.org/jira/browse/PARQUET-2321?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17784252#comment-17784252
]
Jinpeng Zhou commented on PARQUET-2321:
---------------------------------------
I think we can close this one for now, as it may not be worth making all these
changes for one specific scenario. I'll revisit this if it becomes more favorable.
Thanks for all the comments and reviews.
> allow customized buffer size when creating ArrowInputStream for a column
> PageReader
> -----------------------------------------------------------------------------------
>
> Key: PARQUET-2321
> URL: https://issues.apache.org/jira/browse/PARQUET-2321
> Project: Parquet
> Issue Type: Improvement
> Components: parquet-cpp
> Reporter: Jinpeng Zhou
> Priority: Minor
> Labels: pull-request-available
> Fix For: cpp-15.0.0
>
> Time Spent: 6.5h
> Remaining Estimate: 0h
>
> When buffered stream is enabled, all column chunks, regardless of their
> actual sizes, currently share the same buffer size, which is stored in
> the shared [read
> properties](https://github.com/apache/arrow/blob/main/cpp/src/parquet/file_reader.cc#L213).
>
> Given a limited memory budget, one may want to customize the buffer size for
> different column chunks based on their actual sizes, i.e., smaller chunks will
> consume less of the memory budget for their buffers.
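
The per-chunk sizing idea in the quoted description could be sketched roughly as follows. This is only an illustration of the arithmetic, not the change proposed in the pull request: `ChooseBufferSize` and `TotalBufferBytes` are hypothetical helpers that do not exist in Arrow or Parquet, and the sketch assumes the caller already knows each column chunk's size in bytes and the shared buffer size from the read properties.

```cpp
#include <algorithm>
#include <cassert>
#include <cstdint>
#include <vector>

// Hypothetical helper (not an Arrow/Parquet API): choose a buffer size for one
// column chunk. The shared buffer size from the read properties acts as an
// upper bound, so a small chunk never reserves more than it can actually use.
inline int64_t ChooseBufferSize(int64_t chunk_bytes, int64_t shared_buffer_size) {
  assert(chunk_bytes >= 0 && shared_buffer_size > 0);
  // Clamp to at least 1 byte so an empty chunk still yields a valid size.
  return std::max<int64_t>(1, std::min(chunk_bytes, shared_buffer_size));
}

// Hypothetical helper: total buffer memory needed if every chunk in
// `chunk_sizes` gets its own buffer sized by ChooseBufferSize.
inline int64_t TotalBufferBytes(const std::vector<int64_t>& chunk_sizes,
                                int64_t shared_buffer_size) {
  int64_t total = 0;
  for (int64_t size : chunk_sizes) {
    total += ChooseBufferSize(size, shared_buffer_size);
  }
  return total;
}
```

For example, with an assumed shared buffer size of 16384 bytes and three chunks of 100, 1048576, and 5000 bytes, the per-chunk sizes sum to 21484 bytes, compared with 49152 bytes when every chunk uses the shared size.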
--
This message was sent by Atlassian Jira
(v8.20.10#820010)