Jinpeng Zhou created PARQUET-2321:
-------------------------------------

             Summary: allow customized buffer size when creating 
ArrowInputStream for a column PageReader
                 Key: PARQUET-2321
                 URL: https://issues.apache.org/jira/browse/PARQUET-2321
             Project: Parquet
          Issue Type: Improvement
          Components: parquet-cpp
            Reporter: Jinpeng Zhou
             Fix For: cpp-13.0.0


When buffered stream is enabled, all column chunks, regardless of their actual 
sizes, currently share the same buffer size, which is stored in the shared 
[read properties](https://github.com/apache/arrow/blob/main/cpp/src/parquet/file_reader.cc#L213).

Given a limited memory budget, one may want to customize the buffer size for 
different column chunks based on their actual sizes, so that smaller chunks 
consume less of the memory budget for their buffers.
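
As a rough illustration of the idea, the sketch below picks a buffer size for a 
single column chunk so that it never exceeds the chunk's own size or the shared 
default. `PickBufferSize` and both of its parameters are hypothetical names used 
only for this example; they are not part of parquet-cpp, and the sizing policy is 
an assumption rather than the proposed implementation.

```cpp
#include <algorithm>
#include <cassert>
#include <cstdint>

// Hypothetical helper, not part of parquet-cpp: choose a buffer size for one
// column chunk, capped by both the chunk's size and the shared default.
inline int64_t PickBufferSize(int64_t chunk_bytes, int64_t default_buffer_size) {
  assert(default_buffer_size > 0);
  // Treat empty or unknown chunk sizes as one byte so the result stays positive.
  const int64_t wanted = std::max<int64_t>(chunk_bytes, 1);
  // A small chunk gets a small buffer; a large chunk falls back to the default.
  return std::min(wanted, default_buffer_size);
}
```

With a per-column override in place, a caller could compute this value from the 
column chunk metadata and use it instead of the single size held in the shared 
read properties.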



--
This message was sent by Atlassian Jira
(v8.20.10#820010)