Wes McKinney created PARQUET-1642:
-------------------------------------
Summary: [C++] Provide for readahead-buffering in column readers
Key: PARQUET-1642
URL: https://issues.apache.org/jira/browse/PARQUET-1642
Project: Parquet
Issue Type: Improvement
Components: parquet-cpp
Reporter: Wes McKinney
Currently we support two modes of reading a column chunk:
* Read entire chunk into memory before beginning to deserialize
* Use {{arrow::io::BufferedInputStream}} to buffer reads, where IO calls are
triggered when a buffer is consumed
One downside of the latter mode is that IO may sit idle while
deserialization is happening. It might be preferable to allow "lookahead
buffering" (possibly using "ReadaheadSpooler") so that data continues to be
requested in the background.
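
To make the idea concrete, here is a minimal sketch of background readahead using only the C++ standard library. It is an illustration under stated assumptions, not a proposed patch: {{ReadaheadBuffer}}, {{ReadFn}}, {{chunk_size}} and {{max_queued}} are hypothetical names that do not exist in Arrow or parquet-cpp, and the read callback stands in for whatever input stream the column reader wraps. A worker thread keeps up to {{max_queued}} chunks in flight while the consumer deserializes the chunk it already holds.

```cpp
#include <condition_variable>
#include <cstddef>
#include <deque>
#include <functional>
#include <mutex>
#include <string>
#include <thread>
#include <utility>

// Hypothetical helper for illustration; not an Arrow or parquet-cpp class.
class ReadaheadBuffer {
 public:
  // Assumed contract: returns up to `nbytes` bytes, or an empty string at
  // end of stream.
  using ReadFn = std::function<std::string(std::size_t)>;

  ReadaheadBuffer(ReadFn read_fn, std::size_t chunk_size, std::size_t max_queued)
      : read_fn_(std::move(read_fn)),
        chunk_size_(chunk_size),
        max_queued_(max_queued),
        worker_([this] { Fill(); }) {}

  ~ReadaheadBuffer() {
    {
      std::lock_guard<std::mutex> lock(mutex_);
      stop_ = true;
    }
    not_full_.notify_all();
    worker_.join();
  }

  // Blocks until a prefetched chunk is available and moves it into *out.
  // Returns false once the stream is exhausted and the queue is drained.
  bool Next(std::string* out) {
    std::unique_lock<std::mutex> lock(mutex_);
    not_empty_.wait(lock, [this] { return !queue_.empty() || done_; });
    if (queue_.empty()) return false;
    *out = std::move(queue_.front());
    queue_.pop_front();
    lock.unlock();
    not_full_.notify_one();  // a slot opened up; let the worker read more
    return true;
  }

 private:
  // Runs on the worker thread: keeps requesting data while the consumer is
  // busy, but never holds more than max_queued_ chunks.
  void Fill() {
    for (;;) {
      {
        std::unique_lock<std::mutex> lock(mutex_);
        not_full_.wait(lock,
                       [this] { return queue_.size() < max_queued_ || stop_; });
        if (stop_) break;
      }
      std::string chunk = read_fn_(chunk_size_);  // IO happens outside the lock
      const bool eof = chunk.empty();
      {
        std::lock_guard<std::mutex> lock(mutex_);
        if (eof) {
          done_ = true;
        } else {
          queue_.push_back(std::move(chunk));
        }
      }
      not_empty_.notify_all();
      if (eof) break;
    }
  }

  ReadFn read_fn_;
  std::size_t chunk_size_;
  std::size_t max_queued_;
  std::mutex mutex_;
  std::condition_variable not_empty_;
  std::condition_variable not_full_;
  std::deque<std::string> queue_;
  bool done_ = false;
  bool stop_ = false;
  std::thread worker_;  // declared last so all other members exist before it starts
};
```

A column reader would call {{Next()}} in a loop and deserialize each chunk as it arrives. Bounding the queue caps memory use, which is the main tradeoff against the first mode (reading the entire chunk up front).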