Re: [D] It is possible to reduce peak memory usage when using datasets (to use predicate pushdown) when reading single parquet files [arrow]

via GitHub Tue, 15 Jul 2025 07:53:02 -0700


GitHub user pitrou added a comment to the discussion: It is possible to reduce 
peak memory usage when using datasets (to use predicate pushdown) when reading 
single parquet files


In addition to `batch_readahead`, you can also try the
[`cache_metadata`](https://arrow.apache.org/docs/cpp/api/dataset.html#_CPPv4N5arrow7dataset11ScanOptions14cache_metadataE)
option.

GitHub link: 
https://github.com/apache/arrow/discussions/47003#discussioncomment-13765913

