GitHub user pitrou added a comment to the discussion: It is possible to reduce peak memory usage when using datasets (to use predicate pushdown) when reading single parquet files
You can also try a different MemoryPool to see whether that helps. See https://arrow.apache.org/docs/cpp/env_vars.html#envvar-ARROW_DEFAULT_MEMORY_POOL

GitHub link: https://github.com/apache/arrow/discussions/47003#discussioncomment-13716773
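
One way to try this suggestion is to launch the same workload in a fresh process for each backend, with ARROW_DEFAULT_MEMORY_POOL set in that process's environment. The sketch below is only an illustration and not something from the comment above: the helper names `pool_env` and `run_with_pool` are hypothetical, and it assumes the backend names given in the linked documentation (`system`, `jemalloc`, `mimalloc`). Which of them actually work depends on how your Arrow build was compiled.

```python
import os
import subprocess

# Backend names listed in the Arrow C++ environment-variable docs.
# Availability depends on how the installed Arrow build was compiled.
KNOWN_POOLS = ("system", "jemalloc", "mimalloc")


def pool_env(pool_name, base_env=None):
    """Return a copy of the environment with ARROW_DEFAULT_MEMORY_POOL set.

    Hypothetical helper: base_env defaults to the current process environment.
    """
    if pool_name not in KNOWN_POOLS:
        raise ValueError(f"unknown memory pool backend: {pool_name!r}")
    env = dict(os.environ if base_env is None else base_env)
    env["ARROW_DEFAULT_MEMORY_POOL"] = pool_name
    return env


def run_with_pool(pool_name, argv):
    """Run argv in a new process whose environment selects the given pool.

    A separate process is used so the variable is already set when Arrow
    is loaded there. Raises CalledProcessError if the command fails.
    """
    return subprocess.run(
        argv,
        env=pool_env(pool_name),
        check=True,
        capture_output=True,
        text=True,
    )
```

For example, `run_with_pool("system", ["python", "read_parquet.py"])` would run a script of your own (the file name here is a placeholder) with the system allocator selected, so you can compare its peak memory against a run with another backend.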