[I] [Python][Parquet] Inconsistent default `pre_buffer` arg between `pyarrow.parquet.ParquetFile` and the other objects and methods [arrow]

via GitHub Sat, 31 May 2025 16:34:53 -0700


JeanCHDJdev opened a new issue, #46658:
URL: https://github.com/apache/arrow/issues/46658


   ### Describe the usage question you have. Please include as many useful details as possible.
   
   
   Hi everyone,
   
   We have been using `pyarrow.parquet.ParquetFile` and 
`pyarrow.parquet.read_table` extensively, and have been hitting errors related 
to `pre_buffer=True` for users on newer versions of macOS (Sequoia). I 
would like to understand two things related to usage:
   
   - `pyarrow.parquet.ParquetFile` has `pre_buffer=False` as a default 
([doc](https://arrow.apache.org/docs/python/generated/pyarrow.parquet.ParquetFile.html))
 but other objects such as `pyarrow.parquet.ParquetDataset` 
([doc](https://arrow.apache.org/docs/python/generated/pyarrow.parquet.ParquetDataset.html))
 use `pre_buffer=True`.
   - This is also the case for the `pyarrow.parquet.read_table()` function 
([doc](https://arrow.apache.org/docs/python/generated/pyarrow.parquet.read_table.html)),
 which uses `pre_buffer=True`. I'd like to understand why `read_table()` does 
not rely on the defaults provided by the inherited `pyarrow.NativeFile`, if 
provided. For example, if it followed `pyarrow.parquet.ParquetFile`, 
`read_table()` would use `pre_buffer=False`.
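
To make the two points above concrete, here is a minimal sketch of how the declared defaults could be checked on an installed pyarrow, and how `pre_buffer` could be pinned explicitly instead of relying on them. The helper names (`default_of`, `report_pre_buffer_defaults`, `read_table_no_prebuffer`) are hypothetical and not part of pyarrow. The sketch assumes all three callables expose a `pre_buffer` keyword in their Python signature, as the linked docs describe, and the values may differ between pyarrow versions.

```python
import inspect


def default_of(func, name):
    """Return the declared default of keyword `name`, or None if it has none."""
    param = inspect.signature(func).parameters.get(name)
    if param is None or param.default is inspect.Parameter.empty:
        return None
    return param.default


def report_pre_buffer_defaults():
    """Map each reader entry point to its declared `pre_buffer` default."""
    # Imported lazily so the helper above stays usable without pyarrow.
    import pyarrow.parquet as pq

    targets = {
        "ParquetFile": pq.ParquetFile,
        "ParquetDataset": pq.ParquetDataset,
        "read_table": pq.read_table,
    }
    return {label: default_of(obj, "pre_buffer") for label, obj in targets.items()}


def read_table_no_prebuffer(source, **kwargs):
    """Hypothetical wrapper: call read_table with pre_buffer=False unless overridden."""
    import pyarrow.parquet as pq

    kwargs.setdefault("pre_buffer", False)
    return pq.read_table(source, **kwargs)
```

With pyarrow installed, `report_pre_buffer_defaults()` returns the defaults for the running version, and `read_table_no_prebuffer("data.parquet")` (a placeholder path) reads a file without depending on the `read_table()` default.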
   
   Please let me know if this is not the right place to post this and if these 
issues are real. Thank you for your time.
   
   ### Component(s)
   
   Python


