jorisvandenbossche commented on pull request #7156:
URL: https://github.com/apache/arrow/pull/7156#issuecomment-628605330
@bkietz Cool, I am testing this out.
Something like
```
import pyarrow.dataset as ds
with open("test.parquet", 'rb') as f:
dataset = ds.dataset(f)
```
currently does not yet work. This is because you are checking for
`io.BytesIO` in the `dataset()` constructor, while the above `open(..)` gives a
`io.BufferedReader`, which is apparently not a subclass from BytestIO.
Now, I am not fully familiar with the class hierarchy of the Python `io`
module, so will need to look into that a bit. Their common base class might be
`BufferedIOBase` (https://docs.python.org/3/library/io.html#binary-i-o).
I also noticed that it is easy to segfault Fragment, because we don't forbid
the `__init__` constructor, but that's not related to the changes in this PR.
----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
For queries about this service, please contact Infrastructure at:
[email protected]