bkietz commented on PR #37514:
URL: https://github.com/apache/arrow/pull/37514#issuecomment-1724339779
This isn't tested and we don't have follow up from the original issue's
poster that this patch fixes the deadlock.
@mapleFU could you add (something like) the original repro as a test? It'd
be best if we can confirm that `main` hangs with a single IO thread and
`use_threads=False`, then confirm that we no longer hang with this fix.
```python
# test.py
import pyarrow.dataset as pads
ds = pads.dataset("/tmp/test.parq", format="parquet") # can be any parquet
ds.count_rows(use_threads=False)
ARROW_IO_THREADS=1 python test.py
```
@yiteng-guo could you try out this patch and confirm it fixes the deadlock
for you?
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]