George Sakkis created ARROW-5825:
------------------------------------
Summary: [Python] Exceptions swallowed in
ParquetManifest._visit_directories
Key: ARROW-5825
URL: https://issues.apache.org/jira/browse/ARROW-5825
Project: Apache Arrow
Issue Type: Bug
Components: Python
Reporter: George Sakkis
{{ParquetManifest._visit_directories}} uses a {{ThreadPoolExecutor}} to visit
partitioned parquet datasets concurrently, it waits for them to finish but
doesn't check if the respective futures have failed or not. This is quite
tricky to detect and debug as an exception is either raised later as a a
side-effect or (perhaps worse) it passes silently.
Observed on 0.12.1 but appears to be on latest master too.
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)