[
https://issues.apache.org/jira/browse/ARROW-15285?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
David Li updated ARROW-15285:
-----------------------------
Issue Type: Bug (was: Improvement)
> [C++] write_dataset with delete_matching occasionally fails with "Path does
> not exist"
> --------------------------------------------------------------------------------------
>
> Key: ARROW-15285
> URL: https://issues.apache.org/jira/browse/ARROW-15285
> Project: Apache Arrow
> Issue Type: Bug
> Components: C++
> Reporter: David Li
> Priority: Major
> Labels: dataset
>
> Once the bug in ARROW-15265 is fixed, the reproducer from that issue still
> fails occasionally with the following error:
> {noformat}
> Traceback (most recent call last):
>   File "/home/lidavidm/Code/upstream/arrow-15265/python/test.py", line 37, in <module>
>     ds.write_dataset(
>   File "/home/lidavidm/Code/upstream/arrow-15265/python/pyarrow/dataset.py", line 931, in write_dataset
>     _filesystemdataset_write(
>   File "pyarrow/_dataset.pyx", line 2658, in pyarrow._dataset._filesystemdataset_write
>     check_status(CFileSystemDataset.Write(c_options, c_scanner))
>   File "pyarrow/error.pxi", line 114, in pyarrow.lib.check_status
>     raise IOError(message)
> OSError: Path does not exist 'my-bucket/test8.parquet/col1=c'
> {noformat}
> The path is different each time it fails, i.e. the failure is not
> deterministic. It is relatively rare: 2 out of 100 runs when I checked just
> now.
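
Because the failure is intermittent, a failure rate such as "2 out of 100" is easiest to check by repeating the write in a loop and tallying the errors. The following is only a minimal sketch of such a harness, not the reproducer from ARROW-15265: `tally_failures` and `fake_attempt` are hypothetical names, and `fake_attempt` is a simulated stand-in that fails on two fixed runs so the example can run without a filesystem.

```python
from collections import Counter
from typing import Callable


def tally_failures(attempt: Callable[[int], None], runs: int = 100) -> Counter:
    """Call attempt(i) for i in range(runs) and count OSError messages."""
    failures: Counter = Counter()
    for i in range(runs):
        try:
            attempt(i)
        except OSError as exc:
            # Key on the message so that distinct failing paths stay visible.
            failures[str(exc)] += 1
    return failures


def fake_attempt(i: int) -> None:
    # Hypothetical stand-in for the real write call: fails on two fixed runs.
    if i in (17, 63):
        raise OSError(f"Path does not exist (simulated failure in run {i})")


failures = tally_failures(fake_attempt, runs=100)
print(f"{sum(failures.values())} failures out of 100 runs")
for message, count in failures.items():
    print(count, message)
```

In a real check, `attempt` would presumably wrap the `ds.write_dataset(...)` call from the reproducer, with `existing_data_behavior="delete_matching"` and a fresh base directory per run. Those details are assumptions here, since the reproducer itself is not part of this message.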