westonpace opened a new issue, #5800:
URL: https://github.com/apache/arrow-rs/issues/5800

   **Describe the bug**
   The local filesystem implementation of `list_with_delimiter` is a two-step 
process:
    * First, `walkdir` is used to get the filenames
    * Second, `convert_entry` is called to get the metadata for each file 
   
   If a file is deleted between the first and second steps, 
`list_with_delimiter` returns an error:
   
   ```
   Generic LocalFileSystem error: Unable to access metadata for 
tmp/pytest-of-pace/pytest-3/test_compact_with_write_82_1000/dataset/_versions/.tmp_7.manifest_9c100374-3298-4537-afc6-f5ee7913666d
   ```
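
The race can be reproduced deterministically with the standard library alone. This is a minimal sketch, not the `object_store` code: `list_paths` and `sizes_strict` are hypothetical stand-ins for the `walkdir` pass and the `convert_entry` pass, and `simulate_race` deletes a file between the two passes to mimic a concurrent delete.

```rust
use std::fs;
use std::io;
use std::path::{Path, PathBuf};

/// Step 1 (hypothetical stand-in for the `walkdir` pass):
/// collect the paths directly under `dir`.
fn list_paths(dir: &Path) -> io::Result<Vec<PathBuf>> {
    let mut paths = Vec::new();
    for entry in fs::read_dir(dir)? {
        paths.push(entry?.path());
    }
    paths.sort();
    Ok(paths)
}

/// Step 2 (hypothetical stand-in for `convert_entry`):
/// fetch metadata for every path and fail on the first error.
fn sizes_strict(paths: &[PathBuf]) -> io::Result<Vec<(PathBuf, u64)>> {
    paths
        .iter()
        .map(|p| fs::metadata(p).map(|m| (p.clone(), m.len())))
        .collect()
}

/// Deletes a file between the two steps, as a concurrent writer might.
fn simulate_race(dir: &Path) -> io::Result<Vec<(PathBuf, u64)>> {
    fs::create_dir_all(dir)?;
    fs::write(dir.join("a.txt"), b"a")?;
    fs::write(dir.join("b.txt"), b"bb")?;

    let paths = list_paths(dir)?; // step 1 sees both files
    fs::remove_file(dir.join("b.txt"))?; // the "concurrent" delete
    sizes_strict(&paths) // step 2 hits the missing file
}
```

Calling `simulate_race` on a scratch directory returns an `Err` whose kind is `io::ErrorKind::NotFound`, because the second pass treats the vanished file as a hard failure.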
   
   Note: I suspect that `walkdir` itself may fail if an entire directory is 
deleted while `walkdir` is iterating.  This may be an issue that cannot be 
cleanly solved without writing a custom walk dir implementation that catches 
and swallows "file not found" errors.
   
   **To Reproduce**
   
    * Create a dataset with a lot of files
    * In one thread, call `list_with_delimiter`
    * In another thread, start deleting the files in the dataset
   
   **Expected behavior**
   There is no error.  Either the file is returned with metadata, or no file is 
returned.
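
One way to get this behavior is to treat "not found" during the metadata step as "the file no longer exists" and drop the entry. The sketch below uses only `std::fs`. `size_if_present` and `sizes_tolerant` are hypothetical names, not part of `object_store`, and the sketch does not cover the case where `walkdir` itself fails on a deleted directory.

```rust
use std::fs;
use std::io;
use std::path::{Path, PathBuf};

/// Returns `Ok(None)` when the file vanished before its metadata could be
/// read. Any other I/O error is still propagated.
fn size_if_present(path: &Path) -> io::Result<Option<u64>> {
    match fs::metadata(path) {
        Ok(meta) => Ok(Some(meta.len())),
        Err(e) if e.kind() == io::ErrorKind::NotFound => Ok(None),
        Err(e) => Err(e),
    }
}

/// Tolerant second pass: entries deleted since the first pass are skipped,
/// so each path is either returned with metadata or not returned at all.
fn sizes_tolerant(paths: &[PathBuf]) -> io::Result<Vec<(PathBuf, u64)>> {
    let mut out = Vec::with_capacity(paths.len());
    for p in paths {
        if let Some(len) = size_if_present(p)? {
            out.push((p.clone(), len));
        }
    }
    Ok(out)
}
```

Only `NotFound` is swallowed here. Permission errors and other failures still surface to the caller, so a genuinely broken listing is not silently reported as empty.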
   
   **Additional context**
   N/A


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]
