K S created HADOOP-16378:
----------------------------

             Summary: RawLocalFileStatus throws exception if a file is created 
and deleted quickly
                 Key: HADOOP-16378
                 URL: https://issues.apache.org/jira/browse/HADOOP-16378
             Project: Hadoop Common
          Issue Type: Bug
          Components: fs
            Reporter: K S


The bug occurs when temporary ".nfs*" files are created as part of file moves 
and accesses. If such a file is deleted very quickly after being created, a 
RuntimeException is thrown. The root cause is in the loadPermissionInfo method 
in org.apache.hadoop.fs.RawLocalFileSystem. To get the permission info, it 
first runs

{code}
ls -ld
{code}
and then attempts to get permission info about each file. If a file 
disappears between these two steps, an exception is thrown.
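
The window described above can be illustrated outside Hadoop. The following 
is only a minimal sketch, not the RawLocalFileSystem code: the class 
LsRaceSketch, its methods, and the ".nfs-example" file name are hypothetical, 
and it assumes an `ls` binary is on the PATH. It shows that once a file has 
vanished, `ls -ld` on the stale path exits with a non-zero status.

```java
import java.io.IOException;
import java.io.UncheckedIOException;
import java.nio.file.Files;
import java.nio.file.Path;

// Hypothetical illustration class; not part of Hadoop.
final class LsRaceSketch {

    /** Runs `ls -ld <path>` and returns the exit code reported by ls. */
    static int lsExitCode(Path path) {
        try {
            Process process = new ProcessBuilder("ls", "-ld", path.toString())
                    .redirectErrorStream(true) // merge stderr into stdout
                    .start();
            process.getInputStream().readAllBytes(); // drain output so ls cannot block
            return process.waitFor();
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            throw new IllegalStateException(e);
        }
    }

    /** Creates a temp file, deletes it, then asks ls about the stale path. */
    static int exitCodeForVanishedFile() {
        try {
            Path dir = Files.createTempDirectory("ls-race");
            Path temp = Files.createFile(dir.resolve(".nfs-example")); // hypothetical name
            Files.delete(temp);          // the file disappears after it was "seen"
            int code = lsExitCode(temp); // the second step now targets a missing path
            Files.delete(dir);           // clean up the empty directory
            return code;
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    public static void main(String[] args) {
        System.out.println("ls exit code for vanished file: " + exitCodeForVanishedFile());
    }
}
```

The exact non-zero value depends on the `ls` implementation, which is why the 
sketch prints the code rather than assuming one.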

*Reproduction Steps:*

An isolated way to reproduce the bug is to run FileInputFormat.listStatus 
repeatedly on the same directory in which those temp files are being created. 
On Ubuntu or any other Linux-based system, this should fail intermittently. On 
macOS (due to differences in the exit codes that `ls` returns) this should not 
fail. 

*Fix:*

One way we managed to fix this was to ignore the exception thrown in 
loadPermissionInfo() if the exit code is 1 or 2.
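
A rough sketch of that idea follows. It is not the actual patch: 
TolerantPermissionLookup, its nested ExitCodeException, and the 
PermissionCommand interface are hypothetical stand-ins for the Hadoop code 
path, used only to show the shape of "treat exit code 1 or 2 as a missing file 
and carry on".

```java
import java.util.Optional;

// Hypothetical illustration class; not part of Hadoop.
final class TolerantPermissionLookup {

    /** Stand-in for an exception that carries the shell command's exit code. */
    static final class ExitCodeException extends RuntimeException {
        final int exitCode;

        ExitCodeException(int exitCode) {
            super("command exited with code " + exitCode);
            this.exitCode = exitCode;
        }
    }

    /** Stand-in for "run ls -ld on a path and return its output". */
    interface PermissionCommand {
        String run();
    }

    /** Exit codes that this sketch treats as "the file is gone". */
    static boolean isMissingFileExit(int exitCode) {
        return exitCode == 1 || exitCode == 2;
    }

    /**
     * Returns the command output, or an empty Optional when the command
     * failed with exit code 1 or 2. Any other failure is rethrown.
     */
    static Optional<String> loadPermissionInfo(PermissionCommand command) {
        try {
            return Optional.of(command.run());
        } catch (ExitCodeException e) {
            if (isMissingFileExit(e.exitCode)) {
                return Optional.empty(); // file vanished: skip it instead of failing
            }
            throw e; // unrelated failure: keep the original behaviour
        }
    }
}
```

One trade-off to keep in mind: an exit code of 1 or 2 from `ls` can also 
indicate problems other than a missing file, so ignoring both may hide 
unrelated errors.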


--
This message was sent by Atlassian JIRA
(v7.6.3#76005)