K S created HADOOP-16378:
----------------------------

             Summary: RawLocalFileStatus throws exception if a file is created and deleted quickly
                 Key: HADOOP-16378
                 URL: https://issues.apache.org/jira/browse/HADOOP-16378
             Project: Hadoop Common
          Issue Type: Bug
          Components: fs
            Reporter: K S

The bug occurs when Hadoop creates temporary ".nfs*" files as part of file moves and accesses. If such a file is deleted very quickly after being created, a RuntimeException is thrown.

The root cause is in the loadPermissionInfo method in org.apache.hadoop.fs.RawLocalFileSystem. To get the permission info, it first runs {code:java}ls -ld{code} and then attempts to read the permission info for each file. If a file disappears between these two steps, an exception is thrown.

*Reproduction Steps:*

An isolated way to reproduce the bug is to call FileInputFormat.listStatus repeatedly on the same directory in which the temporary files are being created and deleted. On Ubuntu or any other Linux-based system, this should fail intermittently. On macOS it should not fail, because `ls` returns different status codes there.

*Fix:*

One way in which we managed to fix this was to ignore the exception thrown in loadPermissionInfo() when the exit code of `ls` is 1 or 2, that is, to treat those exit codes as "the file no longer exists" rather than as a fatal error.
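
The workaround described under *Fix* can be sketched roughly as follows. This is a minimal, standalone illustration and not the actual Hadoop code: the class PermissionProbe, the LsCommand interface, and the local ExitCodeException are hypothetical stand-ins (the last one for the exception Hadoop's shell helper raises on a non-zero exit), and returning an empty Optional for a vanished file is an assumption about how a caller might want to see that case.

```java
import java.io.IOException;
import java.util.Optional;

public class PermissionProbe {

    /** Hypothetical stand-in for an exception carrying the exit code of a shell command. */
    static class ExitCodeException extends IOException {
        final int exitCode;

        ExitCodeException(int exitCode, String message) {
            super(message);
            this.exitCode = exitCode;
        }
    }

    /** Hypothetical abstraction over running "ls -ld <path>" and returning its output. */
    interface LsCommand {
        String run(String path) throws IOException;
    }

    /**
     * Returns the raw "ls -ld" output for the path, or an empty Optional if the
     * file vanished before it could be inspected. Exit codes 1 and 2 are treated
     * as "file no longer exists"; any other failure is still rethrown.
     */
    static Optional<String> loadPermissionInfo(String path, LsCommand ls) {
        try {
            return Optional.of(ls.run(path));
        } catch (ExitCodeException e) {
            if (e.exitCode == 1 || e.exitCode == 2) {
                // The file was deleted between listing and inspection: not fatal.
                return Optional.empty();
            }
            throw new RuntimeException("ls -ld failed for " + path, e);
        } catch (IOException e) {
            throw new RuntimeException("ls -ld failed for " + path, e);
        }
    }

    public static void main(String[] args) {
        // Simulate a temp file that disappears before "ls -ld" runs on it.
        LsCommand vanished = p -> {
            throw new ExitCodeException(2, "No such file or directory");
        };
        Optional<String> info = loadPermissionInfo("/tmp/.nfs0001", vanished);
        System.out.println("permission info present: " + info.isPresent());
    }
}
```

Passing the command in as an interface keeps the sketch independent of the operating system, so the race can be simulated deterministically instead of relying on a file actually disappearing at the right moment.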