Rob Reeves created MAPREDUCE-7500:
-------------------------------------
Summary: Support optimistic file renames in the commit protocol
Key: MAPREDUCE-7500
URL: https://issues.apache.org/jira/browse/MAPREDUCE-7500
Project: Hadoop Map/Reduce
Issue Type: Improvement
Components: client
Reporter: Rob Reeves
During a commit in FileOutputCommitter, every file commit checks if a file or
directory exists in the destination and if so deletes it before the rename. The
FileSystem.getFileStatus can take a significant amount of the total commit
time. However, the happy path is that no file exists in the destination so the
getFileStatus call is wasted time. The commit protocol can avoid this time by
optimistically assuming there is no file in the destination and only attempt to
delete it if the rename fails. In our HDFS environment this change reduced the
commit time by 70%.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]