Make getGenerationStampFromFile() more efficient, so it doesn't reprocess full
directory listing for every block
----------------------------------------------------------------------------------------------------------------
Key: HDFS-1447
URL: https://issues.apache.org/jira/browse/HDFS-1447
Project: Hadoop HDFS
Issue Type: Sub-task
Components: data-node
Affects Versions: 0.20.2
Reporter: Matt Foley
Assignee: Matt Foley
Fix For: 0.22.0
Make getGenerationStampFromFile() more efficient. Currently this routine is
called by addToReplicasMap() for every blockfile in the directory tree, and it
walks each file's containing directory on every call. There is a simple
refactoring that should make it more efficient.
This work item is one of four sub-tasks for HDFS-1443, Improve Datanode startup
time.
The fix will probably be folded into sibling task HDFS-1446, which is already
refactoring the method that calls getGenerationStampFromFile().
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.