Improve the performance efficiency of task initialization at the JobTracker ---------------------------------------------------------------------------
Key: HADOOP-6026 URL: https://issues.apache.org/jira/browse/HADOOP-6026 Project: Hadoop Core Issue Type: Improvement Components: mapred Reporter: dhruba borthakur Assignee: Zheng Shao The JobTracker reads the splits for a job at Job Initialization time. Then, for each location in the split, it invokes DNSToSwitchMapping.resolve(). This, in turn, typically invokes an external script that resolves the hostname to a network rack location. The time spent in invoking this external script can be reduced if the hostname and their rack locations are inserted into a cache. JobTracker.resolveAndAddToTopology() can look up this cache first and avoid invoking the external "resolve" script is most cases. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.