Improve the rack-aware replica placement performance
----------------------------------------------------

                 Key: HADOOP-972
                 URL: https://issues.apache.org/jira/browse/HADOOP-972
             Project: Hadoop
          Issue Type: Improvement
          Components: dfs
    Affects Versions: 0.11.0
            Reporter: Hairong Kuang
         Assigned To: Hairong Kuang
             Fix For: 0.12.0


This issue aims to improve the performance of rack-aware replica placement. The 
main idea is to avoid constructing lists of candidate targets for random 
selection in chooseTarget, which currently requires iterating over all 
DatanodeDescriptors. I plan to change the NetworkTopology data structure as 
follows:
1. each InnerNode stores its children as a list;
2. each InnerNode adds a new field, numberOfLeaves, holding the total number of 
leaves (i.e. data nodes) in its subtree.
NetworkTopology will support two new methods:
1. DatanodeDescriptor chooseRandom(String scope): it randomly chooses one leaf 
from scope.
2. DatanodeDescriptor chooseRandomExclude(String excludedScope): it randomly 
chooses one leaf from ~excludedScope, i.e. from the nodes outside excludedScope.
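
To illustrate why the cached leaf counts help, here is a minimal Java sketch of 
the idea. It is not the actual patch. The names TopologySketch, Node, 
isInSubtree, countLeaves and getLeaf are hypothetical, and scopes are passed as 
node references rather than path strings to keep the example short. A random 
leaf is picked by drawing an index and descending the tree using 
numberOfLeaves, so no candidate list is ever built.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Random;

class TopologySketch {
    /** Any node in the tree; a plain Node is a leaf (a data node). */
    static class Node {
        final String name;
        InnerNode parent;
        Node(String name) { this.name = name; }
        int numberOfLeaves() { return 1; }
    }

    /** A rack or similar grouping: children in a list plus a cached leaf count. */
    static class InnerNode extends Node {
        final List<Node> children = new ArrayList<>();
        private int numberOfLeaves = 0;
        InnerNode(String name) { super(name); }
        @Override int numberOfLeaves() { return numberOfLeaves; }

        /** Attaches child and updates the leaf count of this node and every ancestor. */
        void add(Node child) {
            child.parent = this;
            children.add(child);
            int added = child.numberOfLeaves();
            for (InnerNode n = this; n != null; n = n.parent) {
                n.numberOfLeaves += added;
            }
        }
    }

    /** True if node is ancestor itself or lies somewhere in its subtree. */
    static boolean isInSubtree(Node node, Node ancestor) {
        for (Node n = node; n != null; n = n.parent) {
            if (n == ancestor) return true;
        }
        return false;
    }

    /** Number of leaves under node that are not under excluded (excluded may be null). */
    static int countLeaves(Node node, Node excluded) {
        if (excluded == null) return node.numberOfLeaves();
        if (isInSubtree(node, excluded)) return 0;
        if (isInSubtree(excluded, node)) {
            return node.numberOfLeaves() - excluded.numberOfLeaves();
        }
        return node.numberOfLeaves();
    }

    /** Returns the index-th non-excluded leaf under scope, descending by leaf counts. */
    static Node getLeaf(Node scope, int index, Node excluded) {
        Node current = scope;
        while (current instanceof InnerNode) {
            Node next = null;
            for (Node child : ((InnerNode) current).children) {
                int count = countLeaves(child, excluded);
                if (index < count) { next = child; break; }
                index -= count;  // skip past this child's leaves
            }
            if (next == null) throw new IndexOutOfBoundsException("leaf index out of range");
            current = next;
        }
        return current;
    }

    /** Picks one leaf uniformly at random from scope, or null if scope has no leaves. */
    static Node chooseRandom(Node scope, Random rnd) {
        int n = scope.numberOfLeaves();
        return n == 0 ? null : getLeaf(scope, rnd.nextInt(n), null);
    }

    /** Picks one leaf uniformly at random from root, skipping everything under excludedScope. */
    static Node chooseRandomExclude(Node root, Node excludedScope, Random rnd) {
        int n = countLeaves(root, excludedScope);
        return n == 0 ? null : getLeaf(root, rnd.nextInt(n), excludedScope);
    }

    public static void main(String[] args) {
        InnerNode root = new InnerNode("/");
        InnerNode rack1 = new InnerNode("/rack1");
        InnerNode rack2 = new InnerNode("/rack2");
        root.add(rack1);
        root.add(rack2);
        rack1.add(new Node("dn1"));
        rack1.add(new Node("dn2"));
        rack2.add(new Node("dn3"));

        Random rnd = new Random();
        System.out.println("any node:      " + chooseRandom(root, rnd).name);
        System.out.println("not in rack1:  " + chooseRandomExclude(root, rack1, rnd).name);
    }
}
```

In this sketch the work per selection depends on the tree depth and the number 
of children visited at each level, not on the total number of data nodes.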

In addition, Issue 971 will help improve the performance of the rack-aware 
DFS patch.

