[
https://issues.apache.org/jira/browse/HDFS-10290?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16465002#comment-16465002
]
xiaoli commented on HDFS-10290:
-------------------------------
try:https://issues.apache.org/jira/browse/HDFS-13183
> Move getBlocks calls to DataNode in Balancer
> --------------------------------------------
>
> Key: HDFS-10290
> URL: https://issues.apache.org/jira/browse/HDFS-10290
> Project: Hadoop HDFS
> Issue Type: New Feature
> Components: balancer & mover
> Affects Versions: 2.6.0
> Reporter: He Tianyi
> Priority: Major
>
> In current implementation, Balancer asks NameNode for a list of blocks on
> specific DataNode. This made workload of NameNode heavier, and actually it
> caused NameNode flappy when average # of blocks on each DataNode reaches
> 1,000,000 (NameNode heap size is 192GB, cpu: Xeon E5-2630 * 2).
> Recently I investigated whether {{getBlocks}} invocation from Balancer can be
> handled by DataNodes, turned out to be practical.
> The only pitfall is: since DataNode has no information about other locations
> of each block it possesses, some block move may fail (since target node may
> already has a replica of that particular block).
> I think this may be beneficial for large clusters.
> Any suggestions or comments?
> Thanks.
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]