[ 
https://issues.apache.org/jira/browse/HDFS-6599?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14042481#comment-14042481
 ] 

Kihwal Lee commented on HDFS-6599:
----------------------------------

This is caused by a change in {{BlockPlacementPolicyDefault#isGoodTarget()}}. 
The synchronization overhead in {{HeartBeatManager}} or {{DatanodeManager}} 
might also be worse.

In 0.23 it didn't go there at all. It just looked at the topology.
   {{int size = clusterMap.getNumOfLeaves()}}

In 2.4.x, it is calling this:
   {{int size = stats.getNumDatanodesInService()}}

Since the network topology doesn't tell you the accurate state of the cluster, 
it is a right thing to do semantically.  But its performance impact should have 
been considered. This change was made in HDFS-5837.

> 2.4 addBlock is 10 to 20 times slower compared to 0.23
> ------------------------------------------------------
>
>                 Key: HDFS-6599
>                 URL: https://issues.apache.org/jira/browse/HDFS-6599
>             Project: Hadoop HDFS
>          Issue Type: Bug
>    Affects Versions: 2.3.0, 2.4.0
>            Reporter: Kihwal Lee
>            Priority: Critical
>
> From one of our busiest 0.23 clusters:
> {panel}
> "AddBlockAvgTime" : 0.9514711501719515
> "CreateAvgTime" : 1.7564162389174
> "CompleteAvgTime" : 1.3310406035056548
> "BlockReceivedAndDeletedAvgTime" : 0.661210005151392
> {panel}
> From a not-so-busy 2.4 cluster:
> {panel}
> "AddBlockAvgTime" : 10.083333333333334
> "CreateAvgTime" : 1.0
> "CompleteAvgTime" : 1.1111111111111112
> "BlockReceivedAndDeletedAvgTime" : 0.07692307692307694
> {panel}
> When the 2.4 cluster gets a moderate amount of write requests, the latency is 
> terrible. E.g. addBlock goes upward of 60ms. 



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Reply via email to