[ 
https://issues.apache.org/jira/browse/HDFS-9260?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15966423#comment-15966423
 ] 

Andrew Wang commented on HDFS-9260:
-----------------------------------

Hi Daryn,

I talked to our people who help run this large customer cluster. It's at about 
350 million blocks, so a pretty good size, but also a lot denser than the last 
published stats I saw about the 4500-node Yahoo cluster. We don't have 
historical GC metrics going back a year when we put this into CDH, but they 
haven't seen anything abnormal in terms of GC.

They were quite interested your balancer settings though, since we haven't seen 
it stressing the NN. Could you provide the following?

{noformat}
dfs.datanode.balance.bandwidthPerSec
dfs.datanode.balance.max.concurrent.moves
dfs.namenode.replication.work.multiplier.per.iteration
dfs.namenode.replication.max-streams-hard-limit
{noformat}

I believe we're running it with mostly default settings like this:

{noformat}
hdfs balancer -Ddfs.datanode.balance.max.concurrent.moves=200 -threshold 10
{noformat}

> Improve the performance and GC friendliness of NameNode startup and full 
> block reports
> --------------------------------------------------------------------------------------
>
>                 Key: HDFS-9260
>                 URL: https://issues.apache.org/jira/browse/HDFS-9260
>             Project: Hadoop HDFS
>          Issue Type: Improvement
>          Components: datanode, namenode, performance
>    Affects Versions: 2.7.1
>            Reporter: Staffan Friberg
>            Assignee: Staffan Friberg
>             Fix For: 3.0.0-alpha1
>
>         Attachments: FBR processing.png, HDFS-7435.001.patch, 
> HDFS-7435.002.patch, HDFS-7435.003.patch, HDFS-7435.004.patch, 
> HDFS-7435.005.patch, HDFS-7435.006.patch, HDFS-7435.007.patch, 
> HDFS-9260.008.patch, HDFS-9260.009.patch, HDFS-9260.010.patch, 
> HDFS-9260.011.patch, HDFS-9260.012.patch, HDFS-9260.013.patch, 
> HDFS-9260.014.patch, HDFS-9260.015.patch, HDFS-9260.016.patch, 
> HDFS-9260.017.patch, HDFS-9260.018.patch, HDFSBenchmarks2.zip, 
> HDFSBenchmarks.zip, HDFS Block and Replica Management 20151013.pdf
>
>
> This patch changes the datastructures used for BlockInfos and Replicas to 
> keep them sorted. This allows faster and more GC friendly handling of full 
> block reports.
> Would like to hear peoples feedback on this change.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to