Could you give more details like, - Could you convert 7% to the total amount of moved data in MBs.
- Also, could you tell me 7% data movement per DN ? - What values showing for the ‘over-utilized’, ‘above-average’, ‘below-average’, ‘below-average’ nodes. Balancer will do the pairing based on these values. - Please tell me the cluster topology - SAME_NODE_GROUP, SAME_RACK. Basically this will matters when choosing the sourceNode vs balancerNode pairs as well as the proxy source. Did you see all the DNs are getting utilized for the block movement. - Any exceptions occurred when block movement - How many iterations played in these hours -Rakesh From: ch huang [mailto:[email protected]] Sent: 06 May 2014 06:10 To: [email protected] Subject: issue about cluster balance hi,maillist: i have a 5-node hadoop cluster,and yesterday i add 5 new box into my cluster,after that i start balance task,but it move only 7% data to new node in 20 hour , and i already set dfs.datanode.balance.bandwidthPerSec 10M ,and the threshold is 10%,why the balance task take long time ?
