[ https://issues.apache.org/jira/browse/HADOOP-4116?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12632858#action_12632858 ]
Bo Shi commented on HADOOP-4116:
--------------------------------

Hi Hairong, I'm interested in testing out your fix - is there, by any chance, a patch against the 0.18.x series? I've only been able to partially apply your patch to the official 0.18.1 release, and it seems there was a major code reorg between the 0.18 and 0.19 branches.

> Balancer should provide better resource management
> --------------------------------------------------
>
>                 Key: HADOOP-4116
>                 URL: https://issues.apache.org/jira/browse/HADOOP-4116
>             Project: Hadoop Core
>          Issue Type: Bug
>          Components: dfs
>    Affects Versions: 0.17.0
>            Reporter: Raghu Angadi
>            Assignee: Hairong Kuang
>            Priority: Blocker
>             Fix For: 0.18.2, 0.19.0
>
>         Attachments: balancerRM.patch
>
>
> The number of threads is currently limited on datanodes. Once these threads
> are occupied, DataNode does not accept any more requests (DoS). Recently we
> saw a case where most of the 256 threads were waiting in
> {{DataXceiver.replaceBlock()}} trying to acquire {{balancingSem}}. Since
> rebalancing is (heavily) throttled, I would think this would be the common
> case.
> These operations waiting for active rebalancing threads to finish need not
> take up a thread.
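
The problem in the issue description, where handler threads park on {{balancingSem}} until a rebalancing slot frees up, can be illustrated with a small standalone sketch. This is not the attached balancerRM.patch and not Hadoop code: the class `BalancingThrottleSketch`, its methods, and the permit count are hypothetical names chosen for illustration. It only contrasts a blocking `acquire()` with a non-blocking `tryAcquire()` on a `java.util.concurrent.Semaphore`, which is one possible way to keep a waiting request from holding a thread.

```java
import java.util.concurrent.Semaphore;

// Hypothetical sketch: names are illustrative and do not come from Hadoop.
class BalancingThrottleSketch {
    // Limits how many block moves may run at the same time.
    private final Semaphore balancingSem;

    BalancingThrottleSketch(int maxConcurrentMoves) {
        this.balancingSem = new Semaphore(maxConcurrentMoves);
    }

    // Blocking variant: the calling thread waits until a permit is free,
    // so a busy throttle ties up one thread per pending request.
    void replaceBlockBlocking(Runnable move) throws InterruptedException {
        balancingSem.acquire();
        try {
            move.run();
        } finally {
            balancingSem.release();
        }
    }

    // Non-blocking variant: returns false immediately when no permit is
    // free, leaving it to the caller to retry later.
    boolean tryReplaceBlock(Runnable move) {
        if (!balancingSem.tryAcquire()) {
            return false;
        }
        try {
            move.run();
        } finally {
            balancingSem.release();
        }
        return true;
    }

    public static void main(String[] args) {
        BalancingThrottleSketch throttle = new BalancingThrottleSketch(1);
        // The second request arrives while the only permit is still held.
        boolean accepted = throttle.tryReplaceBlock(() -> {
            boolean second = throttle.tryReplaceBlock(() -> { });
            System.out.println("second request accepted while busy: " + second);
        });
        System.out.println("first request accepted: " + accepted);
    }
}
```

With the non-blocking variant, a request that cannot get a permit returns at once and the thread is free to serve other work. Whether the caller should retry, queue, or give up is a separate design choice that this sketch does not address.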