[
https://issues.apache.org/jira/browse/HDFS-14894?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16949836#comment-16949836
]
Leon Gao commented on HDFS-14894:
---------------------------------
Hi [~kihwal] , Could you take a look at the PR and let me know any thoughts?
Thanks!
> Add balancer parameter to balance top used nodes
> ------------------------------------------------
>
> Key: HDFS-14894
> URL: https://issues.apache.org/jira/browse/HDFS-14894
> Project: Hadoop HDFS
> Issue Type: Improvement
> Components: balancer & mover
> Reporter: Leon Gao
> Assignee: Leon Gao
> Priority: Major
>
> We sometimes see a few of our datanodes reach very high usage (due to various
> reasons) and we need to reduce their usage in an urgent situation.
> We see two ways to achieve it currently,
> -Calculate and reset balancing threshold.
> -Pick nodes manually according to usage stats and put them in a file and use
> `-resource` flag.
> However, both of them are not very intuitive or too much manual work in an
> urgent close-to-outage situation. Add a small feature to automatically pick
> top used hosts will be a straightforward option, for example
> `-sourceThreshold 95` to only target datanodes with >95% usage.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]