[ 
https://issues.apache.org/jira/browse/HDFS-16646?focusedWorklogId=786515&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-786515
 ]

ASF GitHub Bot logged work on HDFS-16646:
-----------------------------------------

                Author: ASF GitHub Bot
            Created on: 30/Jun/22 11:05
            Start Date: 30/Jun/22 11:05
    Worklog Time Spent: 10m 
      Work Description: ZanderXu opened a new pull request, #4519:
URL: https://github.com/apache/hadoop/pull/4519

   ### Description of PR
   As we all known, `StaticRouterRpcFairnessPolicyController` is very helpfully 
for RBF to minimize impact of clients connecting to healthy vs unhealthy 
nameNodes. 
   But in prod environment, the traffic of clients accessing each NS and the 
pressure of downstream namenodes are dynamically changed. So if we only have 
one static permit conf, RBF cannot able to adapt to the changes in traffic to 
achieve optimal results. 
   
   So here I propose an elastic RouterRpcFairnessPolicyController to help RBF 
adapt to traffic changes to achieve an optimal result.
   
   The overall idea is:
   - Each name service can configured the exclusive permits like 
`StaticRouterRpcFairnessPolicyController`
   - TotalPermits is more than sum(NsExclusivePermit) and mark TotalPermits - 
sum(NsExclusivePermit) as SharedPermits
   - Each name service can properly preempt the SharedPermits after it's own 
exclusive permits is used up.
   - But the maximum value of SharedPermits preempted by each nameservice 
should be limited. Such as 20% of SharedPermits.
   
   Suppose we have 200 handlers and 5 name services, and each name services 
configured different exclusive Permits, like:
   | NS1 | NS2 | NS3 | NS4 | NS5 | Concurrent NS |
   |

Issue Time Tracking
-------------------

            Worklog Id:     (was: 786515)
    Remaining Estimate: 0h
            Time Spent: 10m

> [RBF] Improved isolation for downstream name nodes. {Elastic}
> -------------------------------------------------------------
>
>                 Key: HDFS-16646
>                 URL: https://issues.apache.org/jira/browse/HDFS-16646
>             Project: Hadoop HDFS
>          Issue Type: Improvement
>            Reporter: ZanderXu
>            Assignee: ZanderXu
>            Priority: Major
>          Time Spent: 10m
>  Remaining Estimate: 0h
>




--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to