[ 
https://issues.apache.org/jira/browse/HDFS-16678?focusedWorklogId=793769&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-793769
 ]

ASF GitHub Bot logged work on HDFS-16678:
-----------------------------------------

                Author: ASF GitHub Bot
            Created on: 21/Jul/22 14:31
            Start Date: 21/Jul/22 14:31
    Worklog Time Spent: 10m 
      Work Description: ZanderXu opened a new pull request, #4606:
URL: https://github.com/apache/hadoop/pull/4606

   ### Description of PR
   In our prod environment, we try to collect RBF metrics every 15s through 
jmx_exporter. And we found that collection task often failed. 
   
   After tracing and found that the collection task is blocked at 
getNodeUsage() in RBFMetrics, because it will collect all datanode's usage from 
downstream nameservices.  
   
   This is a very expensive and almost useless operation. Because in most 
scenarios, each downstream nameserivce contains almost the same DNs. We can get 
the data usage's from any one nameservices if need, not from RBF.
   
   So I feel that RBF should supports disable getNodeUsage() in RBFMetrics.
   
   




Issue Time Tracking
-------------------

            Worklog Id:     (was: 793769)
    Remaining Estimate: 0h
            Time Spent: 10m

> RBF supports disable getNodeUsage() in RBFMetrics
> -------------------------------------------------
>
>                 Key: HDFS-16678
>                 URL: https://issues.apache.org/jira/browse/HDFS-16678
>             Project: Hadoop HDFS
>          Issue Type: Improvement
>            Reporter: ZanderXu
>            Assignee: ZanderXu
>            Priority: Major
>          Time Spent: 10m
>  Remaining Estimate: 0h
>
> In our prod environment, we try to collect RBF metrics every 15s through 
> jmx_exporter. And we found that collection task often failed. 
> After tracing and found that the collection task is blocked at getNodeUsage() 
> in RBFMetrics, because it will collection all datanode's usage from 
> downstream nameservices.  This is a very expensive and almost useless 
> operation. Because in most scenarios, each NameSerivce contains almost the 
> same DNs. We can get the data usage's from any one nameservices, not from RBF.
> So I feel that RBF should supports disable getNodeUsage() in RBFMetrics.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to