[ 
https://issues.apache.org/jira/browse/HBASE-29970?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Xiao Liu updated HBASE-29970:
-----------------------------
    Description: 
This change removes the unused \{{incrSplitSuccess()}}, \{{updateSplitTime()}}, 
and related split success/time metrics from RegionServer and Table level 
metrics sources.

These metrics \{{splitSuccess}} and \{{splitTime}} is introduced by 
HBASE-12779, the split operation has been migrated from RegionServer-driven 
local execution to Master-driven Procedure-based coordination. When a 
RegionServer triggers a split due to region size threshold, it only sends a 
\{{READY_TO_SPLIT}} state transition request to the Master via 
\{{reportRegionStateTransition()}}. The Master then creates a 
\{{SplitTableRegionProcedure}} to asynchronously coordinate the entire split 
process. The RegionServer has no knowledge of when (or whether) the split 
completes successfully, so it cannot update split success or timing metrics, in 
HBASE-16549 we refactor split metric logic and start to use 
\{{splitProcMetrics}} tracking the split behaivor

h3. Current Behavior


* Split request metrics (\{{splitRequestCount}}) are properly updated when a 
split is initiated
* Split success/time metrics at RegionServer and Table level were defined but 
never called
* Master-side split metrics (\{{splitSubmittedCount}}, \{{splitTime}}, 
\{{splitFailedCount}}) are correctly maintained via the Procedure framework's 
\{{getProcedureMetrics()}} mechanism

h3. Changes Made


* Removed \{{incrSplitSuccess()}} and \{{updateSplitTime()}} methods from 
\{{MetricsRegionServerSource}} and \{{MetricsRegionServerSourceImpl}}, 
corresponding methods from \{{MetricsTableSource}} and 
\{{MetricsTableSourceImpl}}, and related metric definitions and histogram 
variables
* We keep \{{SplitRequest}} and support table-level updates for this metic to 
track split caused by region size exceeds the configured max size
* 
\{code:language=|borderStyle=solid|theme=RDark|linenumbers=true|collapse=false}
* CompactSplit.requestSplit()
* → new SplitRequest(r, midKey, server, user)
* → splits.execute(splitRequest)
* → SplitRequest.run()
* → doSplitting()
* → server.getMetrics().incrSplitRequest(tableName)
* \{code}

 

> SplitSuccess and SplitTime metrics are no longer used at RegionServer and 
> Table level
> -------------------------------------------------------------------------------------
>
>                 Key: HBASE-29970
>                 URL: https://issues.apache.org/jira/browse/HBASE-29970
>             Project: HBase
>          Issue Type: Improvement
>          Components: metrics
>            Reporter: Xiao Liu
>            Assignee: Xiao Liu
>            Priority: Major
>              Labels: pull-request-available
>             Fix For: 2.7.0, 3.0.0-beta-2, 2.6.5, 2.5.14
>
>
> This change removes the unused \{{incrSplitSuccess()}}, 
> \{{updateSplitTime()}}, and related split success/time metrics from 
> RegionServer and Table level metrics sources.
> These metrics \{{splitSuccess}} and \{{splitTime}} is introduced by 
> HBASE-12779, the split operation has been migrated from RegionServer-driven 
> local execution to Master-driven Procedure-based coordination. When a 
> RegionServer triggers a split due to region size threshold, it only sends a 
> \{{READY_TO_SPLIT}} state transition request to the Master via 
> \{{reportRegionStateTransition()}}. The Master then creates a 
> \{{SplitTableRegionProcedure}} to asynchronously coordinate the entire split 
> process. The RegionServer has no knowledge of when (or whether) the split 
> completes successfully, so it cannot update split success or timing metrics, 
> in HBASE-16549 we refactor split metric logic and start to use 
> \{{splitProcMetrics}} tracking the split behaivor
> h3. Current Behavior
> * Split request metrics (\{{splitRequestCount}}) are properly updated when a 
> split is initiated
> * Split success/time metrics at RegionServer and Table level were defined but 
> never called
> * Master-side split metrics (\{{splitSubmittedCount}}, \{{splitTime}}, 
> \{{splitFailedCount}}) are correctly maintained via the Procedure framework's 
> \{{getProcedureMetrics()}} mechanism
> h3. Changes Made
> * Removed \{{incrSplitSuccess()}} and \{{updateSplitTime()}} methods from 
> \{{MetricsRegionServerSource}} and \{{MetricsRegionServerSourceImpl}}, 
> corresponding methods from \{{MetricsTableSource}} and 
> \{{MetricsTableSourceImpl}}, and related metric definitions and histogram 
> variables
> * We keep \{{SplitRequest}} and support table-level updates for this metic to 
> track split caused by region size exceeds the configured max size
> * 
> \{code:language=|borderStyle=solid|theme=RDark|linenumbers=true|collapse=false}
> * CompactSplit.requestSplit()
> * → new SplitRequest(r, midKey, server, user)
> * → splits.execute(splitRequest)
> * → SplitRequest.run()
> * → doSplitting()
> * → server.getMetrics().incrSplitRequest(tableName)
> * \{code}
>  



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to