This is an automated email from the ASF dual-hosted git repository.
gongchao pushed a commit to branch master
in repository https://gitbox.apache.org/repos/asf/hertzbeat.git
The following commit(s) were added to refs/heads/master by this push:
new 4a3e27382 [feature] add Apache Hbase RegionServer monitoring (#1833)
4a3e27382 is described below
commit 4a3e27382c2111cbe700afea6ccf93b8855cfe06
Author: Jast <[email protected]>
AuthorDate: Thu Apr 25 14:30:39 2024 +0800
[feature] add Apache Hbase RegionServer monitoring (#1833)
Co-authored-by: zhangshenghang <[email protected]>
Co-authored-by: zhangshenghang <[email protected]>
Co-authored-by: tomsun28 <[email protected]>
---
home/docs/help/hbase_regionserver.md | 96 ++++
.../current/help/hbase_master.md | 2 +-
.../current/help/hbase_regionserver.md | 97 ++++
home/sidebars.json | 2 +
.../resources/define/app-hbase_regionserver.yml | 578 +++++++++++++++++++++
5 files changed, 774 insertions(+), 1 deletion(-)
diff --git a/home/docs/help/hbase_regionserver.md b/home/docs/help/hbase_regionserver.md
new file mode 100644
index 000000000..86d9c7e84
--- /dev/null
+++ b/home/docs/help/hbase_regionserver.md
@@ -0,0 +1,96 @@
+---
+id: hbase_regionserver
+title: Monitoring: HBase RegionServer
+sidebar_label: HBase RegionServer Monitoring
+keywords: [Open-source monitoring system, Open-source database monitoring, RegionServer monitoring]
+---
+> Collect and monitor common performance metrics for HBase RegionServer.
+
+**Protocol:** HTTP
+
+## Pre-Monitoring Operations
+
+Review the `hbase-site.xml` file to obtain the value of the `hbase.regionserver.info.port` configuration item, which is used for monitoring.
+
+## Configuration Parameters
+
+
+| Parameter Name      | Parameter Description |
+| ------------------- | --------------------- |
+| Target Host         | The IPv4 address, IPv6 address, or domain name of the monitored host. Note ⚠️ Do not include the protocol header (e.g., https://, http://). |
+| Port                | The port number of the HBase RegionServer, default is 16030, i.e., the value of the `hbase.regionserver.info.port` parameter. |
+| Task Name           | A unique name that identifies this monitoring task. |
+| Query Timeout       | The timeout for requests to the HBase RegionServer, in milliseconds. Default is 6000 ms. |
+| Collection Interval | The interval for periodic data collection, in seconds. The minimum interval is 30 seconds. |
+| Probe Before Adding | Whether to probe the availability of the target before adding the monitoring task; the task is added only if the probe succeeds. |
+| Description Note    | Additional notes that identify and describe this monitoring task. |
+
+### Collection Metrics
+
+> All metric names are taken directly from the official fields, so some names do not follow standard naming conventions.
+
+#### Metric Set: server
+
+
+| Metric Name                       | Unit  | Metric Description |
+| --------------------------------- | ----- | ------------------ |
+| regionCount                       | None  | Number of Regions |
+| readRequestCount                  | None  | Number of read requests since cluster restart |
+| writeRequestCount                 | None  | Number of write requests since cluster restart |
+| averageRegionSize                 | MB    | Average size of a Region |
+| totalRequestCount                 | None  | Total number of requests |
+| ScanTime_num_ops                  | None  | Total number of Scan requests |
+| Append_num_ops                    | None  | Total number of Append requests |
+| Increment_num_ops                 | None  | Total number of Increment requests |
+| Get_num_ops                       | None  | Total number of Get requests |
+| Delete_num_ops                    | None  | Total number of Delete requests |
+| Put_num_ops                       | None  | Total number of Put requests |
+| ScanTime_mean                     | None  | Average time of a Scan request |
+| ScanTime_min                      | None  | Minimum time of a Scan request |
+| ScanTime_max                      | None  | Maximum time of a Scan request |
+| ScanSize_mean                     | bytes | Average size of a Scan request |
+| ScanSize_min                      | bytes | Minimum size of a Scan request |
+| ScanSize_max                      | bytes | Maximum size of a Scan request |
+| slowPutCount                      | None  | Number of slow Put operations |
+| slowGetCount                      | None  | Number of slow Get operations |
+| slowAppendCount                   | None  | Number of slow Append operations |
+| slowIncrementCount                | None  | Number of slow Increment operations |
+| slowDeleteCount                   | None  | Number of slow Delete operations |
+| blockCacheSize                    | MB    | Size of memory used by the block cache |
+| blockCacheCount                   | None  | Number of blocks in the block cache |
+| blockCacheExpressHitPercent       | None  | Block cache hit ratio |
+| memStoreSize                      | None  | Size of the MemStore |
+| FlushTime_num_ops                 | None  | Number of MemStore flushes to disk by the RegionServer |
+| flushQueueLength                  | None  | Length of the Region flush queue |
+| flushedCellsSize                  | MB    | Size of data flushed to disk |
+| storeFileCount                    | None  | Number of StoreFiles |
+| storeCount                        | None  | Number of Stores |
+| storeFileSize                     | MB    | Size of StoreFiles |
+| compactionQueueLength             | None  | Length of the compaction queue |
+| percentFilesLocal                 | None  | Percentage of HFiles stored on the local HDFS DataNode |
+| percentFilesLocalSecondaryRegions | None  | Percentage of HFiles for secondary region replicas stored on the local HDFS DataNode |
+| hlogFileCount                     | None  | Number of WAL files |
+| hlogFileSize                      | None  | Size of WAL files |
+
+#### Metric Set: IPC
+
+
+| Metric Name               | Unit | Metric Description                             |
+| ------------------------- | ---- | ---------------------------------------------- |
+| numActiveHandler          | None | Number of active RPC handlers                  |
+| NotServingRegionException | None | Number of NotServingRegionException exceptions |
+| RegionMovedException      | None | Number of RegionMovedException exceptions      |
+| RegionTooBusyException    | None | Number of RegionTooBusyException exceptions    |
+
+#### Metric Set: JVM
+
+
+| Metric Name          | Unit | Metric Description                       |
+| -------------------- | ---- | ---------------------------------------- |
+| MemNonHeapUsedM      | MB   | Non-heap memory used by the process      |
+| MemNonHeapCommittedM | MB   | Non-heap memory committed by the process |
+| MemHeapUsedM         | MB   | Heap memory used by the process          |
+| MemHeapCommittedM    | MB   | Heap memory committed by the process     |
+| MemHeapMaxM          | MB   | Maximum heap memory of the process       |
+| MemMaxM              | MB   | Maximum memory of the process            |
+| GcCount              | None | Number of GC runs                        |
diff --git a/home/i18n/zh-cn/docusaurus-plugin-content-docs/current/help/hbase_master.md b/home/i18n/zh-cn/docusaurus-plugin-content-docs/current/help/hbase_master.md
index 79d5a7f9b..6d490ae80 100644
--- a/home/i18n/zh-cn/docusaurus-plugin-content-docs/current/help/hbase_master.md
+++ b/home/i18n/zh-cn/docusaurus-plugin-content-docs/current/help/hbase_master.md
@@ -1,7 +1,7 @@
---
id: hbase_master
title: 监控:Hbase Master监控
-sidebar_label: HbaseMaster监控
+sidebar_label: Apache Hbase Master
keywords: [开源监控系统, 开源数据库监控, HbaseMaster监控]
---
> 对Hbase Master的通用性能指标进行采集监控
diff --git a/home/i18n/zh-cn/docusaurus-plugin-content-docs/current/help/hbase_regionserver.md b/home/i18n/zh-cn/docusaurus-plugin-content-docs/current/help/hbase_regionserver.md
new file mode 100644
index 000000000..d0a1d1290
--- /dev/null
+++ b/home/i18n/zh-cn/docusaurus-plugin-content-docs/current/help/hbase_regionserver.md
@@ -0,0 +1,97 @@
+---
+id: hbase_regionserver
+title: 监控 Hbase RegionServer监控
+sidebar_label: Apache Hbase RegionServer
+keywords: [开源监控系统, 开源数据库监控, RegionServer监控]
+---
+> 对Hbase RegionServer的通用性能指标进行采集监控
+
+**使用协议:HTTP**
+
+## 监控前操作
+
+查看 `hbase-site.xml` 文件,获取 `hbase.regionserver.info.port` 配置项的值,该值用作监控使用。
+
+## 配置参数
+
+
+| 参数名称     | 参数帮助描述 |
+| ------------ | ------------ |
+| 目标Host     | 被监控的对端IPV4,IPV6或域名。注意⚠️不带协议头(eg: https://, http://)。 |
+| 端口         | hbase regionserver的端口号,默认为16030。即:`hbase.regionserver.info.port`参数值 |
+| 任务名称     | 标识此监控的名称,名称需要保证唯一性。 |
+| 查询超时时间 | 设置HBase RegionServer连接的超时时间,单位ms毫秒,默认6000毫秒。 |
+| 采集间隔     | 监控周期性采集数据间隔时间,单位秒,可设置的最小间隔为30秒 |
+| 是否探测     | 新增监控前是否先探测检查监控可用性,探测成功才会继续新增修改操作 |
+| 描述备注     | 更多标识和描述此监控的备注信息,用户可以在这里备注信息 |
+
+### 采集指标
+
+> 所有指标名称均直接引用官方的字段,所以存在命名不规范。
+
+#### 指标集合:server
+
+
+| 指标名称                          | 指标单位 | 指标帮助描述 |
+| --------------------------------- | -------- | ------------ |
+| regionCount                       | 无       | Region数量 |
+| readRequestCount                  | 无       | 重启集群后的读请求数量 |
+| writeRequestCount                 | 无       | 重启集群后的写请求数量 |
+| averageRegionSize                 | MB       | 平均Region大小 |
+| totalRequestCount                 | 无       | 全部请求数量 |
+| ScanTime_num_ops                  | 无       | Scan 请求总量 |
+| Append_num_ops                    | 无       | Append 请求量 |
+| Increment_num_ops                 | 无       | Increment 请求量 |
+| Get_num_ops                       | 无       | Get 请求量 |
+| Delete_num_ops                    | 无       | Delete 请求量 |
+| Put_num_ops                       | 无       | Put 请求量 |
+| ScanTime_mean                     | 无       | 平均 Scan 请求时间 |
+| ScanTime_min                      | 无       | 最小 Scan 请求时间 |
+| ScanTime_max                      | 无       | 最大 Scan 请求时间 |
+| ScanSize_mean                     | bytes    | 平均 Scan 请求大小 |
+| ScanSize_min                      | bytes    | 最小 Scan 请求大小 |
+| ScanSize_max                      | bytes    | 最大 Scan 请求大小 |
+| slowPutCount                      | 无       | 慢操作次数/Put |
+| slowGetCount                      | 无       | 慢操作次数/Get |
+| slowAppendCount                   | 无       | 慢操作次数/Append |
+| slowIncrementCount                | 无       | 慢操作次数/Increment |
+| slowDeleteCount                   | 无       | 慢操作次数/Delete |
+| blockCacheSize                    | MB       | 缓存块内存占用大小 |
+| blockCacheCount                   | 无       | 缓存块数量_Block Cache 中的 Block 数量 |
+| blockCacheExpressHitPercent       | 无       | 读缓存命中率 |
+| memStoreSize                      | 无       | Memstore 大小 |
+| FlushTime_num_ops                 | 无       | RS写磁盘次数/Memstore flush 写磁盘次数 |
+| flushQueueLength                  | 无       | Region Flush 队列长度 |
+| flushedCellsSize                  | MB       | flush到磁盘大小 |
+| storeFileCount                    | 无       | Storefile 个数 |
+| storeCount                        | 无       | Store 个数 |
+| storeFileSize                     | MB       | Storefile 大小 |
+| compactionQueueLength             | 无       | Compaction 队列长度 |
+| percentFilesLocal                 | 无       | Region 的 HFile 位于本地 HDFS Data Node的比例 |
+| percentFilesLocalSecondaryRegions | 无       | Region 副本的 HFile 位于本地 HDFS Data Node的比例 |
+| hlogFileCount                     | 无       | WAL 文件数量 |
+| hlogFileSize                      | 无       | WAL 文件大小 |
+
+#### 指标集合:IPC
+
+
+| 指标名称                  | 指标单位 | 指标帮助描述                       |
+| ------------------------- | -------- | ---------------------------------- |
+| numActiveHandler          | 无       | RPC句柄数                          |
+| NotServingRegionException | 无       | NotServingRegionException 异常数量 |
+| RegionMovedException      | 无       | RegionMovedException 异常数量      |
+| RegionTooBusyException    | 无       | RegionTooBusyException 异常数量    |
+
+#### 指标集合:JVM
+
+
+| 指标名称             | 指标单位 | 指标帮助描述               |
+| -------------------- | -------- | -------------------------- |
+| MemNonHeapUsedM      | MB       | 进程使用的非堆内存大小     |
+| MemNonHeapCommittedM | MB       | 进程 commit 的非堆内存大小 |
+| MemHeapUsedM         | MB       | 进程使用的堆内存大小       |
+| MemHeapCommittedM    | MB       | 进程 commit 的堆内存大小   |
+| MemHeapMaxM          | MB       | 进程最大的堆内存大小       |
+| MemMaxM              | MB       | 进程最大内存大小           |
+| GcCount              | 无       | GC 次数                    |
+
diff --git a/home/sidebars.json b/home/sidebars.json
index ee96933da..bc2aff9a6 100644
--- a/home/sidebars.json
+++ b/home/sidebars.json
@@ -210,6 +210,8 @@
"help/doris_be",
"help/doris_fe",
"help/hadoop",
+ "help/hbase_master",
+ "help/hbase_regionserver",
"help/iotdb",
"help/hive",
"help/airflow",
diff --git a/manager/src/main/resources/define/app-hbase_regionserver.yml b/manager/src/main/resources/define/app-hbase_regionserver.yml
new file mode 100644
index 000000000..9a1eb770c
--- /dev/null
+++ b/manager/src/main/resources/define/app-hbase_regionserver.yml
@@ -0,0 +1,578 @@
+# Licensed to the Apache Software Foundation (ASF) under one or more
+# contributor license agreements. See the NOTICE file distributed with
+# this work for additional information regarding copyright ownership.
+# The ASF licenses this file to You under the Apache License, Version 2.0
+# (the "License"); you may not use this file except in compliance with
+# the License. You may obtain a copy of the License at
+#
+# http://www.apache.org/licenses/LICENSE-2.0
+#
+# Unless required by applicable law or agreed to in writing, software
+# distributed under the License is distributed on an "AS IS" BASIS,
+# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+# See the License for the specific language governing permissions and
+# limitations under the License.
+
+# The monitoring type category:service-application service monitoring db-database monitoring custom-custom monitoring os-operating system monitoring
+category: bigdata
+# The monitoring type eg: linux windows tomcat mysql aws...
+app: hbase_regionserver
+# The monitoring i18n name
+name:
+ zh-CN: Apache Hbase RegionServer
+ en-US: Apache Hbase RegionServer
+# The description and help of this monitoring type
+help:
+ zh-CN: Hertzbeat 对 Hbase 数据库 RegionServer 节点监控指标进行监控。<br>您可以点击 “<i>新建 Apache Hbase RegionServer</i>” 并进行配置,或者选择“<i>更多操作</i>”,导入已有配置。
+ en-US: HertzBeat collects and monitors metrics from the RegionServer nodes of the HBase database. <br>You can click "<i>New Apache Hbase RegionServer</i>" to configure, or select "<i>More Actions</i>" to import an existing configuration.
+ zh-TW: Hertzbeat 對 Hbase 數據庫 RegionServer 節點監控指標進行監控。<br>您可以點擊 “<i>新建 Apache Hbase RegionServer</i>” 並進行配置,或者選擇“<i>更多操作</i>”,導入已有配置。
+
+helpLink:
+ zh-CN: https://hertzbeat.apache.org/zh-cn/docs/help/hbase_regionserver/
+ en-US: https://hertzbeat.apache.org/docs/help/hbase_regionserver/
+# Input params define for monitoring(render web ui by the definition)
+params:
+ # field-param field key
+ - field: host
+ # name-param field display i18n name
+ name:
+ zh-CN: 目标Host
+ en-US: Target Host
+ # type-param field type(most mapping the html input type)
+ type: host
+ # required-true or false
+ required: true
+ # field-param field key
+ - field: port
+ # name-param field display i18n name
+ name:
+ zh-CN: 端口
+ en-US: Port
+ # type-param field type(most mapping the html input type)
+ type: number
+ # when type is number, range is required
+ range: '[0,65535]'
+ # required-true or false
+ required: true
+ # default value
+ defaultValue: 16030
+ # field-param field key
+ - field: timeout
+ # name-param field display i18n name
+ name:
+ zh-CN: 查询超时时间
+ en-US: Query Timeout
+ # type-param field type(most mapping the html input type)
+ type: number
+ # required-true or false
+ required: false
+ # hide param-true or false
+ hide: true
+ # default value
+ defaultValue: 6000
+# collect metrics config list
+metrics:
+ # metrics - Server
+ - name: Server
+ # metrics scheduling priority(0->127)->(high->low), metrics with the same priority will be scheduled in parallel
+ # priority 0's metrics is availability metrics, it will be scheduled first, only availability metrics collect success will the scheduling continue
+ priority: 0
+ # collect metrics content
+ fields:
+ # field-metric name, type-metric type(0-number,1-string), unit-metric unit('%','ms','MB'), label-whether it is a metrics label field
+ - field: regionCount
+ type: 0
+ label: true
+ i18n:
+ zh-CN: Region数量
+ en-US: regionCount
+ # field-metric name, type-metric type(0-number,1-string), unit-metric unit('%','ms','MB'), label-whether it is a metrics label field
+ - field: readRequestCount
+ type: 0
+ label: true
+ i18n:
+ zh-CN: 重启集群后的读请求数量
+ en-US: readRequestCount
+ # field-metric name, type-metric type(0-number,1-string), unit-metric unit('%','ms','MB'), label-whether it is a metrics label field
+ - field: writeRequestCount
+ type: 0
+ label: true
+ i18n:
+ zh-CN: 重启集群后的写请求数量
+ en-US: writeRequestCount
+ # field-metric name, type-metric type(0-number,1-string), unit-metric unit('%','ms','MB'), label-whether it is a metrics label field
+ - field: averageRegionSize
+ type: 0
+ label: true
+ i18n:
+ zh-CN: 平均Region大小
+ en-US: averageRegionSize
+ unit: 'MB'
+ # field-metric name, type-metric type(0-number,1-string), unit-metric unit('%','ms','MB'), label-whether it is a metrics label field
+ - field: totalRequestCount
+ type: 0
+ label: true
+ i18n:
+ zh-CN: 全部请求数量
+ en-US: totalRequestCount
+ # field-metric name, type-metric type(0-number,1-string), unit-metric unit('%','ms','MB'), label-whether it is a metrics label field
+ - field: ScanTime_num_ops
+ type: 0
+ label: true
+ i18n:
+ zh-CN: Scan 请求总量
+ en-US: ScanTime_num_ops
+ # field-metric name, type-metric type(0-number,1-string), unit-metric unit('%','ms','MB'), label-whether it is a metrics label field
+ - field: Append_num_ops
+ type: 0
+ label: true
+ i18n:
+ zh-CN: Append 请求量
+ en-US: Append_num_ops
+ # field-metric name, type-metric type(0-number,1-string), unit-metric unit('%','ms','MB'), label-whether it is a metrics label field
+ - field: Increment_num_ops
+ type: 0
+ label: true
+ i18n:
+ zh-CN: Increment请求量
+ en-US: Increment_num_ops
+ # field-metric name, type-metric type(0-number,1-string), unit-metric unit('%','ms','MB'), label-whether it is a metrics label field
+ - field: Get_num_ops
+ type: 0
+ label: true
+ i18n:
+ zh-CN: Get 请求量
+ en-US: Get_num_ops
+ # field-metric name, type-metric type(0-number,1-string), unit-metric unit('%','ms','MB'), label-whether it is a metrics label field
+ - field: Delete_num_ops
+ type: 0
+ label: true
+ i18n:
+ zh-CN: Delete 请求量
+ en-US: Delete_num_ops
+ # field-metric name, type-metric type(0-number,1-string), unit-metric unit('%','ms','MB'), label-whether it is a metrics label field
+ - field: Put_num_ops
+ type: 0
+ label: true
+ i18n:
+ zh-CN: Put 请求量
+ en-US: Put_num_ops
+ # field-metric name, type-metric type(0-number,1-string), unit-metric unit('%','ms','MB'), label-whether it is a metrics label field
+ - field: ScanTime_mean
+ type: 0
+ label: true
+ i18n:
+ zh-CN: 平均 Scan 请求时间
+ en-US: ScanTime_mean
+ # field-metric name, type-metric type(0-number,1-string), unit-metric unit('%','ms','MB'), label-whether it is a metrics label field
+ - field: ScanTime_min
+ type: 0
+ label: true
+ i18n:
+ zh-CN: 最小 Scan 请求时间
+ en-US: ScanTime_min
+ # field-metric name, type-metric type(0-number,1-string), unit-metric unit('%','ms','MB'), label-whether it is a metrics label field
+ - field: ScanTime_max
+ type: 0
+ label: true
+ i18n:
+ zh-CN: 最大 Scan 请求时间
+ en-US: ScanTime_max
+ # field-metric name, type-metric type(0-number,1-string), unit-metric unit('%','ms','MB'), label-whether it is a metrics label field
+ - field: ScanSize_mean
+ type: 0
+ label: true
+ i18n:
+ zh-CN: 平均 Scan 请求大小
+ en-US: ScanSize_mean
+ # field-metric name, type-metric type(0-number,1-string), unit-metric unit('%','ms','MB'), label-whether it is a metrics label field
+ - field: ScanSize_min
+ type: 0
+ label: true
+ i18n:
+ zh-CN: 最小 Scan 请求大小
+ en-US: ScanSize_min
+ # field-metric name, type-metric type(0-number,1-string), unit-metric unit('%','ms','MB'), label-whether it is a metrics label field
+ - field: ScanSize_max
+ type: 0
+ label: true
+ i18n:
+ zh-CN: 最大 Scan 请求大小
+ en-US: ScanSize_max
+ # field-metric name, type-metric type(0-number,1-string), unit-metric unit('%','ms','MB'), label-whether it is a metrics label field
+ - field: slowPutCount
+ type: 0
+ label: true
+ i18n:
+ zh-CN: 慢操作次数/Put
+ en-US: slowPutCount
+ # field-metric name, type-metric type(0-number,1-string), unit-metric unit('%','ms','MB'), label-whether it is a metrics label field
+ - field: slowGetCount
+ type: 0
+ label: true
+ i18n:
+ zh-CN: 慢操作次数/Get
+ en-US: slowGetCount
+ # field-metric name, type-metric type(0-number,1-string), unit-metric unit('%','ms','MB'), label-whether it is a metrics label field
+ - field: slowAppendCount
+ type: 0
+ label: true
+ i18n:
+ zh-CN: 慢操作次数/Append
+ en-US: slowAppendCount
+ # field-metric name, type-metric type(0-number,1-string), unit-metric unit('%','ms','MB'), label-whether it is a metrics label field
+ - field: slowIncrementCount
+ type: 0
+ label: true
+ i18n:
+ zh-CN: 慢操作次数/Increment
+ en-US: slowIncrementCount
+ # field-metric name, type-metric type(0-number,1-string), unit-metric unit('%','ms','MB'), label-whether it is a metrics label field
+ - field: slowDeleteCount
+ type: 0
+ label: true
+ i18n:
+ zh-CN: 慢操作次数/Delete
+ en-US: slowDeleteCount
+ # field-metric name, type-metric type(0-number,1-string), unit-metric unit('%','ms','MB'), label-whether it is a metrics label field
+ - field: blockCacheSize
+ type: 0
+ label: true
+ unit: 'MB'
+ i18n:
+ zh-CN: 缓存块内存占用大小
+ en-US: blockCacheSize
+ # field-metric name, type-metric type(0-number,1-string), unit-metric unit('%','ms','MB'), label-whether it is a metrics label field
+ - field: blockCacheCount
+ type: 0
+ label: true
+ i18n:
+ zh-CN: 缓存块数量_Block Cache 中的 Block 数量
+ en-US: blockCacheCount
+ # field-metric name, type-metric type(0-number,1-string), unit-metric unit('%','ms','MB'), label-whether it is a metrics label field
+ - field: blockCacheExpressHitPercent
+ type: 0
+ label: true
+ i18n:
+ zh-CN: 读缓存命中率
+ en-US: blockCacheExpressHitPercent
+ # field-metric name, type-metric type(0-number,1-string), unit-metric unit('%','ms','MB'), label-whether it is a metrics label field
+ - field: memStoreSize
+ type: 0
+ label: true
+ i18n:
+ zh-CN: Memstore 大小
+ en-US: memStoreSize
+ # field-metric name, type-metric type(0-number,1-string), unit-metric unit('%','ms','MB'), label-whether it is a metrics label field
+ - field: FlushTime_num_ops
+ type: 0
+ label: true
+ i18n:
+ zh-CN: RS写磁盘次数/MemStore Flush 写磁盘次数
+ en-US: FlushTime_num_ops
+ # field-metric name, type-metric type(0-number,1-string), unit-metric unit('%','ms','MB'), label-whether it is a metrics label field
+ - field: flushQueueLength
+ type: 0
+ label: true
+ i18n:
+ zh-CN: Region Flush 队列长度
+ en-US: flushQueueLength
+ # field-metric name, type-metric type(0-number,1-string), unit-metric unit('%','ms','MB'), label-whether it is a metrics label field
+ - field: flushedCellsSize
+ type: 0
+ label: true
+ unit: 'MB'
+ i18n:
+ zh-CN: flush到磁盘大小
+ en-US: flushedCellsSize
+ # field-metric name, type-metric type(0-number,1-string), unit-metric unit('%','ms','MB'), label-whether it is a metrics label field
+ - field: storeCount
+ type: 0
+ label: true
+ i18n:
+ zh-CN: Store 个数
+ en-US: storeCount
+ # field-metric name, type-metric type(0-number,1-string), unit-metric unit('%','ms','MB'), label-whether it is a metrics label field
+ - field: storeFileCount
+ type: 0
+ label: true
+ i18n:
+ zh-CN: Storefile 个数
+ en-US: storeFileCount
+ # field-metric name, type-metric type(0-number,1-string), unit-metric unit('%','ms','MB'), label-whether it is a metrics label field
+ - field: storeFileSize
+ type: 0
+ label: true
+ unit: 'MB'
+ i18n:
+ zh-CN: Storefile 大小
+ en-US: storeFileSize
+ # field-metric name, type-metric type(0-number,1-string), unit-metric unit('%','ms','MB'), label-whether it is a metrics label field
+ - field: compactionQueueLength
+ type: 0
+ label: true
+ i18n:
+ zh-CN: Compaction 队列长度
+ en-US: compactionQueueLength
+ # field-metric name, type-metric type(0-number,1-string), unit-metric unit('%','ms','MB'), label-whether it is a metrics label field
+ - field: percentFilesLocal
+ type: 0
+ label: true
+ i18n:
+ zh-CN: Region 的 HFile 位于本地 HDFS data node的比例
+ en-US: percentFilesLocal
+ # field-metric name, type-metric type(0-number,1-string), unit-metric unit('%','ms','MB'), label-whether it is a metrics label field
+ - field: percentFilesLocalSecondaryRegions
+ type: 0
+ label: true
+ i18n:
+ zh-CN: Region 副本的 HFile 位于本地 HDFS data node的比例
+ en-US: percentFilesLocalSecondaryRegions
+ # field-metric name, type-metric type(0-number,1-string), unit-metric unit('%','ms','MB'), label-whether it is a metrics label field
+ - field: hlogFileCount
+ type: 0
+ label: true
+ i18n:
+ zh-CN: WAL 文件数量
+ en-US: hlogFileCount
+ # field-metric name, type-metric type(0-number,1-string), unit-metric unit('%','ms','MB'), label-whether it is a metrics label field
+ - field: hlogFileSize
+ type: 0
+ label: true
+ i18n:
+ zh-CN: WAL 文件大小
+ en-US: hlogFileSize
+ # (optional)metrics field alias name, it is used as an alias field to map and convert the collected data and metrics field
+ aliasFields:
+ - $.regionCount
+ - $.readRequestCount
+ - $.writeRequestCount
+ - $.averageRegionSize
+ - $.totalRequestCount
+ - $.ScanTime_num_ops
+ - $.Append_num_ops
+ - $.Increment_num_ops
+ - $.Get_num_ops
+ - $.Delete_num_ops
+ - $.Put_num_ops
+ - $.ScanTime_mean
+ - $.ScanTime_min
+ - $.ScanTime_max
+ - $.ScanSize_mean
+ - $.ScanSize_min
+ - $.ScanSize_max
+ - $.slowPutCount
+ - $.slowGetCount
+ - $.slowAppendCount
+ - $.slowIncrementCount
+ - $.slowDeleteCount
+ - $.blockCacheSize
+ - $.blockCacheCount
+ - $.blockCacheExpressHitPercent
+ - $.memStoreSize
+ - $.FlushTime_num_ops
+ - $.flushQueueLength
+ - $.flushedCellsSize
+ - $.storeCount
+ - $.storeFileCount
+ - $.storeFileSize
+ - $.compactionQueueLength
+ - $.percentFilesLocal
+ - $.percentFilesLocalSecondaryRegions
+ - $.hlogFileCount
+ - $.hlogFileSize
+ calculates:
+ - regionCount=$.regionCount
+ - readRequestCount=$.readRequestCount
+ - writeRequestCount=$.writeRequestCount
+ - averageRegionSize=$.averageRegionSize
+ - totalRequestCount=$.totalRequestCount
+ - ScanTime_num_ops=$.ScanTime_num_ops
+ - Append_num_ops=$.Append_num_ops
+ - Increment_num_ops=$.Increment_num_ops
+ - Get_num_ops=$.Get_num_ops
+ - Delete_num_ops=$.Delete_num_ops
+ - Put_num_ops=$.Put_num_ops
+ - ScanTime_mean=$.ScanTime_mean
+ - ScanTime_min=$.ScanTime_min
+ - ScanTime_max=$.ScanTime_max
+ - ScanSize_mean=$.ScanSize_mean
+ - ScanSize_min=$.ScanSize_min
+ - ScanSize_max=$.ScanSize_max
+ - slowPutCount=$.slowPutCount
+ - slowGetCount=$.slowGetCount
+ - slowAppendCount=$.slowAppendCount
+ - slowIncrementCount=$.slowIncrementCount
+ - slowDeleteCount=$.slowDeleteCount
+ - blockCacheSize=$.blockCacheSize
+ - blockCacheCount=$.blockCacheCount
+ - blockCacheExpressHitPercent=$.blockCacheExpressHitPercent
+ - memStoreSize=$.memStoreSize
+ - FlushTime_num_ops=$.FlushTime_num_ops
+ - flushQueueLength=$.flushQueueLength
+ - flushedCellsSize=$.flushedCellsSize
+ - storeCount=$.storeCount
+ - storeFileCount=$.storeFileCount
+ - storeFileSize=$.storeFileSize
+ - compactionQueueLength=$.compactionQueueLength
+ - percentFilesLocal=$.percentFilesLocal
+ - percentFilesLocalSecondaryRegions=$.percentFilesLocalSecondaryRegions
+ - hlogFileCount=$.hlogFileCount
+ - hlogFileSize=$.hlogFileSize
+ units:
+ - averageRegionSize=B->MB
+ - blockCacheSize=B->MB
+ - storeFileSize=B->MB
+ - flushedCellsSize=B->MB
+ protocol: http
+ http:
+ host: ^_^host^_^
+ port: ^_^port^_^
+ url: /jmx
+ method: GET
+ ssl: ^_^ssl^_^
+ parseType: jsonPath
+ parseScript: '$.beans[?(@.name == "Hadoop:service=HBase,name=RegionServer,sub=Server")]'
+ # metrics - IPC
+ - name: IPC
+ # metrics scheduling priority(0->127)->(high->low), metrics with the same priority will be scheduled in parallel
+ # priority 0's metrics is availability metrics, it will be scheduled first, only availability metrics collect success will the scheduling continue
+ priority: 0
+ # collect metrics content
+ fields:
+ # field-metric name, type-metric type(0-number,1-string), unit-metric unit('%','ms','MB'), label-whether it is a metrics label field
+ - field: numActiveHandler
+ type: 0
+ label: true
+ i18n:
+ zh-CN: RPC句柄数
+ en-US: numActiveHandler
+ # field-metric name, type-metric type(0-number,1-string), unit-metric unit('%','ms','MB'), label-whether it is a metrics label field
+ - field: NotServingRegionException
+ type: 0
+ label: true
+ i18n:
+ zh-CN: NotServingRegionException 异常数量
+ en-US: NotServingRegionException
+ # field-metric name, type-metric type(0-number,1-string), unit-metric unit('%','ms','MB'), label-whether it is a metrics label field
+ - field: RegionMovedException
+ type: 0
+ label: true
+ i18n:
+ zh-CN: RegionMovedException异常数量
+ en-US: RegionMovedException
+ # field-metric name, type-metric type(0-number,1-string), unit-metric unit('%','ms','MB'), label-whether it is a metrics label field
+ - field: RegionTooBusyException
+ type: 0
+ label: true
+ i18n:
+ zh-CN: RegionTooBusyException异常数量
+ en-US: RegionTooBusyException
+ # (optional)metrics field alias name, it is used as an alias field to map and convert the collected data and metrics field
+ aliasFields:
+ - $.numActiveHandler
+ - $.['exceptions.NotServingRegionException']
+ - $.['exceptions.RegionMovedException']
+ - $.['exceptions.RegionTooBusyException']
+ calculates:
+ - numActiveHandler=$.numActiveHandler
+ - NotServingRegionException=#`$.['exceptions.NotServingRegionException']`
+ - RegionMovedException=#`$.['exceptions.RegionMovedException']`
+ - RegionTooBusyException=#`$.['exceptions.RegionTooBusyException']`
+ protocol: http
+ http:
+ host: ^_^host^_^
+ port: ^_^port^_^
+ url: /jmx
+ method: GET
+ ssl: ^_^ssl^_^
+ parseType: jsonPath
+ parseScript: '$.beans[?(@.name == "Hadoop:service=HBase,name=RegionServer,sub=IPC")]'
+ # metrics - JVM
+ - name: JVM
+ # metrics scheduling priority(0->127)->(high->low), metrics with the same priority will be scheduled in parallel
+ # priority 0's metrics is availability metrics, it will be scheduled first, only availability metrics collect success will the scheduling continue
+ priority: 0
+ # collect metrics content
+ fields:
+ # field-metric name, type-metric type(0-number,1-string), unit-metric unit('%','ms','MB'), label-whether it is a metrics label field
+ - field: MemNonHeapUsedM
+ type: 0
+ label: true
+ unit: 'MB'
+ i18n:
+ zh-CN: 进程使用的非堆内存大小
+ en-US: MemNonHeapUsedM
+ # field-metric name, type-metric type(0-number,1-string), unit-metric unit('%','ms','MB'), label-whether it is a metrics label field
+ - field: MemNonHeapCommittedM
+ type: 0
+ label: true
+ unit: 'MB'
+ i18n:
+ zh-CN: 进程 commit 的非堆内存大小
+ en-US: MemNonHeapCommittedM
+ # field-metric name, type-metric type(0-number,1-string), unit-metric unit('%','ms','MB'), label-whether it is a metrics label field
+ - field: MemHeapUsedM
+ type: 0
+ label: true
+ unit: 'MB'
+ i18n:
+ zh-CN: 进程使用的堆内存大小
+ en-US: MemHeapUsedM
+ # field-metric name, type-metric type(0-number,1-string), unit-metric unit('%','ms','MB'), label-whether it is a metrics label field
+ - field: MemHeapCommittedM
+ type: 0
+ label: true
+ unit: 'MB'
+ i18n:
+ zh-CN: 进程 commit 的堆内存大小
+ en-US: MemHeapCommittedM
+ # field-metric name, type-metric type(0-number,1-string), unit-metric unit('%','ms','MB'), label-whether it is a metrics label field
+ - field: MemHeapMaxM
+ type: 0
+ label: true
+ unit: 'MB'
+ i18n:
+ zh-CN: 进程最大的堆内存大小
+ en-US: MemHeapMaxM
+ # field-metric name, type-metric type(0-number,1-string), unit-metric unit('%','ms','MB'), label-whether it is a metrics label field
+ - field: MemMaxM
+ type: 0
+ label: true
+ unit: 'MB'
+ i18n:
+ zh-CN: 进程最大内存大小
+ en-US: MemMaxM
+ # field-metric name, type-metric type(0-number,1-string), unit-metric unit('%','ms','MB'), label-whether it is a metrics label field
+ - field: GcCount
+ type: 0
+ label: true
+ i18n:
+ zh-CN: Young GC次数
+ en-US: GcCount
+ # (optional)metrics field alias name, it is used as an alias field to map and convert the collected data and metrics field
+ aliasFields:
+ - $.MemNonHeapUsedM
+ - $.MemNonHeapCommittedM
+ - $.MemHeapUsedM
+ - $.MemHeapCommittedM
+ - $.MemHeapMaxM
+ - $.MemMaxM
+ - $.GcCount
+ calculates:
+ - MemNonHeapUsedM=$.MemNonHeapUsedM
+ - MemNonHeapCommittedM=$.MemNonHeapCommittedM
+ - MemHeapUsedM=$.MemHeapUsedM
+ - MemHeapCommittedM=$.MemHeapCommittedM
+ - MemHeapMaxM=$.MemHeapMaxM
+ - MemMaxM=$.MemMaxM
+ - GcCount=$.GcCount
+ protocol: http
+ http:
+ host: ^_^host^_^
+ port: ^_^port^_^
+ url: /jmx
+ method: GET
+ ssl: ^_^ssl^_^
+ parseType: jsonPath
+ parseScript: '$.beans[?(@.name == "Hadoop:service=HBase,name=RegionServer,sub=JvmMetrics")]'
\ No newline at end of file
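
For readers unfamiliar with the template, the following is a rough Python sketch of what each `parseScript` filter in `app-hbase_regionserver.yml` does conceptually: take the JSON document served at `/jmx`, select the single entry of `beans` whose `name` matches, and read the configured fields from it. This is not HertzBeat's collector code. The helper names (`select_bean`, `pick_fields`, `fetch_jmx`) and the sample payload are assumptions made up for illustration; only the bean names, the `/jmx` path, the default port 16030, and the field names come from the commit.

```python
import json
from urllib.request import urlopen

# Bean names copied from the parseScript filters in the template.
SERVER_BEAN = "Hadoop:service=HBase,name=RegionServer,sub=Server"
IPC_BEAN = "Hadoop:service=HBase,name=RegionServer,sub=IPC"


def select_bean(payload, bean_name):
    """Return the first entry of payload["beans"] whose "name" matches, else None."""
    for bean in payload.get("beans", []):
        if bean.get("name") == bean_name:
            return bean
    return None


def pick_fields(bean, fields):
    """Keep only the requested keys; keys missing from the bean map to None."""
    return {field: bean.get(field) for field in fields}


def fetch_jmx(host, port=16030, timeout_s=6.0):
    """Hypothetical helper: GET http://host:port/jmx and parse the JSON body."""
    with urlopen(f"http://{host}:{port}/jmx", timeout=timeout_s) as resp:
        return json.load(resp)


# Made-up sample shaped like a /jmx response; the values are illustrative only.
sample = {
    "beans": [
        {"name": SERVER_BEAN, "regionCount": 12, "storeFileCount": 30},
        {"name": IPC_BEAN, "numActiveHandler": 3,
         "exceptions.NotServingRegionException": 0},
    ]
}

server = pick_fields(select_bean(sample, SERVER_BEAN),
                     ["regionCount", "storeFileCount"])
ipc = select_bean(sample, IPC_BEAN)
print(server)
print(ipc["exceptions.NotServingRegionException"])
```

Against a live RegionServer, `fetch_jmx("<host>")` would take the place of `sample`. The IPC exception counters contain a dot in their key, which is why the template addresses them with bracket notation (`$.['exceptions.NotServingRegionException']`) rather than dotted paths.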