GitHub user socutes edited a discussion: proxy module monitoring indicators are improved
Current proxy module monitoring indicators are not perfect, need to improve the monitoring indicators for connection, time, thread several dimensions. It is recommended to add the following indicators: 1. Number of messages produced/consumed 2. Production/consumption message size / 3. Time spent on each interface request 4. Statistics on error codes of each interface 5. The number of calls to each interface 6. TCP(GRPC/Remoting) connection count statistics 7. Thread pool monitoring (total capacity, used capacity, free capacity of each thread pool) monitoring Added runtime metrics: 1. Process start time: Records the process start time 2. Process status: indicates that the current process is started or stopped 3. Current Cluster mode (Cluster/Local) : You can remotely view the cluster running mode 4. Configuration currently used by the process: The configuration items that are currently running can be obtained remotely Indicators of node dimension: Type | Name | Unit | Description | Label -- | -- | -- | -- | -- counter | rocketmq_messages_in_total | count | The number of messages that are produced. | cluster,node_type,node_id counter | rocketmq_messages_out_total | count | The number of messages that are consumed. | cluster,node_type,node_id counter | rocketmq_throughput_in_total | byte | The write throughput that are produced. | cluster,node_type,node_id counter | rocketmq_throughput_out_total | byte | The read throughput that are produced. | cluster,node_type,node_id histogram | rocketmq_message_size | byte | The distribution of message sizes. This metric is counted only when messages are sent. The following shows the distribution ranges:le_1_kb: ≤ 1 KBle_4_kb: ≤ 4 KBle_512_kb: ≤ 512 KBle_1_mb: ≤ 1 MBle_2_mb: ≤ 2 MBle_4_mb: ≤ 4 MBle_overflow: > 4 MB | cluster,node_type,node_id histogram | rocketmq_rpc_latency | millisecond | The rpc call latency | cluster,node_typ,node_id, request_code counter |rocketmq_response_code_total| count |The interface returned error code statistics | cluster,node_typ,node_id, response_code counter |rocketmq_request_total| count | Statistics on error codes of each interface| cluster,node_typ,node_id, request_code counter |rocketmq_connect_total| count | Number of node connections| cluster,node_typ,node_id,thread_type counter |rocketmq_thread_total| count | Total thread pool capacity | cluster,node_typ,node_id,thread_type counter |rocketmq_thread_free_total| count | Free capacity of the thread pool | cluster,node_typ,node_id,thread_type gauge | roceketmq_runtime_status | value | Process running status | cluster,node_typ,node_id gauge |rocketmq_runtime_starttime | value | Process start time | cluster,node_typ,node_id gauge | rocketmq_runtime_mode | value | proxy running mode | cluster,node_typ,node_id gauge |rocketmq_runtime_config | value | The configuration used by the process running | cluster,node_typ,node_id GitHub link: https://github.com/apache/rocketmq/discussions/6405 ---- This is an automatically sent email for dev@rocketmq.apache.org. To unsubscribe, please send an email to: dev-unsubscr...@rocketmq.apache.org