[ 
https://issues.apache.org/jira/browse/FLINK-38961?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Liu updated FLINK-38961:
------------------------
    Description: 
h1. Summary

Add a new "Process Usage" panel on the TaskManager Metrics page of the Flink 
Web UI to display process-level metrics, including CPU usage, memory (RSS), and 
I/O statistics.
h1. Motivation

Currently, the TaskManager Metrics page in Flink Web UI only displays JVM and 
Flink-managed memory metrics. However, users often need to monitor 
process-level resource consumption to better understand the actual resource 
usage of TaskManagers. 

When `metrics.system-resource` is enabled, Flink collects process-level metrics 
such as:
 - `Process.CPU.Usage` - CPU usage percentage of the process
 - `Process.Memory.RSS` - Resident Set Size (physical memory used by the 
process)
 - `Process.IO.Read` / `Process.IO.Write` - I/O read and write bytes

These metrics are already available through the REST API but are not displayed 
in the Web UI, making it inconvenient for users to monitor them.
h1. Proposed Changes

1. Add a "Process Usage" card on the TaskManager Metrics page 
(`task-manager-metrics.component.html`) displaying:
   - CPU: Process CPU usage percentage
   - Memory: Process RSS (Resident Set Size)
   - I/O: Combined read and write I/O bytes

2. Extend the metrics query in `task-manager-metrics.component.ts` to include:
   - `Process.CPU.Usage`
   - `Process.Memory.RSS`
   - `Process.IO.Read`
   - `Process.IO.Write`
h1. Prerequisites

Users need to enable system resource metrics by setting 
`metrics.system-resource: true` in the Flink configuration (it is disabled by 
default). If this option is not enabled, the process metrics will show as 
empty/zero.
h1. UI Mockup

The new "Process Usage" panel will be placed at the top of the TaskManager 
Metrics page, showing three columns:
 - CPU (percentage with 6 decimal precision)
 - Memory (humanized bytes format)
 - I/O (sum of read and write bytes, humanized)

h1. Related Documentation
 - [System Resource 
Metrics]([https://nightlies.apache.org/flink/flink-docs-stable/docs/ops/metrics/#system-resources])

  was:
h1. Summary

Add a new "Process Usage" panel on the TaskManager Metrics page of the Flink 
Web UI to display process-level metrics, including CPU usage, memory (RSS), and 
I/O statistics.
h1. Motivation

Currently, the TaskManager Metrics page in Flink Web UI only displays JVM and 
Flink-managed memory metrics. However, users often need to monitor 
process-level resource consumption to better understand the actual resource 
usage of TaskManagers. 

When `metrics.system-resource` is enabled, Flink collects process-level metrics 
such as:
 - `Process.CPU.Usage` - CPU usage percentage of the process
 - `Process.Memory.RSS` - Resident Set Size (physical memory used by the 
process)
 - `Process.IO.Read` / `Process.IO.Write` - I/O read and write bytes

These metrics are already available through the REST API but are not displayed 
in the Web UI, making it inconvenient for users to monitor them.
h1. Proposed Changes

1. Add a "Process Usage" card{{*}}{*} on the TaskManager Metrics page 
(`task-manager-metrics.component.html`) displaying:
   - CPU: Process CPU usage percentage
   - Memory: Process RSS (Resident Set Size)
   - I/O: Combined read and write I/O bytes

2. Extend the metrics query in `task-manager-metrics.component.ts` to include:
   - `Process.CPU.Usage`
   - `Process.Memory.RSS`
   - `Process.IO.Read`
   - `Process.IO.Write`
h1. Prerequisites

Users need to enable system resource metrics by setting 
`metrics.system-resource: true` in the Flink configuration (it is disabled by 
default). If this option is not enabled, the process metrics will show as 
empty/zero.
h1. UI Mockup

The new "Process Usage" panel will be placed at the top of the TaskManager 
Metrics page, showing three columns:
 - CPU (percentage with 6 decimal precision)
 - Memory (humanized bytes format)
 - I/O (sum of read and write bytes, humanized)

h1. Related Documentation
 - [System Resource 
Metrics]([https://nightlies.apache.org/flink/flink-docs-stable/docs/ops/metrics/#system-resources])


> Display process metrics (CPU, Memory, I/O) on TaskManager Web UI
> ----------------------------------------------------------------
>
>                 Key: FLINK-38961
>                 URL: https://issues.apache.org/jira/browse/FLINK-38961
>             Project: Flink
>          Issue Type: Improvement
>          Components: Runtime / Web Frontend
>            Reporter: Liu
>            Priority: Major
>
> h1. Summary
> Add a new "Process Usage" panel on the TaskManager Metrics page of the Flink 
> Web UI to display process-level metrics, including CPU usage, memory (RSS), 
> and I/O statistics.
> h1. Motivation
> Currently, the TaskManager Metrics page in Flink Web UI only displays JVM and 
> Flink-managed memory metrics. However, users often need to monitor 
> process-level resource consumption to better understand the actual resource 
> usage of TaskManagers. 
> When `metrics.system-resource` is enabled, Flink collects process-level 
> metrics such as:
>  - `Process.CPU.Usage` - CPU usage percentage of the process
>  - `Process.Memory.RSS` - Resident Set Size (physical memory used by the 
> process)
>  - `Process.IO.Read` / `Process.IO.Write` - I/O read and write bytes
> These metrics are already available through the REST API but are not 
> displayed in the Web UI, making it inconvenient for users to monitor them.
> h1. Proposed Changes
> 1. Add a "Process Usage" card on the TaskManager Metrics page 
> (`task-manager-metrics.component.html`) displaying:
>    - CPU: Process CPU usage percentage
>    - Memory: Process RSS (Resident Set Size)
>    - I/O: Combined read and write I/O bytes
> 2. Extend the metrics query in `task-manager-metrics.component.ts` to include:
>    - `Process.CPU.Usage`
>    - `Process.Memory.RSS`
>    - `Process.IO.Read`
>    - `Process.IO.Write`
> h1. Prerequisites
> Users need to enable system resource metrics by setting 
> `metrics.system-resource: true` in the Flink configuration (it is disabled by 
> default). If this option is not enabled, the process metrics will show as 
> empty/zero.
> h1. UI Mockup
> The new "Process Usage" panel will be placed at the top of the TaskManager 
> Metrics page, showing three columns:
>  - CPU (percentage with 6 decimal precision)
>  - Memory (humanized bytes format)
>  - I/O (sum of read and write bytes, humanized)
> h1. Related Documentation
>  - [System Resource 
> Metrics]([https://nightlies.apache.org/flink/flink-docs-stable/docs/ops/metrics/#system-resources])



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to