bhattmanish98 opened a new pull request, #8137: URL: https://github.com/apache/hadoop/pull/8137
This PR introduces a centralized Aggregated Metrics Manager and defines the conditions under which individual file systems emit their aggregated metrics.

Key Changes

1. **Criteria for Emitting Aggregated Metrics**

   Aggregated metrics are emitted when any of the following conditions is met:

   - _Time-based interval_ - Each file system emits its collected metrics at a fixed interval. After emission, metric collection is reset.
   - _Threshold-based emission_ - A scheduler runs at regular intervals to check whether the total number of operations has exceeded a configured threshold. This prevents the aggregated metrics string from growing too large to be safely sent as an HTTP request header. If the threshold is reached, the collected metrics are emitted immediately, and metric collection is reset.
   - _Idle-period emission_ - If a file system remains idle for a configured duration, any accumulated metrics are emitted, and metric collection is reset.
   - _File system close_ - When a file system is closed, all remaining collected metrics are emitted to ensure no data is lost.

2. **Centralized Metrics Management**

   All file systems now push their aggregated metrics to a shared Aggregated Metrics Manager. This manager evaluates the configured emission criteria and determines whether metrics should be emitted immediately or deferred until a later time. The manager also rate-limits the number of metrics calls per second.
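
The four emission criteria can be illustrated with a minimal sketch. This is not the code from the PR: the class name `EmissionPolicySketch`, the three configuration parameters, the order in which the conditions are checked, and the choice to skip emission when nothing has been collected are all assumptions made for illustration. Timestamps are passed in explicitly so the behaviour can be followed without a real scheduler.

```java
import java.util.ArrayList;
import java.util.List;

/** Hypothetical sketch of the emission criteria; names are not taken from the PR. */
class EmissionPolicySketch {
    enum Reason { NONE, THRESHOLD, INTERVAL, IDLE, CLOSE }

    private final long emitIntervalMs; // assumed config: fixed emission interval
    private final long idleTimeoutMs;  // assumed config: idle period before a flush
    private final long opThreshold;    // assumed config: operation count that forces an emit

    private long opCount = 0;          // operations collected since the last emission
    private long lastEmitMs;
    private long lastOpMs;
    private final List<String> emitted = new ArrayList<>();

    EmissionPolicySketch(long emitIntervalMs, long idleTimeoutMs, long opThreshold, long nowMs) {
        this.emitIntervalMs = emitIntervalMs;
        this.idleTimeoutMs = idleTimeoutMs;
        this.opThreshold = opThreshold;
        this.lastEmitMs = nowMs;
        this.lastOpMs = nowMs;
    }

    /** Records one file system operation. */
    synchronized void recordOperation(long nowMs) {
        opCount++;
        lastOpMs = nowMs;
    }

    /** Intended to be called by a periodic scheduler; returns the reason for emitting, or NONE. */
    synchronized Reason check(long nowMs) {
        if (opCount == 0) {
            return Reason.NONE; // assumption: nothing collected, so nothing to emit
        }
        Reason reason = Reason.NONE;
        if (opCount >= opThreshold) {
            reason = Reason.THRESHOLD;
        } else if (nowMs - lastEmitMs >= emitIntervalMs) {
            reason = Reason.INTERVAL;
        } else if (nowMs - lastOpMs >= idleTimeoutMs) {
            reason = Reason.IDLE;
        }
        if (reason != Reason.NONE) {
            emit(reason, nowMs);
        }
        return reason;
    }

    /** Flushes whatever is left when the file system is closed. */
    synchronized Reason close(long nowMs) {
        if (opCount == 0) {
            return Reason.NONE;
        }
        emit(Reason.CLOSE, nowMs);
        return Reason.CLOSE;
    }

    /** Stand-in for the real emission: records "REASON:count" and resets collection. */
    private void emit(Reason reason, long nowMs) {
        emitted.add(reason + ":" + opCount);
        opCount = 0;
        lastEmitMs = nowMs;
    }

    synchronized List<String> emitted() {
        return new ArrayList<>(emitted);
    }
}
```

In this sketch every emission path resets the operation count, mirroring the "metric collection is reset" wording above. A real implementation would presumably drive `check` from a scheduled task rather than from explicit timestamps.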

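The "emit now or defer" decision of the shared manager, together with the per-second rate limit, might look roughly like the following. The PR description does not say how the rate limit is implemented; this sketch assumes a simple fixed one-second window, and the names `MetricsManagerSketch`, `maxCallsPerSecond`, `push`, and `drain` are hypothetical.

```java
import java.util.ArrayDeque;
import java.util.ArrayList;
import java.util.Deque;
import java.util.List;

/** Hypothetical sketch of a shared manager with a fixed-window rate limit. */
class MetricsManagerSketch {
    private final int maxCallsPerSecond;                      // assumed config name
    private final Deque<String> pending = new ArrayDeque<>(); // deferred metrics
    private final List<String> sent = new ArrayList<>();      // stand-in for real metrics calls
    private long windowStartMs;
    private int callsInWindow = 0;

    MetricsManagerSketch(int maxCallsPerSecond, long nowMs) {
        this.maxCallsPerSecond = maxCallsPerSecond;
        this.windowStartMs = nowMs;
    }

    /** File systems push their aggregated metrics here; they are sent now or deferred. */
    synchronized void push(String metrics, long nowMs) {
        pending.addLast(metrics);
        drain(nowMs);
    }

    /** Sends as many pending entries as the current one-second window still allows. */
    synchronized void drain(long nowMs) {
        if (nowMs - windowStartMs >= 1000) {
            // A new window starts, so the call budget is restored.
            windowStartMs = nowMs;
            callsInWindow = 0;
        }
        while (!pending.isEmpty() && callsInWindow < maxCallsPerSecond) {
            sent.add(pending.pollFirst());
            callsInWindow++;
        }
    }

    synchronized List<String> sent() {
        return new ArrayList<>(sent);
    }

    synchronized int pendingCount() {
        return pending.size();
    }
}
```

Deferred entries stay queued in arrival order until a later `drain` call falls into a fresh window, so no pushed metrics are dropped in this sketch. Whether the actual manager queues, merges, or drops excess metrics is not stated in the description.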