codelipenghui opened a new issue, #18963:
URL: https://github.com/apache/pulsar/issues/18963

   ### Search before asking
   
   - [X] I searched in the [issues](https://github.com/apache/pulsar/issues) 
and found nothing similar.
   
   
   ### Motivation
   
   Currently, we have topic load-related metrics like the followings:
   
   ```
   topic_load_times{cluster="standalone",quantile="0.5"} 140.0
   topic_load_times{cluster="standalone",quantile="0.75"} 183.0
   topic_load_times{cluster="standalone",quantile="0.95"} 249.0
   topic_load_times{cluster="standalone",quantile="0.99"} 249.0
   topic_load_times{cluster="standalone",quantile="0.999"} 249.0
   topic_load_times{cluster="standalone",quantile="0.9999"} 249.0
   topic_load_times_count{cluster="standalone"} 6.0
   topic_load_times_sum{cluster="standalone"} 955.0
   topic_load_times_created{cluster="standalone"} 1.671240308864E9
   ```
   
   But we are not able to detect if there are topics that failed to load due to
   zookeeper/bookkeeper problems.
   
   It's better to add new metrics for the topic load failed operation so that 
users
   can add alerts based on the metrics.
   
   ### Solution
   
   Add `topic_load_failed_count` metrics
   
   ### Alternatives
   
   _No response_
   
   ### Anything else?
   
   The metrics changes requires a proposal
   
   ### Are you willing to submit a PR?
   
   - [ ] I'm willing to submit a PR!


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to