Joerg Hoh created SLING-11192:
---------------------------------

             Summary: Calculating metrics takes too long
                 Key: SLING-11192
                 URL: https://issues.apache.org/jira/browse/SLING-11192
             Project: Sling
          Issue Type: Improvement
          Components: Event
    Affects Versions: Event 4.2.24
            Reporter: Joerg Hoh


we use the prometheus exporter to export Sling Metrics / Dropwizard metrics, 
and we often see messages like this:

{noformat}
10.03.2022 08:50:15.333 [...] *WARN* [qtp568481508-1779] 
io.prometheus.client.dropwizard.DropwizardExports Gauge has been blacklisted 
for 300000 ms due timeout:  Generated from Dropwizard metric import 
(metric=sling_event.jobs.cancelled.count, 
type=org.apache.sling.event.impl.jobs.stats.GaugeSupport$2) 
{noformat}

This means that calculating the metric took too long. We should make sure that 
the calculation is done asnychronously and just pre-computed values are 
returned.

For at least these values the handling needs to be improved:
* sling_event.jobs.active.count
* sling_event.jobs.averageProcessingTime
* sling_event.jobs.averageWaitingTime
* sling_event.jobs.cancelled.count






--
This message was sent by Atlassian Jira
(v8.20.1#820001)

Reply via email to