I am trying to identify an alert for an increasing Kafka Lag. The metrics 
are in prometheus and I use Grafana 9.4 for visualisation.

In Grafana I have a dasboard where I can see that the lag is increasing or 
decreasing. I have choosen to display the delta in the legend and it shows 
a number that does not really make sense, but it shows something.

This is the base promql:
avg by(consumergroup, topic, cluster) 
(kafka_consumergroup_lag{namespace="ns-kafka-int", 
consumergroup=~".*$container", cluster="$cluster"}) 
  
When I add delta to this query I get no data:
avg by(consumergroup, topic, cluster) 
(delta(kafka_consumergroup_lag{namespace="ns-kafka-int", 
consumergroup=~".*$container", cluster="$cluster"}[$__interval]))

So adding delta gives me no result even though when I exclude delta I can 
see that the metric has been changing.

The metric is a gaugue. 

Question:
Am I doing something wrong? 
Can I trust the delta from the Grafana dashboard(someone might know)? 
Can I expect a positive or negative value to identify and increase?

-- 
You received this message because you are subscribed to the Google Groups 
"Prometheus Users" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to prometheus-users+unsubscr...@googlegroups.com.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/prometheus-users/0f37aa72-dd90-4329-8e19-5587460b80c4n%40googlegroups.com.

Reply via email to