If you're instrumenting your application with a prometheus client library, then you won't be generating a metric for every requests. Almost certainly you want to be using histograms: that is, you divide the response times into buckets at values that make sense for your use case (say 1ms, 5ms, 10ms, 50ms, 100ms) and increment a counter for the appropriate bucket.
It looks like the nginx ingress controller already has this feature, albeit with fixed bucket sizes: https://github.com/kubernetes/ingress-nginx/issues/3898 -- You received this message because you are subscribed to the Google Groups "Prometheus Users" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To view this discussion on the web visit https://groups.google.com/d/msgid/prometheus-users/ae9c7475-a67e-4e68-8168-283d54be2eba%40googlegroups.com.

