Re: [prometheus-users] Question regarding Open Metrics format and how to handle errors

Stuart Clark Tue, 09 Jun 2020 01:04:17 -0700

On 09/06/2020 08:45, vivek kodira wrote:

Hi Folks,
This is my first question here and it is not specifically aboutprometheus but about the way metrics are logged to a file. I hope youcan help.
We've implemented a service which logs metrics in the open metricsformat and as recommended in this page<https://prometheus.io/docs/instrumenting/exposition_formats/#text-based-format>.So typical metric entries will look like this:
|
app_disk_used_bytes 5.20941207552e+13 1588085025536
app_disk_free_bytes 7.1712562479104e+13 1588085025536
app_io_counters_read_bytes 2.4027136e+07 1588085025536
|
My doubt is - if/when errors occur when gathering a metric, how do welog them in this file? Can the "value" be replaced with a textindicating an "error". Or is the recommendation that errors not belogged and be recorded elsewhere?
I asked this question on the git repo<https://github.com/prometheus/docs/issues/1644> and was advised totry asking here instead.

Have an additional metric that is a counter of errors, so if an erroroccur that is incremented. You can then display or alert on the increase.

Another common option is an "up" metric (e.g. mysq_up) which has a valueof 0 if the metrics can't be fetched & 1 if all is OK.

For the broken metrics themselves you could leave them out (no metric isreturned at all during that scrape) or possibly return NaN.


--
Stuart Clark

--
You received this message because you are subscribed to the Google Groups 
"Prometheus Users" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To view this discussion on the web visit 
https://groups.google.com/d/msgid/prometheus-users/64d73982-1264-0578-ec77-b754df8a7289%40Jahingo.com.

Re: [prometheus-users] Question regarding Open Metrics format and how to handle errors

Reply via email to