On 25/02/2021 20:29, Saurabh Vartak wrote:
Hi Stuart,
Thanks again for your help and continued guidance. To summarise your
suggestions in a nutshell:
1. If there is a requirement to have aggregated metrics in place,
Prometheus Federation would be the way to go.
2. If there is a requirement for long-term retention (either for a
single Prometheus server or a group of Prometheus servers), an external
storage solution like Cortex or Thanos can be used.
I hope I am correct with the above two points.
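(For point 2, the usual mechanism is a remote_write stanza on each
Prometheus server pointing at the external store. A minimal sketch,
assuming a Cortex or Thanos Receive style push endpoint — the URL below
is a placeholder, not a real endpoint:)

```yaml
# Hypothetical remote_write configuration for long-term storage.
# Cortex and Thanos Receive both accept the Prometheus remote write
# protocol; the URL here is purely illustrative.
remote_write:
  - url: 'http://cortex.example.internal/api/v1/push'
```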
Also, I need your help with the below two questions to wrap up this thread:
1. When we use Prometheus Federation, the metrics sent from a
Prometheus server to a centralized Prometheus server do get stored in
the TSDB of the centralized Prometheus server. Is the understanding
correct?
That is correct. The central server sees the federation with the other
server in exactly the same way as any other scrape target.
So the configured storage retention and any remote write configuration
would apply (in the same way as for any other targets the central
server scrapes).
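(Concretely, on the central server federation is just a scrape job
against the /federate endpoint. A sketch, with illustrative names,
intervals, and match[] selector — none of these come from the thread:)

```yaml
# Hypothetical federation scrape job on the central Prometheus server.
scrape_configs:
  - job_name: 'federate'
    scrape_interval: 2m          # illustrative; see the discussion below
    honor_labels: true           # keep the labels set by the local server
    metrics_path: '/federate'
    params:
      'match[]':
        - '{job="node"}'         # which series the local server exposes
    static_configs:
      - targets:
          - 'prometheus-local:9090'   # placeholder local server address
```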
2. When we use Prometheus Federation, all the metrics scraped by a
Prometheus server can be sent to the Centralized Prometheus server.
However, as a best practice, it is always recommended to send only
aggregated metrics to the centralized Prometheus server. Is the
understanding correct?
Federation is different from "sending" metrics around. In particular,
when a server scrapes the federation endpoint, it returns only the
latest value for each of the selected metrics at that point in time.
For example, if the scrape interval of a target were 30s but the
interval for the federation scrape were 120s, the local server would
hold 4 values for every 2-minute period, but the central server would
only contain 1.
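(The sample-count arithmetic above can be spelled out; the intervals
are the hypothetical 30s/120s ones from the example:)

```python
# How federation thins out samples: the local server scrapes the
# target every 30s, the central server scrapes /federate every 120s.
window = 120             # seconds in the comparison window
target_interval = 30     # local scrape interval for the target
federation_interval = 120  # central scrape interval for /federate

local_samples = window // target_interval        # samples kept locally
central_samples = window // federation_interval  # samples kept centrally

print(local_samples, central_samples)  # 4 1
```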
While you could try to use federation to fetch all metrics (remembering
that you wouldn't necessarily get all the values scraped by the local
server), you may quickly hit resource limits. The text exposition
format used for federation is not as efficient as the protocol used for
remote write, for example, so you might see high network or CPU usage
on both servers. Equally, depending on the quantity of metrics and the
scrape interval chosen for the central server, you could find the
volume so great that the scrape fails to complete within the timeout
period (which is at most the scrape interval).
This is in addition to the multiplication effect of trying to store all
metrics in a central server: a single server might handle 1 million
time series, but federating all metrics from 100 such servers centrally
would require the central server to handle 100 million time series,
which would likely need far more resources than are reasonably
available.
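(The usual pattern for keeping the federated volume small is to
pre-aggregate on each local server with a recording rule and federate
only the aggregated series. A sketch — the rule name, metric, and
selector are all illustrative, following the common "level:metric:operation"
naming convention:)

```yaml
# Hypothetical recording rule on each local Prometheus server:
# collapse per-instance series into one job-level series.
groups:
  - name: aggregate-for-federation
    rules:
      - record: job:http_requests_total:sum
        expr: sum by (job) (http_requests_total)
```

The central server's federation job would then match only those
aggregated series, e.g. with 'match[]': '{__name__=~"job:.*"}', so each
local server contributes a handful of series rather than all of them.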
--
Stuart Clark
--
You received this message because you are subscribed to the Google Groups
"Prometheus Users" group.
To unsubscribe from this group and stop receiving emails from it, send an email
to [email protected].
To view this discussion on the web visit
https://groups.google.com/d/msgid/prometheus-users/bf446ede-e735-da31-5731-07f994a4dd3e%40Jahingo.com.