On 27/02/2021 15:42, Saurabh Vartak wrote:
Hi Stuart,

Thanks to all the knowledge and guidance you have imparted to me, I have decided to go with the below approach:

1. For the scenarios where aggregation of metrics is desired, I will implement Prometheus Federation.
2. For viewing the metrics of multiple Kubernetes clusters individually, I will implement a central Grafana dashboard with the individual AKS clusters added as datasources.
3. For long-term retention or backup of the metrics, I will use *remote_write* to write all the metrics from the individual Kubernetes clusters to an InfluxDB instance. In case of any data loss, I can create a new Prometheus server instance and point its *remote_read* at this InfluxDB instance, so that the same Grafana dashboards with the same PromQL queries can be used. If a *remote_write*-based backup is not desired for any reason, then the simple option of taking disk snapshots of the Prometheus server can be used, although the snapshots have to be taken more frequently if the loss of metrics data is to be minimized.
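
As a rough illustration of points 1 and 3, here is a minimal sketch of the two Prometheus configurations. It makes several assumptions that are not part of the plan above: the hostnames and the cluster label are hypothetical, the InfluxDB instance is taken to be a 1.x release (which exposes the /api/v1/prom/write and /api/v1/prom/read endpoints) with a database named "prometheus" already created, and the federation selector assumes aggregated series are produced by recording rules named with a "job:" prefix.

```yaml
# File 1: prometheus.yml on the central (federating) Prometheus server.
scrape_configs:
  - job_name: federate
    honor_labels: true          # keep the labels set by the source servers
    metrics_path: /federate
    params:
      'match[]':
        - '{__name__=~"job:.*"}'   # assumption: only pull aggregated recording rules
    static_configs:
      - targets:
          - prometheus-aks-1.example.com:9090   # hypothetical per-cluster server
          - prometheus-aks-2.example.com:9090   # hypothetical per-cluster server
---
# File 2: prometheus.yml on each per-cluster Prometheus server.
global:
  external_labels:
    cluster: aks-1              # hypothetical; tells clusters apart in InfluxDB
remote_write:
  - url: http://influxdb.example.com:8086/api/v1/prom/write?db=prometheus
# On a replacement server after data loss, remote_read lets existing
# PromQL queries fetch the historical samples back from InfluxDB.
remote_read:
  - url: http://influxdb.example.com:8086/api/v1/prom/read?db=prometheus
```

The two documents are separate files and are shown together only for brevity. Pulling only aggregated series through /federate keeps the central server small; the per-cluster detail stays available through the individual Grafana datasources in point 2.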

Does this sound like a plan?

That sounds perfectly reasonable.

I hope you get it all working and it does what you are hoping for :-)

--
Stuart Clark
