Hello, folks.

The story.
In my org we have several k8s clusters and a quite unreliable security
department, which controls the firewall and has a habit of corrupting
the rules on it.
The confusion is immense. The issue is that at any point in time *one or 
several nodes can lose access to one or several external resources*.

So, we had a brilliant idea to mitigate the confusion by deploying a
*blackbox exporter daemonset* and *configuring Prometheus to query external
resources*. But! We have a many-to-many relationship here: we want to query
multiple resources from all of the worker nodes in the cluster.

This means that we need auto-discovery (to dynamically get the instances of
the daemonset) and multiple targets.
In the docs we found an example of multi-target requests through static
configs. Unfortunately, as the name suggests, it is a static config, so we
can't get the *metadata of the node* that lost connection or *reliably*
determine that the connection was lost at all (there is no way to properly
configure the alert threshold so that it wouldn't misfire).

On the other hand, there is Kubernetes service discovery. But the only way
we found to use it is to have a separate job for each of the remote resources.
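
To make the "one job per remote resource" layout concrete, here is a rough
sketch of how such jobs could be generated. It is not our exact config: the
target names and URLs, the `app=blackbox-exporter` pod label, the `http_2xx`
module and the `node`/`target` label names are all assumptions made for
illustration. The script prints JSON, which Prometheus should accept since
JSON is valid YAML.

```python
import json

# Hypothetical external resources to probe (names and URLs are placeholders).
EXTERNAL_TARGETS = {
    "registry": "https://registry.example.com",
    "vault": "https://vault.example.com",
}


def build_scrape_jobs(targets, module="http_2xx", pod_label_app="blackbox-exporter"):
    """Return one Prometheus scrape job per external target.

    Each job discovers pods through Kubernetes service discovery, keeps only
    the pods whose `app` label matches `pod_label_app` (an assumed label),
    and asks every such pod to probe the same URL.
    """
    jobs = []
    for name, url in sorted(targets.items()):
        jobs.append({
            "job_name": f"blackbox-{name}",
            "metrics_path": "/probe",
            # The probed URL is fixed per job, hence one job per resource.
            "params": {"module": [module], "target": [url]},
            "kubernetes_sd_configs": [{"role": "pod"}],
            "relabel_configs": [
                # Keep only the blackbox exporter pods.
                {
                    "source_labels": ["__meta_kubernetes_pod_label_app"],
                    "regex": pod_label_app,
                    "action": "keep",
                },
                # Record which node the probing pod runs on.
                {
                    "source_labels": ["__meta_kubernetes_pod_node_name"],
                    "target_label": "node",
                },
                # Record which external resource was probed.
                {"target_label": "target", "replacement": url},
            ],
        })
    return jobs


if __name__ == "__main__":
    config = {"scrape_configs": build_scrape_jobs(EXTERNAL_TARGETS)}
    print(json.dumps(config, indent=2))
```

This gives us the node metadata on every `probe_success` series, but the
number of jobs grows with the number of external resources, which is the
part we would like to avoid.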

Is there, maybe, a better way to approach the problem?
