Re: Flink Metrics Naming

Chesnay Schepler Tue, 01 Jun 2021 08:43:56 -0700

Some more background on MetricGroups:
Internally there (mostly) 3 types of metric groups:

On the one hand we have the ComponentMetricGroups (likeTaskManagerMetricGroup) that describe a high-level Flink entity, whichjust add a constant expression to the logical scope(like taskmanager,task etc.). These exist to support scope formats (although thisshould've been implemented differently, but that's a another story).

On the other hand we have groups created via addGroup(String), which areadded to the logical scope as is; this is sometimes good(e.g.,addGroup("KafkaConsumer"), and sometimes isn't (e.g.,addGroup(<some-topic-id>).Finally, there is a addGroup(String, String) variant, which behaves likea key-value pair (and similarly to the ComponentMetricGroup). The keypart is added to the logical scope, and a label is usually added as well.

Due to historical reasons some parts in Flink use addGroup(String)despite the key-value pair variant being more appropriate; the latterwas only added later, as was the logical scope as a whole for that matter.

With that said, the logical scope and labels suffer a bit due to beingretrofitted on an existing design and some early mistakes in the metricstructuring.Ideally (imo), things would work like this (*bold *parts signify changesto the current behavior):- addGroup(String) is *sparsely used* and only for high-levelhierarchies (job, operator, source, kafka). It is added as is to thelogical scope, creates no label, and is *excluded from the metricidentifier*.- addGroup(String, String) has *no effect on the logical scope*, createsa label, and is added as <key>.<value> to the metric identifier.

The core issue with these kind of changes however is backwardscompatibility. We would have to do a sweep over the code-base to migrateinappropriate usages of addGroup(String) to the key-pair variant,probably remove some unnecessary groups (e.g., "Status" that is used forCPU metrics and whatnot) and finally make changes to the metric systeminternals, all of which need a codepath that retain the current behavior.

Simply put, for immediate needs I would probably encourage you do createa modified PrometheusReporter which determines the logical scope as yousee fit; it could just ignore the logical scope entirely (although I'mnot sure how well prometheus handles 1 metric having multiple instanceswith different label sets (e.g., numRecordsIn for operators/tasks), orexclude user-defined groups with something hacky like only using thefirst 4 parts of the logical scope.


On 6/1/2021 4:56 PM, Mason Chen wrote:

Upon further inspection, it seems like the user scope is not universal(i.e. comes through the connectors and not UDFs (like rich mapfunction)), but the question still stands if the process makes sense.
On Jun 1, 2021, at 10:38 AM, Mason Chen <[email protected]<mailto:[email protected]>> wrote:
Makes sense. We are primarily concerned with removing the metriclabels from the names as the user metrics get too long. i.e. thegroups from `addGroup` are concatenated in the metric name.
Do you think there would be any issues with removing the groupinformation in the metric name and putting them into a label instead?In seems like most metrics internally, don’t use `addGroup` to creategroup information but rather by creating another subclass of metricgroup.
Perhaps, I should ONLY apply this custom logic to metrics with the“user” scope? Other scoped metrics (e.g. operator, task operator,etc.) shouldn’t have these group names in the metric names in myexperience...
An example just for clarity,flink_<system_scope>_group1_group2_metricName{group1=…, group2=…,flink tags}
=>
flink_<system_scope>_metricName{group_info=group1_group2, group1=…,group2=…, flink tags}
On Jun 1, 2021, at 9:57 AM, Chesnay Schepler <[email protected]<mailto:[email protected]>> wrote:
The uniqueness of metrics and the naming of the Prometheus reporterare somewhat related but also somewhat orthogonal.
Prometheus works similar to JMX in that the metric name (e.g.,taskmanager.job.task.operator.numRecordsIn) is more or less a_class_ of metrics, with tags/labels allowing you to select aspecific instance of that metric.
Restricting metric names to 1 level of the hierarchy would present afew issues:a) Effectively, all metric names that Flink uses effectively becomereserved keywords that users must not use, which will lead toheadaches when adding more metrics or forwarding metrics fromlibraries (e.g., kafka), because we could always break existinguser-defined metrics.b) You'd need a cluster-wide lookup that is aware of all hierarchiesto ensure consistency across all processes.
In the end, there are significantly easier ways to solve the issueof the metric name being too long, i.e., give the user more controlover the logical scope (taskmanager.job.task.operator), be itshortening the names (t.j.t.o), limiting the depth (e.g,operator.numRecordsIn), removing it outright (but I'd prefer somecontext to be present for clarity) or supporting something similarto scope formats.I'm reasonably certain there are some tickets already in thisdirection, we just don't get around to doing them because for themost part the metric system works good enough and there are biggerfish to fry.
On 6/1/2021 3:39 PM, Till Rohrmann wrote:
Hi Mason,
The idea is that a metric is not uniquely identified by its namealone but instead by its path. The groups in which it is definedspecify this path (similar to directories). That's why it is validto specify two metrics with the same name if they reside indifferent groups.
I think Prometheus does not support such a tree structure andthat's why the path is exposed via labels if I am not mistaken. Solong story short, what you are seeing is a combination of how Flinkorganizes metrics and what can be reported to Prometheus.
I am also pulling in Chesnay who is more familiar with this part ofthe code.
Cheers,
Till
On Fri, May 28, 2021 at 7:33 PM Mason Chen <[email protected]<mailto:[email protected]>> wrote:
    Can anyone give insight as to why Flink allows 2 metrics with
    the same “name”?

    For example,

    getRuntimeContext.addGroup(“group”,
    “group1”).counter(“myMetricName”);

    And

    getRuntimeContext.addGroup(“other_group”,
    “other_group1”).counter(“myMetricName”);

    Are totally valid.


    It seems that it has lead to some not-so-great
    implementations—the prometheus reporter and attaching the
    labels to the metric name, making the name quite verbose.

Re: Flink Metrics Naming

Reply via email to