Re: Type of builtin PTransform/PCollection metrics

Maximilian Michels Wed, 13 Nov 2019 10:56:44 -0800

Are you referring specifically to?
* beam:metric:element_count:v1
* beam:metric:pardo_execution_time:start_bundle_msecs:v1
* beam:metric:pardo_execution_time:process_bundle_msecs:v1
* beam:metric:pardo_execution_time:finish_bundle_msecs:v1
* beam:metric:ptransform_execution_time:total_msecs:v1


Yes.

Would the gauge be grouped per element or per bundle?


Per bundle. These are reported when the bundle finishes.

If grouped at the bundle level the metrics are arbitrary to the user since the 
bundle size is chosen by the runner.

Not necessarily because the bundle size is typically fixed (at least inthe Flink Runner). In any case, it provides information about how muchactivity occurred in a bundle which is useful to know.

There is also a very significant overhead for tracking low level metrics

I can't imagine tracking a per-bundle element count or execution time isthat expensive. Maybe I'm wrong.


-Max

On 13.11.19 18:58, Luke Cwik wrote:

Are you referring specifically to?
* beam:metric:element_count:v1
* beam:metric:pardo_execution_time:start_bundle_msecs:v1
* beam:metric:pardo_execution_time:process_bundle_msecs:v1
* beam:metric:pardo_execution_time:finish_bundle_msecs:v1
* beam:metric:ptransform_execution_time:total_msecs:v1

Would the gauge be grouped per element or per bundle?
If grouped at the bundle level the metrics are arbitrary to the usersince the bundle size is chosen by the runner.
If grouped at the element level then only a few of the metrics make sense:
* element_count becomes number of outputs per input element
* process_bundle_msecs becomes amount of time to process a single inputelement (does this still apply to elements that can be split?)
There is also a very significant overhead for tracking low level metricsin great detail which is why timing is done through a samplingtechnique. I'm sure if we could do it cheaply then it would make senseto get those metrics. This is also a place where we want each SDK toimplement these metrics so complexity may slow down SDK authors fromdeveloping them.
On Wed, Nov 13, 2019 at 5:13 AM Maximilian Michels <[email protected]<mailto:[email protected]>> wrote:
    Hi,

    We have a series of builtin PTransform/PCollection metrics:
    
https://github.com/apache/beam/blob/808cb35018cd228a59b152234b655948da2455fa/model/pipeline/src/main/proto/metrics.proto#L74

    Why are those of counters ("beam:metrics:sum_int_64")? I think the
    better default type for most users would be gauge
    ("beam:metrics:latest_int_64").

    I understand that counters are useful because they retain the sum of
    all
    reported values, but for getting an idea about the deviation of a
    metric, gauges could be more useful.

    Perhaps we could make this configurable?

    Thanks,
    Max

Re: Type of builtin PTransform/PCollection metrics

Reply via email to