Hi Richard,

The rational was to preserve Beam's DistributionResult through the use of Flink Gauges. Whoever implemented this, wasn't fully aware that Flink Histograms would be a better fit.

Feel free to open a PR. You can mention us here for a review.

Thanks,
Max

On 20.11.20 03:06, Alex Amato wrote:
Are you referring to a "Flink Gauge" or a "Beam Gauge"? Are you suggesting to package it as a "Flink Histogram?" (i.e. A Flink runner specific concept of Histograms) If so, seems fine and I have no comment here.

FWIW,
I proposed a "Beam Histogram" metric (bucket counts).
https://s.apache.org/beam-histogram-metrics <https://s.apache.org/beam-histogram-metrics>

(No runner, implements this, and most likely I will not be pursuing this further, due to a change of priority/interest around the meric I was interested in using this for). I was intending to use it for a specific set of metrics metric (No plans to provide a User defined Histogram Metric API) https://s.apache.org/beam-gcp-debuggability <https://s.apache.org/beam-gcp-debuggability>

I don't think we should pursue any plans to package "Beam Distributions" as "Beam Histograms". As a "Beam Histogram" is essential several counters (one for each bucket). Changing all usage of beam.distribution to a "Beam Histograms" would have performance implications, and is not advised. If at some point "Beam Histograms" are implemented, migrating the usage of Metrics.distribution to histogram should be done on an individual basis.

<https://s.apache.org/beam-histogram-metrics>



On Thu, Nov 19, 2020 at 5:47 PM Robert Bradshaw <rober...@google.com <mailto:rober...@google.com>> wrote:

    Guage certainly seems wrong for DistributionResult. Yes, using a
    Histogram would be a welcome PR.

    On Thu, Nov 19, 2020 at 12:58 PM Kyle Weaver <kcwea...@google.com
    <mailto:kcwea...@google.com>> wrote:
     >
     > What are the advantages of using a Histogram instead of a Gauge?
     >
     > Also, check out this design doc for adding histogram metrics to
    Beam if you haven't already: http://s.apache.org/beam-metrics-api
    <http://s.apache.org/beam-metrics-api> (Not sure what the current
    status is.)
     >
     > On Wed, Nov 18, 2020 at 1:37 PM Richard Moorhead
    <richard.moorh...@gmail.com <mailto:richard.moorh...@gmail.com>> wrote:
     >>
     >> Beam's DistributionResult is implemented as a Gauge within the
    Flink runner. Can someone explain the rationale behind this? Would a
    PR to utilize a Histogram be acceptable?

Reply via email to