zentol commented on PR #17834:
URL: https://github.com/apache/flink/pull/17834#issuecomment-1397207313

   So gauges are a bit annoying. Ideally we would only have Number gauges 
because really nothing else is properly supported by a majority of systems.
   
   Boolean gauges are usually a mistake from the get-go as well.
   Let's take the `isBackpressured` metric. This metric tells you whether the 
task is back-pressured right now, at this very moment.
   That's a terrible metric to base any decision on, and you should rather use 
`backPressuredTimeMsPerSecond` because it's not susceptible to bad luck. 
`isBackpressured` is only accurate if you are either 100% or 0% back-pressured; 
for everything in-between it's quite inaccurate (especially since the sampling 
interval is the reporting interval, i.e. typically on the order of seconds).
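
   To illustrate the "bad luck" point, here is a rough sketch. It is not Flink code: the workload (back-pressured for the first 300 ms of every second), the 10 s reporting interval and all function names are made up for the example. It samples a boolean once per report and compares that with a time-based measurement.

```python
def is_backpressured(t_ms: int) -> bool:
    # Hypothetical workload: back-pressured during the first 300 ms of every second.
    return t_ms % 1000 < 300


def sampled_boolean(offset_ms: int, interval_ms: int, duration_ms: int) -> float:
    """Fraction of reports in which the boolean gauge happened to read True."""
    samples = [is_backpressured(t) for t in range(offset_ms, duration_ms, interval_ms)]
    return sum(samples) / len(samples)


def back_pressured_ms_per_second(duration_ms: int) -> float:
    """Average number of back-pressured milliseconds per second of wall time."""
    total_ms = sum(is_backpressured(t) for t in range(duration_ms))
    return total_ms / (duration_ms / 1000)


DURATION_MS = 60_000   # observe one minute
INTERVAL_MS = 10_000   # assumed reporting interval of 10 seconds

# Same workload, same interval; only the phase of the sampling differs.
lucky = sampled_boolean(0, INTERVAL_MS, DURATION_MS)
unlucky = sampled_boolean(500, INTERVAL_MS, DURATION_MS)
time_based = back_pressured_ms_per_second(DURATION_MS)

print(f"boolean gauge, offset 0 ms:   {lucky:.0%} of reports say back-pressured")
print(f"boolean gauge, offset 500 ms: {unlucky:.0%} of reports say back-pressured")
print(f"time-based metric:            {time_based:.0f} ms/s")
```

   Under these assumptions the task is back-pressured 30% of the time, yet the sampled boolean reports either "always" or "never" depending purely on when the reporter happens to fire, while the time-based value stays at 300 ms/s.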
   
   That's why the PR didn't receive any attention. In a way it'd only enable 
users to rely on bad metrics. Sure, consistency across reporters isn't a bad 
argument, but this consistency should still provide some real value.
   
   Mapping booleans to ints isn't necessarily sound either, because 
aggregating them isn't obvious.
   If we really wanted to support these gauges we'd ideally map them to a 
distribution I guess.
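
   A small sketch of the aggregation problem, with made-up readings from four hypothetical subtasks (nothing here is a Flink or reporter API). Once booleans become 0/1, each common aggregation answers a different question, and a metrics backend cannot know which one was intended.

```python
from collections import Counter

# Hypothetical boolean gauge readings from four parallel subtasks.
readings = [True, False, False, True]

# Naive mapping to ints, as a reporter might do.
as_ints = [int(value) for value in readings]

# Each aggregation silently changes the meaning of the metric.
total = sum(as_ints)                 # "how many subtasks are true?"
any_true = max(as_ints)              # "is at least one subtask true?"
all_true = min(as_ints)              # "are all subtasks true?"
mean = sum(as_ints) / len(as_ints)   # "what share of subtasks is true?"

# Keeping the readings as a distribution preserves the full picture
# and lets the consumer pick the question afterwards.
distribution = Counter(readings)

print(total, any_true, all_true, mean)
print(dict(distribution))
```

   With a distribution (here simply counts per value) none of the information is lost, which is roughly why it looks like the more honest mapping.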


