John Roesler created KAFKA-9557:
-----------------------------------

             Summary: Thread-level "process" metrics are computed incorrectly
                 Key: KAFKA-9557
                 URL: https://issues.apache.org/jira/browse/KAFKA-9557
             Project: Kafka
          Issue Type: Bug
          Components: streams
            Reporter: John Roesler
            Assignee: John Roesler


Among others, Streams reports the following two thread-level "process" metrics:

"process-rate": The average number of process calls per second.
"process-total": The total number of process calls across all tasks.

See the docs: 
https://kafka.apache.org/documentation/#kafka_streams_thread_monitoring

There's some surprising ambiguity in these definitions that has led to Streams 
actually reporting something different than what most people would probably 
expect. Specifically, it's not defined what a "process call" is.

A reasonable definition of a "process call" is processing a record or 
processing a task (both of which are publicly facing concepts, and both of 
which are the same, since tasks process records one at a time). However, we 
currently measure number of invocations to a private, internal `process()` 
method, which would actually process more than one record at a time. Thus, the 
current metric is under-counting the throughput, in an esoteric and confusing 
way.

Instead, we should simply change the rate and total metrics to measure the 
(rate and total) of _record_ processing.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to