Kevin Lu created KAFKA-7300:
-------------------------------

             Summary: Add Fetcher fetch-error-rate and fetch-error-total 
metrics 
                 Key: KAFKA-7300
                 URL: https://issues.apache.org/jira/browse/KAFKA-7300
             Project: Kafka
          Issue Type: New Feature
          Components: clients, consumer, metrics
            Reporter: Kevin Lu
            Assignee: Kevin Lu


The KafkaConsumer is a complex client that requires many different components 
to function properly. When a consumer fails, it can be difficult to identify 
the root cause and which component failed (ConsumerCoordinator, Fetcher, 
ConsumerNetworkClient, etc).

 

This aims to improve the monitoring and detection of KafkaConsumer’s Fetcher 
component.

 

Fetcher will send a fetch request for each node that the consumer has assigned 
partitions for.

 

This fetch request may fail under the following cases:
 * Intermittent network issues (goes to onFailure)
 * Node sent an invalid full/incremental fetch response (FetchSessionHandler’s 
handleResponse returns false)
 * FetchSessionIdNotFound
 * InvalidFetchSessionEpochException

 

These cases are logged, but it would be valuable to provide a corresponding 
metric that allows for monitoring and alerting.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to