Kevin Lu created KAFKA-7300:
-------------------------------

             Summary: Add Fetcher fetch-error-rate and fetch-error-total metrics
                 Key: KAFKA-7300
                 URL: https://issues.apache.org/jira/browse/KAFKA-7300
             Project: Kafka
          Issue Type: New Feature
          Components: clients, consumer, metrics
            Reporter: Kevin Lu
            Assignee: Kevin Lu

The KafkaConsumer is a complex client that depends on many components to function properly. When a consumer fails, it can be difficult to identify the root cause and which component failed (ConsumerCoordinator, Fetcher, ConsumerNetworkClient, etc.).

This issue aims to improve monitoring and failure detection for the KafkaConsumer’s Fetcher component.

Fetcher sends a fetch request to each node that hosts partitions assigned to the consumer.

A fetch request may fail in the following cases:
 * Intermittent network issues (the request goes to onFailure)
 * The node sent an invalid full/incremental fetch response (FetchSessionHandler’s handleResponse returns false)
 * FetchSessionIdNotFound
 * InvalidFetchSessionEpochException

These cases are logged, but there is currently no corresponding metric, so they cannot be monitored or alerted on. Adding fetch-error-rate and fetch-error-total would close that gap.
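
To illustrate what the two proposed metrics would report, here is a minimal, self-contained sketch in plain Java. It is not the Fetcher implementation and does not use Kafka's metrics classes: the class name FetchErrorMetrics, its methods, and the injected clock are all hypothetical, and the rate is a simple average since creation rather than the windowed rate a real client metric would likely use.

```java
import java.util.concurrent.atomic.AtomicLong;
import java.util.function.LongSupplier;

// Hypothetical helper, for illustration only: tracks the two values the
// proposed fetch-error-total and fetch-error-rate metrics would expose.
final class FetchErrorMetrics {
    private final AtomicLong total = new AtomicLong();
    private final LongSupplier clockMs; // injected so the rate is testable
    private final long startMs;

    FetchErrorMetrics(LongSupplier clockMs) {
        this.clockMs = clockMs;
        this.startMs = clockMs.getAsLong();
    }

    // Would be called once per failed fetch request, for any of the
    // failure cases listed above.
    void recordFetchError() {
        total.incrementAndGet();
    }

    // fetch-error-total: number of failed fetch requests so far.
    long fetchErrorTotal() {
        return total.get();
    }

    // fetch-error-rate: failed fetch requests per second, averaged over
    // the time since this object was created (assumption: no windowing).
    double fetchErrorRate() {
        long elapsedMs = Math.max(1L, clockMs.getAsLong() - startMs);
        return total.get() * 1000.0 / elapsedMs;
    }
}
```

In this sketch, a caller would construct it with System::currentTimeMillis and call recordFetchError() from each failure path. An alert could then fire when fetchErrorRate() stays above a chosen threshold.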