Alexander created KAFKA-13384:
---------------------------------

             Summary: FailedPartitionsCount metric is not updated if a partition log file was corrupted
                 Key: KAFKA-13384
                 URL: https://issues.apache.org/jira/browse/KAFKA-13384
             Project: Kafka
          Issue Type: Bug
    Affects Versions: 2.6.2
         Environment: OS:
NAME="Amazon Linux AMI"
VERSION="2018.03"
ID="amzn"
ID_LIKE="rhel fedora"
VERSION_ID="2018.03"
PRETTY_NAME="Amazon Linux AMI 2018.03"
CPE_NAME="cpe:/o:amazon:linux:2018.03:ga"
HOME_URL="http://aws.amazon.com/amazon-linux-ami/"
Kafka version: 2.6.2 (Commit:da65af02e5856e34)
            Reporter: Alexander
         Attachments: Screenshot 2021-10-19 at 15.28.33.png

We found a misbehavior on our Kafka cluster (version: 2.6.2 (Commit:da65af02e5856e34)): the `FailedPartitionsCount` metric is not updated if a partition log file is corrupted.

Steps to reproduce the problem:
1. Corrupt a partition log file.
2. Restart the Kafka process.

After that, the broker log correctly reports that Kafka marked the corrupted partition as failed:
```
2021-10-19T14:49:31+02:00 [2021-10-19 12:49:30,924] WARN [ReplicaFetcher replicaId=11, leaderId=10, fetcherId=0] Partition test_topic-1 marked as failed (kafka.server.ReplicaFetcherThread)
```
But the value of the `FailedPartitionsCount` metric is still 0 (see the attached screenshot).

--
This message was sent by Atlassian Jira
(v8.3.4#803005)
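
To cross-check the metric against the broker log, the distinct partitions reported as failed can be counted from the log itself. The following is a minimal sketch, not part of Kafka: the regular expression is inferred only from the single WARN line quoted in this report, so other Kafka versions may word the message differently, and the names `FAILED_RE`, `failed_partitions`, and `SAMPLE` are hypothetical helpers.

```python
import re

# Pattern inferred from the one WARN line quoted in this report (assumption:
# the message format is the same for every failed partition on this version).
FAILED_RE = re.compile(
    r"\[ReplicaFetcher replicaId=(?P<replica>\d+), leaderId=(?P<leader>\d+), "
    r"fetcherId=(?P<fetcher>\d+)\] Partition (?P<partition>\S+) marked as failed"
)


def failed_partitions(log_lines):
    """Return the set of topic-partitions that the log reports as failed."""
    failed = set()
    for line in log_lines:
        match = FAILED_RE.search(line)
        if match:
            failed.add(match.group("partition"))
    return failed


# The log line from the report, used here as sample input.
SAMPLE = (
    "2021-10-19T14:49:31+02:00 [2021-10-19 12:49:30,924] WARN "
    "[ReplicaFetcher replicaId=11, leaderId=10, fetcherId=0] "
    "Partition test_topic-1 marked as failed (kafka.server.ReplicaFetcherThread)"
)

if __name__ == "__main__":
    partitions = failed_partitions([SAMPLE])
    print(len(partitions), sorted(partitions))
```

If the count obtained this way is non-zero while the `FailedPartitionsCount` gauge still reads 0, that matches the discrepancy described above. Note that the sketch only counts partitions the log has reported as failed; it does not track partitions that later recover.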