nick song created KAFKA-10786:
---------------------------------
Summary: ReplicaAlterLogDirsThread gets stuck during the
reassignment of Kafka partition
Key: KAFKA-10786
URL: https://issues.apache.org/jira/browse/KAFKA-10786
Project: Kafka
Issue Type: Bug
Components: log
Affects Versions: 2.0.0
Reporter: nick song
Attachments: attachment 1.png, attachment 2.png, attachment 3.png
Topic config:
Configs for topic 'athena_8603' are
leader.replication.throttled.replicas=9:7,9:6,10:8,10:7,8:6,8:5,11:9,11:8,follower.replication.throttled.replicas=9:13,10:0,8:15,11:14,retention.ms=86400000,delete.retention.ms=60000
Reassignment of replica athena_8603-1-15 is still in progress
While reassigning the topic's partitions, I found that some reassignment tasks
stayed in progress for more than ten hours. On investigation, I found that
ReplicaAlterLogDirsThread was running continuously and using a lot of CPU
(Attachment 1).
The thread information (Attachment 2) shows that log data is being copied. The
log directory (Attachment 3) shows that the index in the future directory is
older than the original log. Could the configuration
delete.retention.ms=60000 have caused data to be deleted while it was being
copied, leaving the replication thread stuck? Is there a solution?
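
For reference, the topic configuration quoted above can be decoded with a
small script. This is only a sketch: the helper name parse_topic_configs is
hypothetical and not part of Kafka, and it assumes the comma-separated
key=value layout shown in the config output. It shows that retention.ms is 24
hours and delete.retention.ms is 60 seconds. As far as I understand the Kafka
topic configuration documentation, delete.retention.ms controls how long
delete tombstones are kept on log-compacted topics, while retention.ms governs
time-based deletion of log segments, so the 24-hour value may be the more
relevant one here.

```python
# Raw config string copied from the "Topic config" output in this report.
RAW = (
    "leader.replication.throttled.replicas=9:7,9:6,10:8,10:7,8:6,8:5,11:9,11:8,"
    "follower.replication.throttled.replicas=9:13,10:0,8:15,11:14,"
    "retention.ms=86400000,delete.retention.ms=60000"
)


def parse_topic_configs(raw: str) -> dict:
    """Hypothetical helper: split the config string into key/value pairs.

    Values such as the throttled replica lists contain commas themselves,
    so a token without '=' is treated as a continuation of the previous value.
    """
    configs = {}
    key = None
    for token in raw.split(","):
        if "=" in token:
            key, value = token.split("=", 1)
            configs[key] = value
        elif key is not None:
            configs[key] += "," + token
    return configs


configs = parse_topic_configs(RAW)

# Convert the two retention settings from milliseconds to readable units.
retention_hours = int(configs["retention.ms"]) / 3_600_000
delete_retention_seconds = int(configs["delete.retention.ms"]) / 1000

print(retention_hours)           # 24.0
print(delete_retention_seconds)  # 60.0
```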
--
This message was sent by Atlassian Jira
(v8.3.4#803005)