Sophie Blee-Goldman created KAFKA-10144:
-------------------------------------------

             Summary: Corrupted standby tasks are not always cleaned up
                 Key: KAFKA-10144
                 URL: https://issues.apache.org/jira/browse/KAFKA-10144
             Project: Kafka
          Issue Type: Bug
          Components: streams
            Reporter: Sophie Blee-Goldman
            Assignee: Sophie Blee-Goldman
             Fix For: 2.6.0


Thread death on the 2.6-eos-beta soak was caused by an attempt to re-register
a standby task changelog that was already registered. The root cause was that
the task had been marked corrupted, but `commit` threw a TaskMigratedException
before we could call TaskManager#handleCorruption and properly clean up the
task.

For corrupted active tasks this is not a problem, since #handleLostAll will
then finish the cleanup. But we intentionally don't clear standby tasks on
TaskMigratedException, which leaves the standby task corrupted and only
partially cleaned up, with its changelog still registered.
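
The failure sequence described above can be sketched roughly as follows. This
is a simplified model, not the actual Kafka Streams code: the class
CorruptedStandbySketch, the ChangelogRegistry stand-in, the local TaskMigrated
exception, and the changelog name are all hypothetical and exist only to
illustrate the ordering problem.

```java
import java.util.HashSet;
import java.util.Set;

// Hypothetical, simplified model of the bug; none of these types are real
// Kafka Streams internals.
class CorruptedStandbySketch {

    // Stand-in for TaskMigratedException.
    static class TaskMigrated extends RuntimeException { }

    // Stand-in for the changelog bookkeeping: registering twice is fatal.
    static class ChangelogRegistry {
        private final Set<String> registered = new HashSet<>();

        void register(String changelog) {
            if (!registered.add(changelog)) {
                throw new IllegalStateException(
                    "changelog " + changelog + " is already registered");
            }
        }

        void unregister(String changelog) {
            registered.remove(changelog);
        }
    }

    // Stand-in for the commit that precedes corruption handling.
    static void commit(boolean throwTaskMigrated) {
        if (throwTaskMigrated) {
            throw new TaskMigrated();
        }
    }

    // Returns true if the standby changelog can be registered again after
    // the corruption was detected, false if re-registration blows up.
    static boolean reRegistrationSucceeds(boolean commitThrows) {
        ChangelogRegistry registry = new ChangelogRegistry();
        String changelog = "standby-changelog"; // hypothetical name
        registry.register(changelog);           // initial registration

        try {
            commit(commitThrows);
            // Cleanup step, playing the role of handleCorruption.
            registry.unregister(changelog);
        } catch (TaskMigrated e) {
            // Standby tasks are intentionally not cleared on this path,
            // so the changelog stays registered.
        }

        try {
            registry.register(changelog);       // task is initialized again
            return true;
        } catch (IllegalStateException e) {
            return false;                       // the condition that killed the thread
        }
    }
}
```

In this model, re-registration only fails when the commit throws before the
cleanup step runs, which matches the ordering described in this report.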



--
This message was sent by Atlassian Jira
(v8.3.4#803005)
