Boyang Chen created KAFKA-10011:
-----------------------------------

             Summary: lockedTaskDirectories should be cleared when task gets 
closed dirty in HandleLostAll
                 Key: KAFKA-10011
                 URL: https://issues.apache.org/jira/browse/KAFKA-10011
             Project: Kafka
          Issue Type: Bug
            Reporter: Boyang Chen
            Assignee: Boyang Chen


Tasks who get closed in handleLostAll don't clear out their position inside 
lockedTaskDirectories, which causes an illegal state afterwards:
{code:java}
[2020-05-17T06:21:54-07:00] 
(streams-soak-trunk-eos-beta_soak_i-0b021dbf00474b6aa_streamslog) [2020-05-17 
13:21:54,127] ERROR 
[stream-soak-test-150cf9ae-793b-4aac-bea0-0fb61d228b39-StreamThread-3] 
stream-thread 
[stream-soak-test-150cf9ae-793b-4aac-bea0-0fb61d228b39-StreamThread-3] 
Encountered the following exception during processing and the thread is going 
to shut down:  (org.apache.kafka.streams.processor.internals.StreamThread)
[2020-05-17T06:21:54-07:00] 
(streams-soak-trunk-eos-beta_soak_i-0b021dbf00474b6aa_streamslog) 
org.apache.kafka.streams.errors.ProcessorStateException: task directory 
[/mnt/run/streams/state/stream-soak-test/3_1] doesn't exist and couldn't be 
created
        at 
org.apache.kafka.streams.processor.internals.StateDirectory.directoryForTask(StateDirectory.java:112)
        at 
org.apache.kafka.streams.processor.internals.StateDirectory.checkpointFileFor(StateDirectory.java:121)
        at 
org.apache.kafka.streams.processor.internals.TaskManager.getTaskOffsetSums(TaskManager.java:498)
        at 
org.apache.kafka.streams.processor.internals.StreamsPartitionAssignor.subscriptionUserData(StreamsPartitionAssignor.java:239)
        at 
org.apache.kafka.clients.consumer.internals.ConsumerCoordinator.metadata(ConsumerCoordinator.java:222)
        at 
org.apache.kafka.clients.consumer.internals.AbstractCoordinator.sendJoinGroupRequest(AbstractCoordinator.java:560)
        at 
org.apache.kafka.clients.consumer.internals.AbstractCoordinator.initiateJoinGroup(AbstractCoordinator.java:495)
        at 
org.apache.kafka.clients.consumer.internals.AbstractCoordinator.joinGroupIfNeeded(AbstractCoordinator.java:417)
        at 
org.apache.kafka.clients.consumer.internals.AbstractCoordinator.ensureActiveGroup(AbstractCoordinator.java:358)
        at 
org.apache.kafka.clients.consumer.internals.ConsumerCoordinator.poll(ConsumerCoordinator.java:506)
        at 
org.apache.kafka.clients.consumer.KafkaConsumer.updateAssignmentMetadataIfNeeded(KafkaConsumer.java:1265)
        at 
org.apache.kafka.clients.consumer.KafkaConsumer.poll(KafkaConsumer.java:1231)
        at 
org.apache.kafka.clients.consumer.KafkaConsumer.poll(KafkaConsumer.java:1206)
        at 
org.apache.kafka.streams.processor.internals.StreamThread.pollRequests(StreamThread.java:770)
        at 
org.apache.kafka.streams.processor.internals.StreamThread.runOnce(StreamThread.java:630)
        at 
org.apache.kafka.streams.processor.internals.StreamThread.runLoop(StreamThread.java:550)
{code}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to