Michael Bingham created KAFKA-10564:
---------------------------------------

             Summary: Continuous logging about deleting obsolete state 
directories
                 Key: KAFKA-10564
                 URL: https://issues.apache.org/jira/browse/KAFKA-10564
             Project: Kafka
          Issue Type: Bug
          Components: streams
    Affects Versions: 2.6.0
            Reporter: Michael Bingham


The internal process which automatically cleans up obsolete task state 
directories was modified in https://issues.apache.org/jira/browse/KAFKA-6647. 
The current logic in 2.6 is to remove all files from the task directory except 
the {{.lock}} file:

[https://github.com/apache/kafka/blob/2.6/streams/src/main/java/org/apache/kafka/streams/processor/internals/StateDirectory.java#L335]

However, the directory is only removed in its entirely for a manual cleanup:

[https://github.com/apache/kafka/blob/2.6/streams/src/main/java/org/apache/kafka/streams/processor/internals/StateDirectory.java#L349-L353]

The result of this is that Streams will continue revisiting this directory and 
trying to clean it up, since it determines what to clean based on 
last-modification time of the task directory (which is now no longer deleted 
during the automatic cleanup). So a user will see log messages like:
stream-thread [app-c2d773e6-8ac3-4435-9777-378e0ec0ab82-CleanupThread] Deleting 
obsolete state directory 0_8 for task 0_8 as 60600001ms has elapsed (cleanup 
delay is 600000ms)
repeated again and again.

This issue doesn't seem to break anything - it's more about avoiding 
unnecessary logging and cleaning up empty task directories.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to