Re: [PR] KAFKA-17142: Fix deadlock caused by LogManagerTest#testLogRecoveryMetrics [kafka]

via GitHub Sat, 20 Jul 2024 09:59:11 -0700


chia7712 commented on code in PR #16614:
URL: https://github.com/apache/kafka/pull/16614#discussion_r1685502849



##########
storage/src/main/java/org/apache/kafka/storage/internals/epoch/LeaderEpochFileCache.java:
##########
@@ -348,7 +348,8 @@ public void truncateFromEndAsyncFlush(long endOffset) {
                 // - We still flush the change in #assign synchronously, 
meaning that it's guaranteed that the checkpoint file always has no missing 
entries.
                 //   * Even when stale epochs are restored from the checkpoint 
file after the unclean shutdown, it will be handled by
                 //     another truncateFromEnd call on log loading procedure, 
so it won't be a problem
-                scheduler.scheduleOnce("leader-epoch-cache-flush-" + 
topicPartition, this::writeToFileForTruncation);
+                List<EpochEntry> entries = new ArrayList<>(epochs.values());
+                scheduler.scheduleOnce("leader-epoch-cache-flush-" + 
topicPartition, () -> checkpoint.writeForTruncation(entries));

Review Comment:
   > This will bring back the deadlock issue in the test, right?
   
   yes, it does. However, my point was - if it needs more discussion for 
@ocadaruma comment: "Yeah, could be an issue in some cases (e.g. deleteRecords 
is called frequently, and/or kafka-schedulers are busy) though.", we can 
improve the test before adding `writeToFileForTruncation` back to production. 
   
   At any rate, it seems we all agree to have the simple fix for now, and so I 
merge KAFKA-17166 and KAFKA-17167



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: jira-unsubscr...@kafka.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

Re: [PR] KAFKA-17142: Fix deadlock caused by LogManagerTest#testLogRecoveryMetrics [kafka]

Reply via email to