codeHardest opened a new issue, #20738: URL: https://github.com/apache/pulsar/issues/20738
### Search before asking

- [X] I searched in the [issues](https://github.com/apache/pulsar/issues) and found nothing similar.

### Version

Pulsar version: 2.11.1
OS: Linux k8s-ma 4.19.0-9-amd64 #1 SMP Debian 4.19.118-2+deb10u1 (2020-06-07) x86_64 GNU/Linux

### Minimal reproduce step

The broker seems stuck. No deadlock was found, but all the pulsar-x-x threads are waiting to lock an object that is already locked by the pulsar-2-7 thread. The pulsar-2-7 thread is itself stuck at getBundleDataOrDefault:

    "pulsar-2-7" #127 prio=5 os_prio=0 cpu=941834.56ms elapsed=431390.74s tid=0x00007f3cd401b100 nid=0x142 waiting on condition [0x00007f3c28bf5000]
       java.lang.Thread.State: WAITING (parking)
        at jdk.internal.misc.Unsafe.park([email protected]/Native Method)
        - parking to wait for <0x000000071b78aac8> (a java.util.concurrent.CompletableFuture$Signaller)
        at java.util.concurrent.locks.LockSupport.park([email protected]/LockSupport.java:211)
        at java.util.concurrent.CompletableFuture$Signaller.block([email protected]/CompletableFuture.java:1864)
        at java.util.concurrent.ForkJoinPool.unmanagedBlock([email protected]/ForkJoinPool.java:3463)
        at java.util.concurrent.ForkJoinPool.managedBlock([email protected]/ForkJoinPool.java:3434)
        at java.util.concurrent.CompletableFuture.waitingGet([email protected]/CompletableFuture.java:1898)
        at java.util.concurrent.CompletableFuture.join([email protected]/CompletableFuture.java:2117)
        at org.apache.pulsar.broker.loadbalance.impl.ModularLoadManagerImpl.getBundleDataOrDefault(ModularLoadManagerImpl.java:389)
        at org.apache.pulsar.broker.loadbalance.impl.ModularLoadManagerImpl.lambda$selectBrokerForAssignment$8(ModularLoadManagerImpl.java:839)
        at org.apache.pulsar.broker.loadbalance.impl.ModularLoadManagerImpl$$Lambda$698/0x00000008012b34d0.apply(Unknown Source)
        at java.util.concurrent.ConcurrentHashMap.computeIfAbsent([email protected]/ConcurrentHashMap.java:1708)
        - locked <0x000000071b78ab28> (a java.util.concurrent.ConcurrentHashMap$ReservationNode)
        at org.apache.pulsar.broker.loadbalance.impl.ModularLoadManagerImpl.selectBrokerForAssignment(ModularLoadManagerImpl.java:838)
        - locked <0x0000000702298510> (a java.util.HashSet)
        at org.apache.pulsar.broker.loadbalance.impl.ModularLoadManagerWrapper.getLeastLoaded(ModularLoadManagerWrapper.java:68)
        at org.apache.pulsar.broker.namespace.NamespaceService.getLeastLoadedFromLoadManager(NamespaceService.java:702)
        at org.apache.pulsar.broker.namespace.NamespaceService.searchForCandidateBroker(NamespaceService.java:525)
        at org.apache.pulsar.broker.namespace.NamespaceService.lambda$findBrokerServiceUrl$8(NamespaceService.java:410)
        at org.apache.pulsar.broker.namespace.NamespaceService$$Lambda$696/0x00000008012b3068.run(Unknown Source)
        at java.util.concurrent.Executors$RunnableAdapter.call([email protected]/Executors.java:539)
        at java.util.concurrent.FutureTask.run([email protected]/FutureTask.java:264)
        at java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.run([email protected]/ScheduledThreadPoolExecutor.java:304)
        at java.util.concurrent.ThreadPoolExecutor.runWorker([email protected]/ThreadPoolExecutor.java:1136)
        at java.util.concurrent.ThreadPoolExecutor$Worker.run([email protected]/ThreadPoolExecutor.java:635)
        at io.netty.util.concurrent.FastThreadLocalRunnable.run(FastThreadLocalRunnable.java:30)
        at java.lang.Thread.run([email protected]/Thread.java:833)

The whole thread dump is below:

[dump.txt](https://github.com/apache/pulsar/files/11966613/dump.txt)

### What did you expect to see?

Not stuck.

### What did you see instead?

Stuck, request timeout.

### Anything else?

_No response_

### Are you willing to submit a PR?

- [X] I'm willing to submit a PR!

--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]
