Denis Chudov created IGNITE-25808:
-------------------------------------

             Summary: Lease negotiator may flood the log if a candidate left and topology tracker not updated
                 Key: IGNITE-25808
                 URL: https://issues.apache.org/jira/browse/IGNITE-25808
             Project: Ignite
          Issue Type: Bug
            Reporter: Denis Chudov

If there are hundreds of tables and the placement driver's TopologyTracker is delayed for any reason, the placement driver does not learn about the topology change and keeps choosing the offline node as a lease candidate. This is not a critical issue, but it can fill the logs with warnings like the following within a few seconds:
{code:java}
2025-05-06 10:06:44:672 +0000 [WARNING][%gridgain-2.novalocal%JRaft-FSMCaller-Disruptor-metastorage_group_stripe_0-0][LeaseNegotiator] Lease was not negotiated due to exception [lease=Lease [leaseholder=gridgain-5.novalocal, leaseholderId=3844f6cb-ee97-49bd-8d40-5b96d0a6bc82, accepted=false, startTime=HybridTimestamp [physical=2025-05-06 10:06:44:602 +0000, logical=114, composite=114460328237596786], expirationTime=HybridTimestamp [physical=2025-05-06 10:08:44:602 +0000, logical=0, composite=114460336101916672], prolongable=true, proposedCandidate=null, replicationGroupId=590_part_13]]
org.apache.ignite.internal.network.UnresolvableConsistentIdException: IGN-NETWORK-1 TraceId:57c4ef42-4786-431b-a6ff-a7d79edcfd6a Recipient consistent ID cannot be resolved: gridgain-5.novalocal
	at org.apache.ignite.internal.network.DefaultMessagingService.invoke(DefaultMessagingService.java:231)
	at org.apache.ignite.internal.network.MessagingService.invoke(MessagingService.java:190)
	at org.apache.ignite.internal.placementdriver.negotiation.LeaseNegotiator.negotiate(LeaseNegotiator.java:63)
	at org.apache.ignite.internal.placementdriver.LeaseUpdater$Updater.lambda$updateLeaseBatchInternal$0(LeaseUpdater.java:590)
	at java.base/java.util.concurrent.CompletableFuture.uniWhenComplete(CompletableFuture.java:863)
	at java.base/java.util.concurrent.CompletableFuture$UniWhenComplete.tryFire(CompletableFuture.java:841)
	at java.base/java.util.concurrent.CompletableFuture.postComplete(CompletableFuture.java:510)
	at java.base/java.util.concurrent.CompletableFuture.complete(CompletableFuture.java:2147)
	at org.apache.ignite.internal.raft.RaftGroupServiceImpl.lambda$sendWithRetry$49(RaftGroupServiceImpl.java:603)
	at java.base/java.util.concurrent.CompletableFuture.uniWhenComplete(CompletableFuture.java:863)
	at java.base/java.util.concurrent.CompletableFuture$UniWhenComplete.tryFire(CompletableFuture.java:841)
	at java.base/java.util.concurrent.CompletableFuture.postComplete(CompletableFuture.java:510)
	at java.base/java.util.concurrent.CompletableFuture.complete(CompletableFuture.java:2147)
	at org.apache.ignite.internal.network.DefaultMessagingService.onInvokeResponse(DefaultMessagingService.java:575)
	at org.apache.ignite.internal.network.DefaultMessagingService.send0(DefaultMessagingService.java:261)
	at org.apache.ignite.internal.network.DefaultMessagingService.respond(DefaultMessagingService.java:204)
	at org.apache.ignite.internal.network.MessagingService.respond(MessagingService.java:107)
	at org.apache.ignite.raft.jraft.rpc.impl.IgniteRpcServer$NetworkRpcContext.sendResponse(IgniteRpcServer.java:240)
	at org.apache.ignite.raft.jraft.rpc.impl.ActionRequestProcessor.sendResponse(ActionRequestProcessor.java:268)
	at org.apache.ignite.raft.jraft.rpc.impl.ActionRequestProcessor$1.result(ActionRequestProcessor.java:177)
	at org.apache.ignite.internal.raft.server.impl.JraftServerImpl$WriteCommandIterator$1.result(JraftServerImpl.java:995)
	at org.apache.ignite.internal.metastorage.server.raft.MetaStorageWriteHandler$ResultCachingClosure.result(MetaStorageWriteHandler.java:453)
	at org.apache.ignite.internal.metastorage.server.raft.MetaStorageWriteHandler.handleWriteWithTime(MetaStorageWriteHandler.java:217)
	at org.apache.ignite.internal.metastorage.server.raft.MetaStorageWriteHandler.handleNonCachedWriteCommand(MetaStorageWriteHandler.java:153)
	at org.apache.ignite.internal.metastorage.server.raft.MetaStorageWriteHandler.handleWriteCommand(MetaStorageWriteHandler.java:123)
	at java.base/java.util.Iterator.forEachRemaining(Iterator.java:133)
	at org.apache.ignite.internal.metastorage.server.raft.MetaStorageListener.onWrite(MetaStorageListener.java:193)
	at org.apache.ignite.internal.raft.server.impl.JraftServerImpl$DelegatingStateMachine.onApply(JraftServerImpl.java:825)
	at org.apache.ignite.raft.jraft.core.FSMCallerImpl.doApplyTasks(FSMCallerImpl.java:570)
	at org.apache.ignite.raft.jraft.core.FSMCallerImpl.doCommitted(FSMCallerImpl.java:536)
	at org.apache.ignite.raft.jraft.core.FSMCallerImpl.runApplyTask(FSMCallerImpl.java:454)
	at org.apache.ignite.raft.jraft.core.FSMCallerImpl$ApplyTaskHandler.onEvent(FSMCallerImpl.java:123)
	at org.apache.ignite.raft.jraft.core.FSMCallerImpl$ApplyTaskHandler.onEvent(FSMCallerImpl.java:117)
	at org.apache.ignite.raft.jraft.disruptor.StripedDisruptor$StripeEntryHandler.onEvent(StripedDisruptor.java:322)
	at org.apache.ignite.raft.jraft.disruptor.StripedDisruptor$StripeEntryHandler.onEvent(StripedDisruptor.java:279)
	at com.lmax.disruptor.BatchEventProcessor.processEvents(BatchEventProcessor.java:167)
	at com.lmax.disruptor.BatchEventProcessor.run(BatchEventProcessor.java:122)
	at java.base/java.lang.Thread.run(Thread.java:840)
{code}
As a possible solution, we could throttle this log message.
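To illustrate what throttling this warning could look like, here is a minimal sketch. It is not the actual Ignite code: the class name LogThrottle, the method tryAcquire, and the choice of throttling key are all assumptions made for this example. The idea is to allow at most one log record per key per interval, so that hundreds of failed negotiations with the same unreachable leaseholder produce a single warning.

```java
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.ConcurrentMap;
import java.util.function.LongSupplier;

/** Hypothetical helper (not an Ignite class): permits one log record per key per interval. */
final class LogThrottle {
    private final long intervalNanos;
    private final LongSupplier nanoClock;
    private final ConcurrentMap<String, Long> lastLogged = new ConcurrentHashMap<>();

    /**
     * @param intervalNanos minimum time between two permitted records for the same key
     * @param nanoClock monotonic time source, for example System::nanoTime
     */
    LogThrottle(long intervalNanos, LongSupplier nanoClock) {
        this.intervalNanos = intervalNanos;
        this.nanoClock = nanoClock;
    }

    /** Returns true if the caller may log a record for this key now. */
    boolean tryAcquire(String key) {
        long now = nanoClock.getAsLong();
        boolean[] allowed = {false};

        // compute() is atomic per key, so concurrent callers cannot both win the same interval.
        lastLogged.compute(key, (k, prev) -> {
            if (prev == null || now - prev >= intervalNanos) {
                allowed[0] = true;
                return now;
            }
            return prev;
        });

        return allowed[0];
    }
}
```

A caller could key the throttle by the leaseholder name and exception type (an assumption, for example "gridgain-5.novalocal:UnresolvableConsistentIdException") and emit the warning only when tryAcquire returns true. Keying by leaseholder rather than by replication group is what collapses the per-partition repeats into one record. This sketch never evicts old keys, so a real implementation would need to bound the map.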