[
https://issues.apache.org/jira/browse/IGNITE-23690?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17899496#comment-17899496
]
Roman Puchkovskiy commented on IGNITE-23690:
--------------------------------------------
The patch looks good to me
> ItTableRaftSnapshotsTest.txSemanticsIsMaintained is flaky
> ---------------------------------------------------------
>
> Key: IGNITE-23690
> URL: https://issues.apache.org/jira/browse/IGNITE-23690
> Project: Ignite
> Issue Type: Bug
> Reporter: Roman Puchkovskiy
> Assignee: Alexander Lapin
> Priority: Major
> Labels: ignite-3
> Time Spent: 10m
> Remaining Estimate: 0h
>
> I observed two kinds of failures:
> Caused by:
> org.apache.ignite.internal.network.UnresolvableConsistentIdException:
> IGN-NETWORK-1 TraceId:00e308b8-3cd9-4f26-ac2b-32a31448022c Recipient
> consistent ID cannot be resolved: itrst_tsim_2 at
> org.apache.ignite.internal.network.DefaultMessagingService.invoke(DefaultMessagingService.java:243)
> at
> org.apache.ignite.internal.network.wrapper.JumpToExecutorByConsistentIdAfterSend.invoke(JumpToExecutorByConsistentIdAfterSend.java:97)
> at
> org.apache.ignite.internal.network.MessagingService.invoke(MessagingService.java:198)
> at
> org.apache.ignite.internal.tx.impl.TxMessageSender.cleanup(TxMessageSender.java:131)
> at
> org.apache.ignite.internal.tx.impl.TxCleanupRequestSender.sendCleanupMessageWithRetries(TxCleanupRequestSender.java:240)
> at
> org.apache.ignite.internal.tx.impl.TxCleanupRequestSender.cleanupPartitions(TxCleanupRequestSender.java:226)
> at
> org.apache.ignite.internal.tx.impl.TxCleanupRequestSender.cleanup(TxCleanupRequestSender.java:165)
> at
> org.apache.ignite.internal.tx.impl.TxManagerImpl.cleanup(TxManagerImpl.java:819)
> at
> org.apache.ignite.internal.table.distributed.replicator.PartitionReplicaListener.lambda$finishAndCleanup$58(PartitionReplicaListener.java:1691)
> at
> java.base/java.util.concurrent.CompletableFuture$UniCompose.tryFire(CompletableFuture.java:1072)
> ... 12 more
> and
> Caused by: org.apache.ignite.internal.tx.impl.PrimaryReplicaExpiredException:
> IGN-TX-13 TraceId:d2f4f111-dabc-4f4a-b41f-280552d42656 Primary replica has
> expired, transaction will be rolled back: [groupId = 16_part_0, expected
> enlistment consistency token = 113499281800495104, commit timestamp = null,
> current primary replica = null] at
> org.apache.ignite.internal.tx.impl.TransactionInflights$ReadWriteTxContext.cancelWaitingInflights(TransactionInflights.java:318)
> at
> org.apache.ignite.internal.tx.impl.TransactionInflights.cancelWaitingInflights(TransactionInflights.java:134)
> at
> org.apache.ignite.internal.tx.impl.TxManagerImpl.lambda$primaryReplicaEventListener$0(TxManagerImpl.java:351)
> at
> org.apache.ignite.internal.util.IgniteUtils.inBusyLock(IgniteUtils.java:835)
> at
> org.apache.ignite.internal.tx.impl.TxManagerImpl.primaryReplicaEventListener(TxManagerImpl.java:344)
> at
> org.apache.ignite.internal.tx.impl.TxManagerImpl.primaryReplicaExpiredListener(TxManagerImpl.java:368)
> at
> org.apache.ignite.internal.event.AbstractEventProducer.fireEvent(AbstractEventProducer.java:88)
> at
> org.apache.ignite.internal.placementdriver.leases.LeaseTracker.fireEventPrimaryReplicaExpired(LeaseTracker.java:414)
> at
> org.apache.ignite.internal.placementdriver.leases.LeaseTracker$UpdateListener.lambda$onUpdate$2(LeaseTracker.java:225)
> It seems that in both cases the transaction has to be eventually committed as
> the failure is temporal.
> h3. Upd 1
> Occurred that it's a test issue: test will fail if node with index 2 that we
> knock out is primary.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)