[ https://issues.apache.org/jira/browse/IGNITE-25678?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17973485#comment-17973485 ]
Roman Puchkovskiy edited comment on IGNITE-25678 at 6/13/25 12:45 PM: ---------------------------------------------------------------------- I wasn't able to reproduce the issue locally with 500 runs of the test, but this is reproducible with hundreds of runs on TC. It seems that the problem is that a primary replica may stay on the node on which we inhibit Metastorage events application. With a fix where primary replica is also transferred to another node, I did not see failures with many test repeatitions on TC. was (Author: rpuch): I wasn't able to reproduce the issue locally with 500 runs of the test, but this is reproducible with hundreds of runs on TC. It seems that the problem is that a primary replica may stay on the node on which we inhibit Metastorage events application. With a fix where primary replica is also transferred to another node, I did not see failures with many test repeats on TC. > ItSchemaSyncAndReplicationTest#laggingSchemasPreventPartitionDataReplication > is flaky > ------------------------------------------------------------------------------------- > > Key: IGNITE-25678 > URL: https://issues.apache.org/jira/browse/IGNITE-25678 > Project: Ignite > Issue Type: Improvement > Reporter: Roman Puchkovskiy > Assignee: Roman Puchkovskiy > Priority: Major > Labels: ignite-3 > Time Spent: 10m > Remaining Estimate: 0h > > [https://ci.ignite.apache.org/buildConfiguration/ApacheIgnite3xGradle_Test_IntegrationTests_ModuleRunner/9217885] > > > {code:java} > org.apache.ignite.tx.TransactionException: IGN-REP-3 Replication is timed > out [replicaGrpId=19_part_0] TraceId:48381a75 at > java.base@17.0.6/java.lang.invoke.MethodHandle.invokeWithArguments(MethodHandle.java:732) > at > app//org.apache.ignite.internal.util.ExceptionUtils$1.copy(ExceptionUtils.java:916) > at > app//org.apache.ignite.internal.util.ExceptionUtils$ExceptionFactory.createCopy(ExceptionUtils.java:850) > at > app//org.apache.ignite.internal.util.ExceptionUtils.copyExceptionWithCause(ExceptionUtils.java:652) > at > app//org.apache.ignite.internal.util.ViewUtils.copyExceptionWithCauseIfPossible(ViewUtils.java:91) > at > app//org.apache.ignite.internal.util.ViewUtils.ensurePublicException(ViewUtils.java:71) > at > app//org.apache.ignite.internal.util.ViewUtils.sync(ViewUtils.java:54) at > app//org.apache.ignite.internal.table.KeyValueBinaryViewImpl.put(KeyValueBinaryViewImpl.java:205) > at > app//org.apache.ignite.internal.table.KeyValueBinaryViewImpl.put(KeyValueBinaryViewImpl.java:69) > at > app//org.apache.ignite.internal.table.PublicApiThreadingKeyValueView.lambda$put$12(PublicApiThreadingKeyValueView.java:117) > at > app//org.apache.ignite.internal.thread.PublicApiThreading.lambda$execUserSyncOperation$1(PublicApiThreading.java:116) > at > app//org.apache.ignite.internal.thread.PublicApiThreading.executeWithRole(PublicApiThreading.java:144) > at > app//org.apache.ignite.internal.thread.PublicApiThreading.execUserSyncOperation(PublicApiThreading.java:102) > at > app//org.apache.ignite.internal.thread.PublicApiThreading.execUserSyncOperation(PublicApiThreading.java:115) > at > app//org.apache.ignite.internal.table.PublicApiThreadingViewBase.executeSyncOp(PublicApiThreadingViewBase.java:111) > at > app//org.apache.ignite.internal.table.PublicApiThreadingKeyValueView.put(PublicApiThreadingKeyValueView.java:117) > at > app//org.apache.ignite.internal.restart.RestartProofKeyValueView.lambda$put$12(RestartProofKeyValueView.java:118) > at > app//org.apache.ignite.internal.restart.RestartProofApiObject.lambda$consumeAttached$1(RestartProofApiObject.java:55) > at > app//org.apache.ignite.internal.restart.IgniteAttachmentLock.consumeAttached(IgniteAttachmentLock.java:75) > at > app//org.apache.ignite.internal.restart.RestartProofApiObject.consumeAttached(RestartProofApiObject.java:55) > at > app//org.apache.ignite.internal.restart.RestartProofKeyValueView.put(RestartProofKeyValueView.java:118) > at > app//org.apache.ignite.internal.schemasync.ItSchemaSyncAndReplicationTest.putToTableAt(ItSchemaSyncAndReplicationTest.java:140) > at > app//org.apache.ignite.internal.schemasync.ItSchemaSyncAndReplicationTest.laggingSchemasPreventPartitionDataReplication(ItSchemaSyncAndReplicationTest.java:92) > at java.base@17.0.6/java.lang.reflect.Method.invoke(Method.java:568) > at java.base@17.0.6/java.util.ArrayList.forEach(ArrayList.java:1511) at > java.base@17.0.6/java.util.ArrayList.forEach(ArrayList.java:1511) Caused > by: org.apache.ignite.tx.TransactionException: IGN-REP-3 Replication is timed > out [replicaGrpId=19_part_0] TraceId:48381a75 at > app//org.apache.ignite.internal.tx.TransactionExceptionMapperProvider.lambda$mappers$1(TransactionExceptionMapperProvider.java:44) > at > app//org.apache.ignite.internal.lang.IgniteExceptionMapper.map(IgniteExceptionMapper.java:61) > at > app//org.apache.ignite.internal.lang.IgniteExceptionMapperUtil.map(IgniteExceptionMapperUtil.java:195) > at > app//org.apache.ignite.internal.lang.IgniteExceptionMapperUtil.mapToPublicException(IgniteExceptionMapperUtil.java:125) > at > app//org.apache.ignite.internal.lang.IgniteExceptionMapperUtil.mapToPublicException(IgniteExceptionMapperUtil.java:88) > at > app//org.apache.ignite.internal.lang.IgniteExceptionMapperUtil.lambda$convertToPublicFuture$3(IgniteExceptionMapperUtil.java:178) > at > java.base@17.0.6/java.util.concurrent.CompletableFuture.uniHandle(CompletableFuture.java:934) > at > java.base@17.0.6/java.util.concurrent.CompletableFuture$UniHandle.tryFire(CompletableFuture.java:911) > at > java.base@17.0.6/java.util.concurrent.CompletableFuture.postComplete(CompletableFuture.java:510) > at > java.base@17.0.6/java.util.concurrent.CompletableFuture.completeExceptionally(CompletableFuture.java:2162) > at > app//org.apache.ignite.internal.replicator.ReplicaService.lambda$sendToReplicaRaw$1(ReplicaService.java:149) > at > java.base@17.0.6/java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1136) > at > java.base@17.0.6/java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:635) > at java.base@17.0.6/java.lang.Thread.run(Thread.java:833) Caused by: > org.apache.ignite.internal.replicator.exception.ReplicationTimeoutException: > IGN-REP-3 Replication is timed out [replicaGrpId=19_part_0] TraceId:48381a75 > ... 4 more {code} > -- This message was sent by Atlassian Jira (v8.20.10#820010)