[ 
https://issues.apache.org/jira/browse/IGNITE-25678?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17973485#comment-17973485
 ] 

Roman Puchkovskiy edited comment on IGNITE-25678 at 6/13/25 12:45 PM:
----------------------------------------------------------------------

I wasn't able to reproduce the issue locally with 500 runs of the test, but 
this is reproducible with hundreds of runs on TC.

It seems that the problem is that a primary replica may stay on the node on 
which we inhibit Metastorage events application. With a fix where primary 
replica is also transferred to another node, I did not see failures with many 
test repeatitions on TC.


was (Author: rpuch):
I wasn't able to reproduce the issue locally with 500 runs of the test, but 
this is reproducible with hundreds of runs on TC.

It seems that the problem is that a primary replica may stay on the node on 
which we inhibit Metastorage events application. With a fix where primary 
replica is also transferred to another node, I did not see failures with many 
test repeats on TC.

> ItSchemaSyncAndReplicationTest#laggingSchemasPreventPartitionDataReplication 
> is flaky
> -------------------------------------------------------------------------------------
>
>                 Key: IGNITE-25678
>                 URL: https://issues.apache.org/jira/browse/IGNITE-25678
>             Project: Ignite
>          Issue Type: Improvement
>            Reporter: Roman Puchkovskiy
>            Assignee: Roman Puchkovskiy
>            Priority: Major
>              Labels: ignite-3
>          Time Spent: 10m
>  Remaining Estimate: 0h
>
> [https://ci.ignite.apache.org/buildConfiguration/ApacheIgnite3xGradle_Test_IntegrationTests_ModuleRunner/9217885]
>  
>  
> {code:java}
>  org.apache.ignite.tx.TransactionException: IGN-REP-3 Replication is timed 
> out [replicaGrpId=19_part_0] TraceId:48381a75     at 
> java.base@17.0.6/java.lang.invoke.MethodHandle.invokeWithArguments(MethodHandle.java:732)
>      at 
> app//org.apache.ignite.internal.util.ExceptionUtils$1.copy(ExceptionUtils.java:916)
>      at 
> app//org.apache.ignite.internal.util.ExceptionUtils$ExceptionFactory.createCopy(ExceptionUtils.java:850)
>      at 
> app//org.apache.ignite.internal.util.ExceptionUtils.copyExceptionWithCause(ExceptionUtils.java:652)
>      at 
> app//org.apache.ignite.internal.util.ViewUtils.copyExceptionWithCauseIfPossible(ViewUtils.java:91)
>      at 
> app//org.apache.ignite.internal.util.ViewUtils.ensurePublicException(ViewUtils.java:71)
>      at 
> app//org.apache.ignite.internal.util.ViewUtils.sync(ViewUtils.java:54)     at 
> app//org.apache.ignite.internal.table.KeyValueBinaryViewImpl.put(KeyValueBinaryViewImpl.java:205)
>      at 
> app//org.apache.ignite.internal.table.KeyValueBinaryViewImpl.put(KeyValueBinaryViewImpl.java:69)
>      at 
> app//org.apache.ignite.internal.table.PublicApiThreadingKeyValueView.lambda$put$12(PublicApiThreadingKeyValueView.java:117)
>      at 
> app//org.apache.ignite.internal.thread.PublicApiThreading.lambda$execUserSyncOperation$1(PublicApiThreading.java:116)
>      at 
> app//org.apache.ignite.internal.thread.PublicApiThreading.executeWithRole(PublicApiThreading.java:144)
>      at 
> app//org.apache.ignite.internal.thread.PublicApiThreading.execUserSyncOperation(PublicApiThreading.java:102)
>      at 
> app//org.apache.ignite.internal.thread.PublicApiThreading.execUserSyncOperation(PublicApiThreading.java:115)
>      at 
> app//org.apache.ignite.internal.table.PublicApiThreadingViewBase.executeSyncOp(PublicApiThreadingViewBase.java:111)
>      at 
> app//org.apache.ignite.internal.table.PublicApiThreadingKeyValueView.put(PublicApiThreadingKeyValueView.java:117)
>      at 
> app//org.apache.ignite.internal.restart.RestartProofKeyValueView.lambda$put$12(RestartProofKeyValueView.java:118)
>      at 
> app//org.apache.ignite.internal.restart.RestartProofApiObject.lambda$consumeAttached$1(RestartProofApiObject.java:55)
>      at 
> app//org.apache.ignite.internal.restart.IgniteAttachmentLock.consumeAttached(IgniteAttachmentLock.java:75)
>      at 
> app//org.apache.ignite.internal.restart.RestartProofApiObject.consumeAttached(RestartProofApiObject.java:55)
>      at 
> app//org.apache.ignite.internal.restart.RestartProofKeyValueView.put(RestartProofKeyValueView.java:118)
>      at 
> app//org.apache.ignite.internal.schemasync.ItSchemaSyncAndReplicationTest.putToTableAt(ItSchemaSyncAndReplicationTest.java:140)
>      at 
> app//org.apache.ignite.internal.schemasync.ItSchemaSyncAndReplicationTest.laggingSchemasPreventPartitionDataReplication(ItSchemaSyncAndReplicationTest.java:92)
>      at java.base@17.0.6/java.lang.reflect.Method.invoke(Method.java:568)     
> at java.base@17.0.6/java.util.ArrayList.forEach(ArrayList.java:1511)     at 
> java.base@17.0.6/java.util.ArrayList.forEach(ArrayList.java:1511)   Caused 
> by: org.apache.ignite.tx.TransactionException: IGN-REP-3 Replication is timed 
> out [replicaGrpId=19_part_0] TraceId:48381a75     at 
> app//org.apache.ignite.internal.tx.TransactionExceptionMapperProvider.lambda$mappers$1(TransactionExceptionMapperProvider.java:44)
>      at 
> app//org.apache.ignite.internal.lang.IgniteExceptionMapper.map(IgniteExceptionMapper.java:61)
>      at 
> app//org.apache.ignite.internal.lang.IgniteExceptionMapperUtil.map(IgniteExceptionMapperUtil.java:195)
>      at 
> app//org.apache.ignite.internal.lang.IgniteExceptionMapperUtil.mapToPublicException(IgniteExceptionMapperUtil.java:125)
>      at 
> app//org.apache.ignite.internal.lang.IgniteExceptionMapperUtil.mapToPublicException(IgniteExceptionMapperUtil.java:88)
>      at 
> app//org.apache.ignite.internal.lang.IgniteExceptionMapperUtil.lambda$convertToPublicFuture$3(IgniteExceptionMapperUtil.java:178)
>      at 
> java.base@17.0.6/java.util.concurrent.CompletableFuture.uniHandle(CompletableFuture.java:934)
>      at 
> java.base@17.0.6/java.util.concurrent.CompletableFuture$UniHandle.tryFire(CompletableFuture.java:911)
>      at 
> java.base@17.0.6/java.util.concurrent.CompletableFuture.postComplete(CompletableFuture.java:510)
>      at 
> java.base@17.0.6/java.util.concurrent.CompletableFuture.completeExceptionally(CompletableFuture.java:2162)
>      at 
> app//org.apache.ignite.internal.replicator.ReplicaService.lambda$sendToReplicaRaw$1(ReplicaService.java:149)
>      at 
> java.base@17.0.6/java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1136)
>      at 
> java.base@17.0.6/java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:635)
>      at java.base@17.0.6/java.lang.Thread.run(Thread.java:833)   Caused by: 
> org.apache.ignite.internal.replicator.exception.ReplicationTimeoutException: 
> IGN-REP-3 Replication is timed out [replicaGrpId=19_part_0] TraceId:48381a75  
>    ... 4 more {code}
>  



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to