[ https://issues.apache.org/jira/browse/IGNITE-20628?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17827475#comment-17827475 ]
Vyacheslav Koptilin commented on IGNITE-20628: ---------------------------------------------- It looks like, this ticket depends on IGNITE-21572. Need to check when the fix is ready. > testDropColumn and testMergeChangesAddDropAdd in ItSchemaChangeKvViewTest are > disabled > -------------------------------------------------------------------------------------- > > Key: IGNITE-20628 > URL: https://issues.apache.org/jira/browse/IGNITE-20628 > Project: Ignite > Issue Type: Bug > Reporter: Roman Puchkovskiy > Priority: Major > Labels: ignite-3, tech-debt > Fix For: 3.0.0-beta2 > > > It was supposed that IGNITE-17931 was the culprit, but even after removing > the blocking code the tests are still flaky. > The tests fail with one of 3 symptoms: > # An NPE happens in the test method code: a value by a key for which a put > is made earlier is not found when using the same key. This is probably caused > by a transactional protocol implementation bug, maybe this: IGNITE-20116 > # A PrimaryReplicaAwaitTimeoutException > # A ReplicationTimeoutException > Items 2 and 3 need to be investigated. > h2. A stacktrace for 1 > java.lang.NullPointerException > at > org.apache.ignite.internal.runner.app.ItSchemaChangeKvViewTest.testDropColumn(ItSchemaChangeKvViewTest.java:58) > h2. A stacktrace for 2 > org.apache.ignite.tx.TransactionException: IGN-PLACEMENTDRIVER-1 > TraceId:0a32c369-b9ca-4091-b8de-af15d65a1f52 Failed to get the primary > replica [tablePartitionId=3_part_5, awaitTimestamp=HybridTimestamp > [time=111220884095959043, physical=1697096009765, logical=3]] > > at > org.apache.ignite.internal.util.ExceptionUtils.lambda$withCause$1(ExceptionUtils.java:400) > at > org.apache.ignite.internal.util.ExceptionUtils.withCauseInternal(ExceptionUtils.java:461) > at > org.apache.ignite.internal.util.ExceptionUtils.withCause(ExceptionUtils.java:400) > at > org.apache.ignite.internal.table.distributed.storage.InternalTableImpl.lambda$enlist$71(InternalTableImpl.java:1659) > at > java.base/java.util.concurrent.CompletableFuture.uniHandle(CompletableFuture.java:930) > at > java.base/java.util.concurrent.CompletableFuture$UniHandle.tryFire(CompletableFuture.java:907) > at > java.base/java.util.concurrent.CompletableFuture.postComplete(CompletableFuture.java:506) > at > java.base/java.util.concurrent.CompletableFuture.completeExceptionally(CompletableFuture.java:2088) > at > java.base/java.util.concurrent.CompletableFuture$Timeout.run(CompletableFuture.java:2792) > at > java.base/java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:515) > at java.base/java.util.concurrent.FutureTask.run(FutureTask.java:264) > at > java.base/java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.run(ScheduledThreadPoolExecutor.java:304) > at > java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1128) > at > java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:628) > at java.base/java.lang.Thread.run(Thread.java:834) > Caused by: java.util.concurrent.CompletionException: > org.apache.ignite.internal.placementdriver.PrimaryReplicaAwaitTimeoutException: > IGN-PLACEMENTDRIVER-1 TraceId:0a32c369-b9ca-4091-b8de-af15d65a1f52 The > primary replica await timed out [replicationGroupId=3_part_5, > referenceTimestamp=HybridTimestamp [time=111220884095959043, > physical=1697096009765, logical=3], currentLease=Lease > [leaseholder=isckvt_tmcada_3346, accepted=false, startTime=HybridTimestamp > [time=111220884127809550, physical=1697096010251, logical=14], > expirationTime=HybridTimestamp [time=111220891992129536, > physical=1697096130251, logical=0], prolongable=false, > replicationGroupId=3_part_5]] > at > java.base/java.util.concurrent.CompletableFuture.encodeThrowable(CompletableFuture.java:314) > at > java.base/java.util.concurrent.CompletableFuture.completeThrowable(CompletableFuture.java:319) > at > java.base/java.util.concurrent.CompletableFuture.uniExceptionally(CompletableFuture.java:990) > at > java.base/java.util.concurrent.CompletableFuture$UniExceptionally.tryFire(CompletableFuture.java:970) > ... 9 more > Caused by: > org.apache.ignite.internal.placementdriver.PrimaryReplicaAwaitTimeoutException: > IGN-PLACEMENTDRIVER-1 TraceId:0a32c369-b9ca-4091-b8de-af15d65a1f52 The > primary replica await timed out [replicationGroupId=3_part_5, > referenceTimestamp=HybridTimestamp [time=111220884095959043, > physical=1697096009765, logical=3], currentLease=Lease > [leaseholder=isckvt_tmcada_3346, accepted=false, startTime=HybridTimestamp > [time=111220884127809550, physical=1697096010251, logical=14], > expirationTime=HybridTimestamp [time=111220891992129536, > physical=1697096130251, logical=0], prolongable=false, > replicationGroupId=3_part_5]] > at > org.apache.ignite.internal.placementdriver.leases.LeaseTracker.lambda$awaitPrimaryReplica$2(LeaseTracker.java:229) > at > java.base/java.util.concurrent.CompletableFuture.uniExceptionally(CompletableFuture.java:986) > ... 10 more > Caused by: java.util.concurrent.TimeoutException > ... 7 more > h2. A stacktrace for 3 > org.apache.ignite.tx.TransactionException: IGN-REP-3 > TraceId:d41dcd22-5370-47cd-837b-c23268480162 Replication is timed out > [replicaGrpId=3_part_5] > > at > org.apache.ignite.internal.util.ExceptionUtils.lambda$withCause$0(ExceptionUtils.java:378) > at > org.apache.ignite.internal.util.ExceptionUtils.withCauseInternal(ExceptionUtils.java:461) > at > org.apache.ignite.internal.util.ExceptionUtils.withCause(ExceptionUtils.java:378) > at > org.apache.ignite.internal.table.distributed.storage.InternalTableImpl.wrapReplicationException(InternalTableImpl.java:1891) > at > org.apache.ignite.internal.table.distributed.storage.InternalTableImpl.lambda$postEnlist$12(InternalTableImpl.java:564) > at > java.base/java.util.concurrent.CompletableFuture.uniHandle(CompletableFuture.java:930) > at > java.base/java.util.concurrent.CompletableFuture$UniHandle.tryFire(CompletableFuture.java:907) > at > java.base/java.util.concurrent.CompletableFuture.postComplete(CompletableFuture.java:506) > at > java.base/java.util.concurrent.CompletableFuture.completeExceptionally(CompletableFuture.java:2088) > at > org.apache.ignite.internal.table.distributed.storage.InternalTableImpl.lambda$enlistWithRetry$7(InternalTableImpl.java:483) > at > java.base/java.util.concurrent.CompletableFuture.uniHandle(CompletableFuture.java:930) > at > java.base/java.util.concurrent.CompletableFuture$UniHandle.tryFire(CompletableFuture.java:907) > at > java.base/java.util.concurrent.CompletableFuture.postComplete(CompletableFuture.java:506) > at > java.base/java.util.concurrent.CompletableFuture.completeExceptionally(CompletableFuture.java:2088) > at > org.apache.ignite.internal.replicator.ReplicaService.lambda$sendToReplica$3(ReplicaService.java:97) > at > java.base/java.util.concurrent.CompletableFuture.uniWhenComplete(CompletableFuture.java:859) > at > java.base/java.util.concurrent.CompletableFuture$UniWhenComplete.tryFire(CompletableFuture.java:837) > at > java.base/java.util.concurrent.CompletableFuture$Completion.exec(CompletableFuture.java:479) > at java.base/java.util.concurrent.ForkJoinTask.doExec(ForkJoinTask.java:290) > at > java.base/java.util.concurrent.ForkJoinPool$WorkQueue.topLevelExec(ForkJoinPool.java:1020) > at java.base/java.util.concurrent.ForkJoinPool.scan(ForkJoinPool.java:1656) > at > java.base/java.util.concurrent.ForkJoinPool.runWorker(ForkJoinPool.java:1594) > at > java.base/java.util.concurrent.ForkJoinWorkerThread.run(ForkJoinWorkerThread.java:177) > Caused by: > org.apache.ignite.internal.replicator.exception.ReplicationTimeoutException: > IGN-REP-3 TraceId:d41dcd22-5370-47cd-837b-c23268480162 Replication is timed > out [replicaGrpId=3_part_5] > ... 9 more > -- This message was sent by Atlassian Jira (v8.20.10#820010)