[jira] [Commented] (IGNITE-20628) testDropColumn and testMergeChangesAddDropAdd in ItSchemaChangeKvViewTest are disabled

2024-04-05 Thread Denis Chudov (Jira)


[ https://issues.apache.org/jira/browse/IGNITE-20628?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17834245#comment-17834245 ]

Denis Chudov commented on IGNITE-20628:
---

IGNITE-21572 is resolved, but because this error occurs rarely, we need to 
monitor TeamCity for some time (about a month). If the error no longer 
reproduces after that, we can close this ticket.

> testDropColumn and testMergeChangesAddDropAdd in ItSchemaChangeKvViewTest are 
> disabled
> --
>
>                 Key: IGNITE-20628
>                 URL: https://issues.apache.org/jira/browse/IGNITE-20628
>             Project: Ignite
>          Issue Type: Bug
>            Reporter: Roman Puchkovskiy
>            Priority: Major
>              Labels: ignite-3, tech-debt
>             Fix For: 3.0.0-beta2
>
>
> IGNITE-17931 was suspected to be the culprit, but the tests are still flaky 
> even after the blocking code was removed.
> The tests fail with one of 3 symptoms:
>  # An NPE happens in the test method code: a get returns no value for a key 
> that was put earlier. This is probably caused by a bug in the transactional 
> protocol implementation, possibly IGNITE-20116.
>  # A PrimaryReplicaAwaitTimeoutException
>  # A ReplicationTimeoutException
> Items 2 and 3 need to be investigated.
> h2. A stacktrace for 1
> java.lang.NullPointerException
>     at org.apache.ignite.internal.runner.app.ItSchemaChangeKvViewTest.testDropColumn(ItSchemaChangeKvViewTest.java:58)
> h2. A stacktrace for 2
> org.apache.ignite.tx.TransactionException: IGN-PLACEMENTDRIVER-1 TraceId:0a32c369-b9ca-4091-b8de-af15d65a1f52 Failed to get the primary replica [tablePartitionId=3_part_5, awaitTimestamp=HybridTimestamp [time=111220884095959043, physical=1697096009765, logical=3]]
>     at org.apache.ignite.internal.util.ExceptionUtils.lambda$withCause$1(ExceptionUtils.java:400)
>     at org.apache.ignite.internal.util.ExceptionUtils.withCauseInternal(ExceptionUtils.java:461)
>     at org.apache.ignite.internal.util.ExceptionUtils.withCause(ExceptionUtils.java:400)
>     at org.apache.ignite.internal.table.distributed.storage.InternalTableImpl.lambda$enlist$71(InternalTableImpl.java:1659)
>     at java.base/java.util.concurrent.CompletableFuture.uniHandle(CompletableFuture.java:930)
>     at java.base/java.util.concurrent.CompletableFuture$UniHandle.tryFire(CompletableFuture.java:907)
>     at java.base/java.util.concurrent.CompletableFuture.postComplete(CompletableFuture.java:506)
>     at java.base/java.util.concurrent.CompletableFuture.completeExceptionally(CompletableFuture.java:2088)
>     at java.base/java.util.concurrent.CompletableFuture$Timeout.run(CompletableFuture.java:2792)
>     at java.base/java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:515)
>     at java.base/java.util.concurrent.FutureTask.run(FutureTask.java:264)
>     at java.base/java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.run(ScheduledThreadPoolExecutor.java:304)
>     at java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1128)
>     at java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:628)
>     at java.base/java.lang.Thread.run(Thread.java:834)
> Caused by: java.util.concurrent.CompletionException: org.apache.ignite.internal.placementdriver.PrimaryReplicaAwaitTimeoutException: IGN-PLACEMENTDRIVER-1 TraceId:0a32c369-b9ca-4091-b8de-af15d65a1f52 The primary replica await timed out [replicationGroupId=3_part_5, referenceTimestamp=HybridTimestamp [time=111220884095959043, physical=1697096009765, logical=3], currentLease=Lease [leaseholder=isckvt_tmcada_3346, accepted=false, startTime=HybridTimestamp [time=111220884127809550, physical=1697096010251, logical=14], expirationTime=HybridTimestamp [time=111220891992129536, physical=1697096130251, logical=0], prolongable=false, replicationGroupId=3_part_5]]
>     at java.base/java.util.concurrent.CompletableFuture.encodeThrowable(CompletableFuture.java:314)
>     at java.base/java.util.concurrent.CompletableFuture.completeThrowable(CompletableFuture.java:319)
>     at java.base/java.util.concurrent.CompletableFuture.uniExceptionally(CompletableFuture.java:990)
>     at java.base/java.util.concurrent.CompletableFuture$UniExceptionally.tryFire(CompletableFuture.java:970)
>     ... 9 more
> Caused by: org.apache.ignite.internal.placementdriver.PrimaryReplicaAwaitTimeoutException: IGN-PLACEMENTDRIVER-1 TraceId:0a32c369-b9ca-4091-b8de-af15d65a1f52 The primary replica await timed out [replicationGroupId=3_part_5, referenceTimestamp=HybridTimestamp [time=111220884095959043, physical=1697096009765, logical=3], currentLease=Lease [leaseholder=isckvt_tmcada_3346, accepted=false, startTime=HybridTimestamp
>
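
When reading the lease details in the stack trace for symptom 2, it helps to decode the HybridTimestamp values. In all three timestamps logged above, `time` equals `physical * 2^16 + logical`. That layout is inferred from the logged numbers only, not taken from the Ignite source, so treat the 16-bit split as an assumption. The class and method names below are hypothetical. The sketch decodes the three `time` values from the trace and prints how the lease interval relates to the awaited reference timestamp.

```java
import java.time.Instant;

public class HybridTimestampSketch {
    // Assumption inferred from the logged values: the low 16 bits hold the
    // logical counter and the remaining high bits hold epoch milliseconds.
    static final int LOGICAL_BITS = 16;

    /** Physical part (epoch milliseconds) of a packed timestamp. */
    static long physical(long time) {
        return time >>> LOGICAL_BITS;
    }

    /** Logical counter part of a packed timestamp. */
    static int logical(long time) {
        return (int) (time & ((1L << LOGICAL_BITS) - 1));
    }

    public static void main(String[] args) {
        // The three "time" values copied from the stack trace.
        long reference = 111220884095959043L;   // referenceTimestamp
        long leaseStart = 111220884127809550L;  // currentLease.startTime
        long leaseEnd = 111220891992129536L;    // currentLease.expirationTime

        System.out.println("reference wall clock: "
                + Instant.ofEpochMilli(physical(reference)));

        // Lease start relative to the awaited reference timestamp, in ms.
        System.out.println("lease starts after reference by ms: "
                + (physical(leaseStart) - physical(reference)));   // 486

        // Lease length in ms.
        System.out.println("lease length ms: "
                + (physical(leaseEnd) - physical(leaseStart)));    // 120000
    }
}
```

Under that assumption, the logged lease starts 486 ms after the reference timestamp and spans 120 seconds, and the message reports it with `accepted=false`.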

[jira] [Commented] (IGNITE-20628) testDropColumn and testMergeChangesAddDropAdd in ItSchemaChangeKvViewTest are disabled

2024-03-15 Thread Vyacheslav Koptilin (Jira)


[ https://issues.apache.org/jira/browse/IGNITE-20628?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17827475#comment-17827475 ]

Vyacheslav Koptilin commented on IGNITE-20628:
--

It looks like this ticket depends on IGNITE-21572. We need to recheck it once 
that fix is ready.
