[
https://issues.apache.org/jira/browse/CASSANDRA-19087?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sam Tunnicliffe updated CASSANDRA-19087:
----------------------------------------
Test and Documentation Plan: Run CI suite
Status: Patch Available (was: In Progress)
This appears to be due to in-jvm dtest instances marking internode messaging as
available too early during startup. The test repeatedly brings one of the nodes
down and back up again, while the other nodes remain up for the duration of the
test. When the bouncing node restarts, it replays the metadata log to
reconstruct cluster state. Because of the bug, the other nodes see it as
available before replay has completed and begin issuing reads and writes to it
before it is ready, which is why the read handler finds an empty
PlacementForRange. This is a test-only issue, specific to the mock internode
messaging implementation.
The linked patch moves the setting of the {{internodeMessagingStarted}} flag to
the right place. CI summary and results archive attached.
https://github.com/beobal/cassandra/commit/2503b0794bc4e7c7a5b4f7007518b9fe17118238
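For anyone following along, the ordering problem described above can be sketched roughly as follows. This is not the code from the patch: {{MockInstance}}, {{replayMetadataLog}}, {{handleRead}} and {{placementsLoaded}} are hypothetical names made up for illustration, and only the {{internodeMessagingStarted}} flag name comes from the comment. The sketch assumes the "right place" is after log replay has finished, which is inferred from the description rather than read from the commit.

```java
import java.util.concurrent.atomic.AtomicBoolean;

public class StartupOrderingSketch {

    // Hypothetical stand-in for an in-jvm dtest instance; not the real Instance class.
    static final class MockInstance {
        private final AtomicBoolean internodeMessagingStarted = new AtomicBoolean(false);
        private volatile boolean placementsLoaded = false;

        // Assumed ordering: rebuild cluster state first, then advertise availability.
        void startup() {
            replayMetadataLog();
            internodeMessagingStarted.set(true);
        }

        // Placeholder for replaying the metadata log to reconstruct cluster state.
        private void replayMetadataLog() {
            placementsLoaded = true;
        }

        // Peers only deliver messages once the flag is set.
        String handleRead(long token) {
            if (!internodeMessagingStarted.get()) {
                return "dropped";
            }
            if (!placementsLoaded) {
                // Reachable only if the flag were set before replay completed.
                throw new IllegalStateException("Could not find range for token " + token);
            }
            return "served";
        }
    }

    public static void main(String[] args) {
        MockInstance node = new MockInstance();
        System.out.println(node.handleRead(42L)); // prints "dropped"
        node.startup();
        System.out.println(node.handleRead(42L)); // prints "served"
    }
}
```

If the flag were set before {{replayMetadataLog()}} in this sketch, a read arriving in between would hit the {{IllegalStateException}} branch, which mirrors the shape of the suppressed exception in the reported trace.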
> Test Failure: org.apache.cassandra.distributed.test.log.FetchLogFromPeersTest.testSchema-_jdk11
> -----------------------------------------------------------------------------------------------
>
> Key: CASSANDRA-19087
> URL: https://issues.apache.org/jira/browse/CASSANDRA-19087
> Project: Cassandra
> Issue Type: Bug
> Components: CI
> Reporter: Michael Semb Wever
> Assignee: Sam Tunnicliffe
> Priority: Normal
> Fix For: 5.1-alpha1
>
> Attachments: ci_summary.html, result_details.tar.gz
>
>
> In j11_jvm_dtests_vnode, ref CASSANDRA-19034
> https://app.circleci.com/pipelines/github/mike-tr-adamson/cassandra/402/workflows/92aacb84-fd3a-48e0-9fb2-d1e2fe6fc71a/jobs/35324/tests
> {noformat}
> org.apache.cassandra.distributed.shared.ShutdownException: Uncaught exceptions were thrown during test
> at org.apache.cassandra.distributed.impl.AbstractCluster.checkAndResetUncaughtExceptions(AbstractCluster.java:1124)
> at org.apache.cassandra.distributed.impl.AbstractCluster.close(AbstractCluster.java:1110)
> at org.apache.cassandra.distributed.test.log.FetchLogFromPeersTest.testSchema(FetchLogFromPeersTest.java:75)
> at java.base/jdk.internal.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
> at java.base/jdk.internal.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
> at java.base/jdk.internal.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
> Suppressed: java.lang.IllegalStateException: Could not find range for token 356242581507269238 in PlacementForRange: {}
> at org.apache.cassandra.tcm.ownership.PlacementForRange.forToken(PlacementForRange.java:119)
> at org.apache.cassandra.db.ReadCommandVerbHandler.getLocalReplica(ReadCommandVerbHandler.java:218)
> at org.apache.cassandra.db.ReadCommandVerbHandler.checkTokenOwnership(ReadCommandVerbHandler.java:159)
> at org.apache.cassandra.db.ReadCommandVerbHandler.doVerb(ReadCommandVerbHandler.java:57)
> at org.apache.cassandra.net.InboundSink.lambda$new$0(InboundSink.java:102)
> at org.apache.cassandra.net.InboundSink.accept(InboundSink.java:122)
> at org.apache.cassandra.distributed.impl.Instance.lambda$receiveMessageRunnable$6(Instance.java:538)
> at org.apache.cassandra.concurrent.ExecutionFailure$1.run(ExecutionFailure.java:133)
> at org.apache.cassandra.concurrent.SEPWorker.run(SEPWorker.java:143)
> at io.netty.util.concurrent.FastThreadLocalRunnable.run(FastThreadLocalRunnable.java:30)
> at java.base/java.lang.Thread.run(Thread.java:829)
> {noformat}