[ 
https://issues.apache.org/jira/browse/CASSANDRA-18707?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17766905#comment-17766905
 ] 

Doug Rohrer commented on CASSANDRA-18707:
-----------------------------------------

{quote}Doug Rohrer , I noticed you were the one who tested and extended the 
wait time from 60 to 70 due to a race with a hard-coded 60-second timeout in 
MigrationManager MIGRATION_DELAY_IN_MS. Any reason why 140 wouldn't be a good 
idea here? {quote}

In hindsight, I should have made it a configurable value back then, and we 
should probably make it configurable now. Bumping the default to 140 seems 
reasonable, but I think we should make it easier for folks with their own CI 
environments to raise it if they need to, without having to change the code 
again. Thoughts on adding a new property and defaulting _that_ to 140?
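
To make the suggestion concrete, here is a minimal sketch of reading such a 
timeout from a system property with a default of 140 seconds. The property 
name and the class are hypothetical, made up for illustration; they are not 
existing Cassandra or dtest identifiers, and the real change may well look 
different.

```java
import java.time.Duration;

public final class SchemaAgreementTimeout
{
    // Hypothetical property name, chosen only for this sketch.
    public static final String PROPERTY = "cassandra.test.schema_agreement_timeout_seconds";

    // Proposed default from the discussion above.
    public static final long DEFAULT_SECONDS = 140;

    private SchemaAgreementTimeout() {}

    // Resolve the timeout from the JVM system properties.
    public static Duration resolve()
    {
        return resolve(System.getProperty(PROPERTY));
    }

    // Parse a raw property value; fall back to the default when the value
    // is missing, blank, non-numeric, or not strictly positive.
    public static Duration resolve(String raw)
    {
        if (raw == null || raw.trim().isEmpty())
            return Duration.ofSeconds(DEFAULT_SECONDS);
        try
        {
            long seconds = Long.parseLong(raw.trim());
            return seconds > 0 ? Duration.ofSeconds(seconds)
                               : Duration.ofSeconds(DEFAULT_SECONDS);
        }
        catch (NumberFormatException e)
        {
            return Duration.ofSeconds(DEFAULT_SECONDS);
        }
    }
}
```

With something like this in place, a CI environment could pass 
-Dcassandra.test.schema_agreement_timeout_seconds=200 (again, a hypothetical 
name) to the test JVM instead of patching the code.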

I do hate to just bump the timeout without understanding why the cluster 
hung for 47 seconds, but if we've tried to reproduce it a bunch of times and 
it's just not happening, it seems like the only reasonable option for now. By 
making it configurable, we can also try to reproduce it locally and diagnose 
the root cause (if there is one).

> Test failure: 
> junit.framework.TestSuite.org.apache.cassandra.distributed.test.CASMultiDCTest-.jdk11
>  
> ----------------------------------------------------------------------------------------------------
>
>                 Key: CASSANDRA-18707
>                 URL: https://issues.apache.org/jira/browse/CASSANDRA-18707
>             Project: Cassandra
>          Issue Type: Bug
>          Components: Test/dtest/java
>            Reporter: Ekaterina Dimitrova
>            Assignee: Berenguer Blasi
>            Priority: Normal
>             Fix For: 4.0.x, 4.1.x, 5.0.x, 5.x
>
>         Attachments: TESTS-TestSuites.xml.xz
>
>
> Seen here:
> [https://ci-cassandra.apache.org/job/Cassandra-trunk/1650/testReport/junit.framework/TestSuite/org_apache_cassandra_distributed_test_CASMultiDCTest__jdk11/]
> {code:java}
> Error Message
> Schema agreement not reached. Schema versions of the instances: 
> [ef1c8e05-a06d-388d-a46d-53cc22a94762, 6c386108-1805-3985-b48e-8016012a0207, 
> 6c386108-1805-3985-b48e-8016012a0207, ef1c8e05-a06d-388d-a46d-53cc22a94762]
> Stacktrace
> java.lang.IllegalStateException: Schema agreement not reached. Schema versions of the instances: [ef1c8e05-a06d-388d-a46d-53cc22a94762, 6c386108-1805-3985-b48e-8016012a0207, 6c386108-1805-3985-b48e-8016012a0207, ef1c8e05-a06d-388d-a46d-53cc22a94762]
>  at org.apache.cassandra.distributed.impl.AbstractCluster$ChangeMonitor.waitForCompletion(AbstractCluster.java:907)
>  at org.apache.cassandra.distributed.impl.AbstractCluster.lambda$schemaChange$8(AbstractCluster.java:836)
>  at org.apache.cassandra.concurrent.FutureTask$1.call(FutureTask.java:96)
>  at org.apache.cassandra.concurrent.FutureTask.call(FutureTask.java:61)
>  at org.apache.cassandra.concurrent.FutureTask.run(FutureTask.java:71)
>  at java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1128)
>  at java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:628)
>  at io.netty.util.concurrent.FastThreadLocalRunnable.run(FastThreadLocalRunnable.java:30)
>  at java.base/java.lang.Thread.run(Thread.java:829)
> {code}
>  



--
This message was sent by Atlassian Jira
(v8.20.10#820010)
