[ 
https://issues.apache.org/jira/browse/CASSANDRA-11197?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15156738#comment-15156738
 ] 

Sylvain Lebresne commented on CASSANDRA-11197:
----------------------------------------------

Actually, it seems that error is logged when, on bootstrap, migration hasn't 
finished after {{cassandra.migration_task_wait_in_seconds}}, which is one second 
by default (this is new code from CASSANDRA-10731). And I can't help noticing 
that all the failures of this I've seen are from "node5", which means the test 
uses at least 5 nodes. That probably slows CI down substantially and makes the 
default of 1 second likely too low.

So ideally we'd want to bump {{cassandra.migration_task_wait_in_seconds}} when 
there are more nodes. Maybe that's even something that can be done 
automatically by ccm [~philipthompson][~mambocab][~rhatch].
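
To make the idea concrete, here is a rough sketch of how a test harness might 
derive the wait from the cluster size. The helper name and the linear scaling 
(one extra second per additional node) are assumptions for illustration only, 
not something ccm or the dtests do today; only the property name comes from 
the code above.

```python
def migration_wait_jvm_arg(node_count, base_seconds=1, seconds_per_extra_node=1):
    """Build a JVM system-property flag that raises the migration task wait.

    Hypothetical helper: the linear scaling is an assumed heuristic, not a
    measured value. base_seconds mirrors the 1 second default.
    """
    if node_count < 1:
        raise ValueError("node_count must be at least 1")
    # One node keeps the default; each additional node adds a fixed increment.
    wait = base_seconds + seconds_per_extra_node * (node_count - 1)
    return "-Dcassandra.migration_task_wait_in_seconds={}".format(wait)


# A 5-node cluster, as in the failing tests, would get a 5 second wait.
print(migration_wait_jvm_arg(5))
```

The resulting string could then be passed along with the other JVM arguments 
when the bootstrapping node is started; the right increment would need to be 
tuned against actual CI runs.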

> upgrade bootstrap tests flap when migration tasks fail
> ------------------------------------------------------
>
>                 Key: CASSANDRA-11197
>                 URL: https://issues.apache.org/jira/browse/CASSANDRA-11197
>             Project: Cassandra
>          Issue Type: Bug
>            Reporter: Jim Witschey
>            Assignee: DS Test Eng
>              Labels: dtest
>
> I've seen these tests flap:
> {code}
> upgrade_tests/upgrade_through_versions_test.py:ProtoV4Upgrade_3_1_UpTo_3_2_HEAD.bootstrap_test
> upgrade_tests/upgrade_through_versions_test.py:ProtoV4Upgrade_3_3_UpTo_Trunk_HEAD.bootstrap_test
> upgrade_tests/upgrade_through_versions_test.py:ProtoV3Upgrade_3_0_UpTo_3_1_HEAD.bootstrap_multidc_test
> upgrade_tests/upgrade_through_versions_test.py:ProtoV3Upgrade_3_2_UpTo_3_3_HEAD.bootstrap_multidc_test
> upgrade_tests/upgrade_through_versions_test.py:ProtoV3Upgrade_3_3_UpTo_Trunk_HEAD.bootstrap_multidc_test
> upgrade_tests/upgrade_through_versions_test.py:ProtoV4Upgrade_3_3_UpTo_Trunk_HEAD.bootstrap_multidc_test
> {code}
> There may be more upgrade paths that flap, I'm not sure. All the failures 
> I've seen look like this:
> {code}
> Unexpected error in node5 node log: ['ERROR [main] 2016-02-18 20:05:13,012 
> MigrationManager.java:164 - Migration task failed to complete\nERROR [main] 
> 2016-02-18 20:05:14,012 MigrationManager.java:164 - Migration task failed to 
> complete']
> {code}
> [~rhatch] Do these look familiar at all?



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
