[
https://issues.apache.org/jira/browse/SOLR-13065?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16719267#comment-16719267
]
Jason Gerlowski commented on SOLR-13065:
----------------------------------------
At first glance, this looks like a similar problem to what I recently saw in
SOLR-13045. The test fails in a {{waitForState}} block, but there's some
indication that we're using an outdated (cached?) copy of the clusterstatus
info.
Here's a partial stack from a recent failure I got:
{code}
[beaster] 2> NOTE: reproduce with: ant test
-Dtestcase=TestSimExecutePlanAction -Dtests.method=testIntegration
-Dtests.seed=18902C9108C137F1 -Dtests.slow=true -Dtests.badapples=true
-Dtests.locale=es-GT -Dtests.timezone=Asia/Rangoon -Dtests.asserts=true
-Dtests.file.encoding=UTF-8
[beaster] 2> 24745 INFO (simCloudManagerPool-112-thread-8) [ ]
o.a.s.c.CloudTestUtils -- wrong number of active replicas in slice shard1,
expected=1, found=2
[beaster] [12:26:46.105] FAILURE 2.13s |
TestSimExecutePlanAction.testIntegration
{seed=[18902C9108C137F1:7163CC06353074F9]} <<<
[beaster] > Throwable #1: java.lang.AssertionError: Timed out waiting for
replicas of collection to be 2 again
[beaster] > Live Nodes: [127.0.0.1:10016_solr]
[beaster] > Last available state:
DocCollection(testIntegration//clusterstate.json/444)={
...
[beaster] > at
__randomizedtesting.SeedInfo.seed([18902C9108C137F1:7163CC06353074F9]:0)
[beaster] > at
org.apache.solr.cloud.CloudTestUtils.waitForState(CloudTestUtils.java:70)
[beaster] > at
org.apache.solr.cloud.autoscaling.sim.TestSimExecutePlanAction.testIntegration(TestSimExecutePlanAction.java:200
...
[beaster] > Caused by: java.util.concurrent.TimeoutException: last
ClusterState: znodeVersion: 445
{code}
Note the different reported "last" clusterstate versions. We see that there's
a clusterstate.json version 445, but the failing assertion only has 444.
That's not to say definitively that version 445 would pass the assertion, but
it's a place to start.
> Harden TestSimExecuteActionPlan
> -------------------------------
>
> Key: SOLR-13065
> URL: https://issues.apache.org/jira/browse/SOLR-13065
> Project: Solr
> Issue Type: Test
> Security Level: Public(Default Security Level. Issues are Public)
> Affects Versions: master (8.0)
> Reporter: Jason Gerlowski
> Assignee: Jason Gerlowski
> Priority: Major
>
> TestSimExecuteActionPlan is a serial offender in our failed Jenkins jobs.
> Would like to look into improving it.
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]