[ 
https://issues.apache.org/jira/browse/SOLR-15870?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17479718#comment-17479718
 ] 

Jan Høydahl commented on SOLR-15870:
------------------------------------

[~ab] I think the problem with this test is here 
[https://github.com/apache/solr/blob/main/solr/core/src/test/org/apache/solr/cloud/api/collections/TestCollectionAPI.java#L356:L362]
 # Random Jetty is killed
 # Wait for that node to be gone from liveNodes (ZK watch?)
 # Immediately request CLUSTERSTATUS and expect it to be not GREEN

I think this is timing related. The ephemeral live_nodes znode will be gone 
really quickly, and the watch will fire. But it will take some time until the 
Overseer has detected it and mutated the CLUSTERSTATE JSON.

I'm going to attempt a {{zkStateReader.waitForState()}} here to guard against 
this. Have a test that seems to work...

> Test TestCollectionAPI fails frequently
> ---------------------------------------
>
>                 Key: SOLR-15870
>                 URL: https://issues.apache.org/jira/browse/SOLR-15870
>             Project: Solr
>          Issue Type: Bug
>      Security Level: Public(Default Security Level. Issues are Public) 
>          Components: Tests
>    Affects Versions: 9.0
>            Reporter: Jan Høydahl
>            Priority: Major
>
> Sample failure
> Build: https://ci-builds.apache.org/job/Solr/job/Solr-Check-main/2376/
> {code:java}
> 1 tests failed.
> FAILED:  org.apache.solr.cloud.api.collections.TestCollectionAPI.test
> Error Message:
> java.lang.AssertionError: collection health should not be GREEN
> Stack Trace:
> java.lang.AssertionError: collection health should not be GREEN
>       at 
> __randomizedtesting.SeedInfo.seed([2761A750818CC788:AF35988A2F70AA70]:0)
>       at org.junit.Assert.fail(Assert.java:89)
>       at org.junit.Assert.assertTrue(Assert.java:42)
>       at org.junit.Assert.assertFalse(Assert.java:65)
>       at 
> org.apache.solr.cloud.api.collections.TestCollectionAPI.clusterStatusWithCollectionHealthState(TestCollectionAPI.java:362)
>       at 
> org.apache.solr.cloud.api.collections.TestCollectionAPI.test(TestCollectionAPI.java:96)
>  {code}
> Also see graph at 
> http://fucit.org/solr-jenkins-reports/history-trend-of-recent-failures.html#series/org.apache.solr.cloud.api.collections.TestCollectionAPI.test
> Ping [~ab]



--
This message was sent by Atlassian Jira
(v8.20.1#820001)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to