Ha! I pulled in Ishan's fixes for 15138 and now AutoscalingHistoryHandlerTest behaves the same as in 8.7! Beasted 10 out of 10 passed, so no @BadApple'ing needed ;-)
On Tue, Feb 16, 2021 at 10:29 AM Anshum Gupta <ans...@anshumgupta.net> wrote: > Yes, doing a single 8.8.2 release that has all the fixes, especially as we > have the fix already is much better for the users. > > Thanks for your patience, Tim :) > > On Tue, Feb 16, 2021 at 9:05 AM Timothy Potter <thelabd...@gmail.com> > wrote: > >> @Ishan ~ Can you look at the question Mike raised about >> https://issues.apache.org/jira/browse/SOLR-15135 please? >> >> So the AutoscalingHistoryHandlerTest has a number of hard-coded wait >> times in it. While I can appreciate the need for waiting to see state >> changes occur, tests like this aren't great for CI and RC smoke tests given >> the variability of hardware. Case in point, I made this change: >> >> ``` >> >> *diff --git >> a/solr/core/src/test/org/apache/solr/handler/admin/AutoscalingHistoryHandlerTest.java >> b/solr/core/src/test/org/apache/solr/handler/admin/AutoscalingHistoryHandlerTest.java* >> >> *index a9eea7f7ca5..3b2d39c3317 100644* >> >> *--- >> a/solr/core/src/test/org/apache/solr/handler/admin/AutoscalingHistoryHandlerTest.java* >> >> *+++ >> b/solr/core/src/test/org/apache/solr/handler/admin/AutoscalingHistoryHandlerTest.java* >> >> @@ -282,7 +282,7 @@ public class AutoscalingHistoryHandlerTest extends >> SolrCloudTestCase { >> >> boolean await = actionFiredLatch.await(60, TimeUnit.SECONDS); >> >> assertTrue("action did not execute", await); >> >> >> >> - await = listenerFiredLatch.await(60, TimeUnit.SECONDS); >> >> + await = listenerFiredLatch.await(120, TimeUnit.SECONDS); >> >> assertTrue("listener did not execute", await); >> >> >> >> waitForRecovery(COLL_NAME); >> ``` >> >> And of course, beasting passes 5 out of 5; it fails pretty consistently >> on the first run w/o this change. So I vote we @BadApple this test for >> 8.8.1 and move forward with RC2 now that Ishan's changes are in. Moreover, >> since we removed auto-scaling from master, holding up a critical bug fix >> for a test that fails intermittently b/c of timing seems imprudent. I'm >> also biased in that I want to get the fix for 15145 out ASAP ;-) >> >> On Tue, Feb 16, 2021 at 9:08 AM Ishan Chattopadhyaya < >> ichattopadhy...@gmail.com> wrote: >> >>> Sounds good, Tim. I've ported the fix to the release branch. Just ran >>> the tests to make sure it works fine. >>> Thanks for the extra work you'll have to do (RC2) in order to save me >>> future work (8.8.2). Really owe you one! >>> >>> > Are there other fixes you're aware of that are slated for 8.8.2 @Ishan >>> Chattopadhyaya <ichattopadhy...@gmail.com>? >>> I am not aware of anything else. >>> >>> On Tue, Feb 16, 2021 at 9:19 PM Timothy Potter <thelabd...@gmail.com> >>> wrote: >>> >>>> I'm beasting AutoscalingHistoryHandlerTest locally now, I haven't seen >>>> that one fail on my side yet. >>>> >>>> As far as respin 8.8.1 RC, it's not a problem for me and I prefer that >>>> to doing an 8.8.2 soon after 8.8.1 comes out. Are there other fixes you're >>>> aware of that are slated for 8.8.2 @Ishan Chattopadhyaya >>>> <ichattopadhy...@gmail.com>? In other words, if the fix for 15138 is >>>> all that will be in 8.8.2, let's just include it in 8.8.1 and hopefully we >>>> won't need an 8.8.2 ;-) >>>> >>>> Tim >>>> >>>> On Tue, Feb 16, 2021 at 7:01 AM Michael Sokolov <msoko...@gmail.com> >>>> wrote: >>>> >>>>> Hmm, I got a failure on >>>>> >>>>> org.apache.solr.handler.admin.AutoscalingHistoryHandlerTest.testHistory, >>>>> but it did not reproduce (tried twice). Would that possibly also be >>>>> addressed by those fixes? >>>>> >>>>> On Tue, Feb 16, 2021 at 7:38 AM Ishan Chattopadhyaya >>>>> <ichattopadhy...@gmail.com> wrote: >>>>> > >>>>> > > The failure seems to be because of a timeout during collection >>>>> > > creation >>>>> > >>>>> > Thanks for digging in. Seems like that is the exact class of fix >>>>> that we did for SOLR-15138 and are planning for 8.8.2. Shall we backport >>>>> that fix to the release branch now (for RC2 or 8.8.2)? >>>>> > >>>>> > > My h/w is really fast and beefy and may be that's why it doesn't >>>>> get reproduced. >>>>> > Same here, Ryzen 9 5950X (fastest mainstream CPU out there). >>>>> > >>>>> > On Tue, Feb 16, 2021 at 5:36 PM Michael McCandless < >>>>> luc...@mikemccandless.com> wrote: >>>>> >> >>>>> >> Curious, the smoke tester passed for me on the first try: >>>>> >> >>>>> >> SUCCESS! [0:44:29.979512] >>>>> >> >>>>> >> >>>>> >> Mike McCandless >>>>> >> >>>>> >> http://blog.mikemccandless.com >>>>> >> >>>>> >> >>>>> >> On Sun, Feb 14, 2021 at 11:26 AM Timothy Potter < >>>>> thelabd...@apache.org> wrote: >>>>> >>> >>>>> >>> Please vote for release candidate 1 for Lucene/Solr 8.8.1 >>>>> >>> >>>>> >>> >>>>> >>> The artifacts can be downloaded from: >>>>> >>> >>>>> >>> >>>>> https://dist.apache.org/repos/dist/dev/lucene/lucene-solr-8.8.1-RC1-rev6a50a0315ac7e4979abb0b530857c7795bb3b928 >>>>> >>> >>>>> >>> >>>>> >>> You can run the smoke tester directly with this command: >>>>> >>> >>>>> >>> >>>>> >>> python3 -u dev-tools/scripts/smokeTestRelease.py \ >>>>> >>> >>>>> >>> >>>>> https://dist.apache.org/repos/dist/dev/lucene/lucene-solr-8.8.1-RC1-rev6a50a0315ac7e4979abb0b530857c7795bb3b928 >>>>> >>> >>>>> >>> >>>>> >>> The vote will be open for at least 72 hours i.e. until 2021-02-17 >>>>> 17:00 UTC. >>>>> >>> >>>>> >>> >>>>> >>> Here is my +1 ~ SUCCESS! [0:50:06.728441] >>>>> >>> >>>>> >>> >>>>> >>> In addition to the smoke test, I built a Docker image from >>>>> solr-8.8.1.tgz locally and verified: >>>>> >>> >>>>> >>> >>>>> >>> a. A rolling upgrade of a 3-node 8.7.0 cluster to the 8.8.1 RC >>>>> completes successfully w/o any NPEs or weirdness with leader election / >>>>> recoveries. >>>>> >>> >>>>> >>> >>>>> >>> b. The base_url property is stored in replica state after the >>>>> upgrade >>>>> >>> >>>>> >>> >>>>> >>> c. A basic client application built with SolrJ 8.7.0 can load >>>>> cluster state info directly from ZK and query the 8.8.1 RC1 servers. >>>>> >>> >>>>> >>> >>>>> >>> d. Same client app built with SolrJ 8.8.0 works as well. >>>>> >>> >>>>> >>> >>>>> >>> As this bug-fix release is primarily needed to address a SolrJ >>>>> back-compat break (SOLR-15145) and unfortunately our smoke tester >>>>> framework >>>>> does not test for backcompat of older SolrJ against the RC, I ask others >>>>> to >>>>> please test rolling upgrades of servers (ideally multi-node clusters) >>>>> running pre-8.8.0 to this RC if possible. Also, please try client >>>>> applications that are using an older SolrJ, esp. those that load cluster >>>>> state directly from ZK. >>>>> >>> >>>>> >>> >>>>> >>> Best regards, >>>>> >>> >>>>> >>> Tim >>>>> >>> >>>>> >>> >>>>> >>> >>>>> >>> >>>>> >>>>> --------------------------------------------------------------------- >>>>> To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org >>>>> For additional commands, e-mail: dev-h...@lucene.apache.org >>>>> >>>>> > > -- > Anshum Gupta >