Re: PLEASE READ! BadApple report. Last week was horrible!

2020-05-06 Thread Michael McCandless
Phew!  Thanks for digging Erick, and for producing these BadApple reports.

Mike McCandless

http://blog.mikemccandless.com


On Wed, May 6, 2020 at 7:59 AM Erick Erickson 
wrote:

> OK, this morning things are back to normal. I think the disk space issue
> was to blame because checking after Mike’s fix didn’t look like it
> cured the problem.
>
> Thanks all!
>
> > On May 5, 2020, at 1:41 PM, Chris Hostetter 
> wrote:
> >
> >
> > : And FWIW, I beasted one of the failing suites last night _without_
> > : Mike’s changes and didn’t get any failures so I can’t say anything
> about
> > : whether Mike’s changes helped or not.
> >
> > IIUC McCandless's failure only affects you if you use the "jenkins" test
> > data file (the really big wikipedia dump) ... see the jira he mentioned
> > for details.
> >
> >
> >
> > -Hoss
> > http://www.lucidworks.com/
> >
> > -
> > To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
> > For additional commands, e-mail: dev-h...@lucene.apache.org
>
>
> -
> To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
> For additional commands, e-mail: dev-h...@lucene.apache.org
>
>


Re: PLEASE READ! BadApple report. Last week was horrible!

2020-05-06 Thread Erick Erickson
OK, this morning things are back to normal. I think the disk space issue
was to blame because checking after Mike’s fix didn’t look like it
cured the problem.

Thanks all!

> On May 5, 2020, at 1:41 PM, Chris Hostetter  wrote:
> 
> 
> : And FWIW, I beasted one of the failing suites last night _without_ 
> : Mike’s changes and didn’t get any failures so I can’t say anything about 
> : whether Mike’s changes helped or not.
> 
> IIUC McCandless's failure only affects you if you use the "jenkins" test 
> data file (the really big wikipedia dump) ... see the jira he mentioned 
> for details.
> 
> 
> 
> -Hoss
> http://www.lucidworks.com/
> 
> -
> To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
> For additional commands, e-mail: dev-h...@lucene.apache.org


-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



Re: PLEASE READ! BadApple report. Last week was horrible!

2020-05-05 Thread Erick Erickson
OK, thanks Chris. 

The 24 hour rollup still shows many failures in the several classes, I’ll check 
tomorrow
to see if that’s a consequence of the disk full problem.

> On May 5, 2020, at 1:41 PM, Chris Hostetter  wrote:
> 
> 
> : And FWIW, I beasted one of the failing suites last night _without_ 
> : Mike’s changes and didn’t get any failures so I can’t say anything about 
> : whether Mike’s changes helped or not.
> 
> IIUC McCandless's failure only affects you if you use the "jenkins" test 
> data file (the really big wikipedia dump) ... see the jira he mentioned 
> for details.
> 
> 
> 
> -Hoss
> http://www.lucidworks.com/
> 
> -
> To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
> For additional commands, e-mail: dev-h...@lucene.apache.org


-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



Re: PLEASE READ! BadApple report. Last week was horrible!

2020-05-05 Thread Chris Hostetter

: And FWIW, I beasted one of the failing suites last night _without_ 
: Mike’s changes and didn’t get any failures so I can’t say anything about 
: whether Mike’s changes helped or not.

IIUC McCandless's failure only affects you if you use the "jenkins" test 
data file (the really big wikipedia dump) ... see the jira he mentioned 
for details.



-Hoss
http://www.lucidworks.com/

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

Re: PLEASE READ! BadApple report. Last week was horrible!

2020-05-05 Thread Erick Erickson
 UNLOAD) 
[n:127.0.0.1:49613_solrx:replicaTypesTestColl_shard1_replica_p4 ] 
o.a.s.m.r.SolrJmxReporter Closing reporter 
[org.apache.solr.metrics.reporters.SolrJmxReporter@1f2a6e95: rootName = 
solr_49613, domain = solr.core.replicaTypesTestColl.shard1.replica_p4, service 
url = null, agent id = null] for registry 
solr.core.replicaTypesTestColl.shard1.replica_p4/com.codahale.metrics.MetricRegistry@2edb03e2
 [junit4]   2> 33770 ERROR (indexFetcher-621-thread-1) [n:127.0.0.1:49612_solr  
   ] o.a.s.h.ReplicationHandler Index fetch failed 
:java.lang.NullPointerException
   [junit4]   2>at 
org.apache.solr.handler.IndexFetcher.getLeaderReplica(IndexFetcher.java:709)
   [junit4]   2>at 
org.apache.solr.handler.IndexFetcher.fetchLatestIndex(IndexFetcher.java:387)
   [junit4]   2>at 
org.apache.solr.handler.IndexFetcher.fetchLatestIndex(IndexFetcher.java:351)
   [junit4]   2>at 
org.apache.solr.handler.ReplicationHandler.doFetch(ReplicationHandler.java:422)
   [junit4]   2>at 
org.apache.solr.handler.ReplicationHandler.lambda$setupPolling$13(ReplicationHandler.java:1208)
   [junit4]   2>at 
java.base/java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:515)
   [junit4]   2>at 
java.base/java.util.concurrent.FutureTask.runAndReset(FutureTask.java:305)
   [junit4]   2>at 
java.base/java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.run(ScheduledThreadPoolExecutor.java:305)
   [junit4]   2>at 
java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1128)
   [junit4]   2>at 
java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:628)
   [junit4]   2>at java.base/java.lang.Thread.run(Thread.java:834)
   [junit4]   2> 

> On May 5, 2020, at 4:33 AM, Uwe Schindler  wrote:
> 
> Hi,
> 
> there was also a problem with the Windows Node. It ran out of disk space, 
> because some test seem to have filled up all of the disk. All followup builds 
> failed. I cleaned all Workspaces (8.x, master) and it freed 20 Gigabytes!
> 
> Uwe
> 
> -
> Uwe Schindler
> Achterdiek 19, D-28357 Bremen
> https://www.thetaphi.de
> eMail: u...@thetaphi.de
> 
>> -Original Message-
>> From: Erick Erickson 
>> Sent: Monday, May 4, 2020 1:54 PM
>> To: dev@lucene.apache.org
>> Subject: PLEASE READ! BadApple report. Last week was horrible!
>> 
>> I don’t know whether we had some temporary glitch that broke lots of tests
>> and they’ve been fixed or we had a major regression, but this needs to be
>> addressed ASAP if they’re still failing. See everything below the line "ALL 
>> OF
>> THE TESTS BELOW HERE HAVE ONLY FAILED IN THE LAST WEEK!” in this e-mail.
>> I’ll raise a JIRA if we can’t get some traction quickly here.
>> 
>> Hey, stuff happens. there’s no problem with tests going totally weird for a
>> while. If you can say “Oh, yeah, all those failures for class XYZ are 
>> probably
>> fixed” that’s fine.
>> 
>> Gosh-a-rooni, I hope my logging changes aren’t the culprit (gulp)….
>> 
>> Hoss’ rolllup for the last 24 hours is not encouraging in terms of the
>> problem already being fixed. There are lots of failures in some
>> classes, notably:
>> 
>> CloudHttp2SolrClientTest
>> CollectionsAPIDistributedZkTest
>> DeleteReplicaTest
>> TestDocCollectionWatcher
>> 
>> Unfortunately, the failure rate is not very high so reliably
>> reproducing is hard.
>> 
>> I’ve reproduced the last week’s failure in this e-mail, full
>> report attached.
>> 
>> Here’s Hoss’ rollup:
>> http://fucit.org/solr-jenkins-reports/failure-report.html
>> 
>> Usual synopsis:
>> 
>> Raw fail count by week totals, most recent week first (corresponds to bits):
>> Week: 0  had  343 failures
>> Week: 1  had  86 failures
>> Week: 2  had  78 failures
>> Week: 3  had  117 failures
>> 
>> 
>> Failures in Hoss' reports for the last 4 rollups.
>> 
>> There were 497 unannotated tests that failed in Hoss' rollups. Ordered by the
>> date I downloaded the rollup file, newest->oldest. See above for the dates 
>> the
>> files were collected
>> These tests were NOT BadApple'd or AwaitsFix’d
>> 
>> Failures in the last 4 reports..
>>   Report   Pct runsfails   test
>> 0123   0.7 1617 11
>> ConnectionManagerTest.testReconnectWhenZkDisappeared
>> 0123   1.5 1606 12  ExecutePlanActionTest.testTaskTimeout
>> 0123   1.6 1320 19  MultiThreadedOCPTest.test
>> 0123   1.0 

RE: PLEASE READ! BadApple report. Last week was horrible!

2020-05-05 Thread Uwe Schindler
Hi,

there was also a problem with the Windows Node. It ran out of disk space, 
because some test seem to have filled up all of the disk. All followup builds 
failed. I cleaned all Workspaces (8.x, master) and it freed 20 Gigabytes!

Uwe

-
Uwe Schindler
Achterdiek 19, D-28357 Bremen
https://www.thetaphi.de
eMail: u...@thetaphi.de

> -Original Message-
> From: Erick Erickson 
> Sent: Monday, May 4, 2020 1:54 PM
> To: dev@lucene.apache.org
> Subject: PLEASE READ! BadApple report. Last week was horrible!
> 
> I don’t know whether we had some temporary glitch that broke lots of tests
> and they’ve been fixed or we had a major regression, but this needs to be
> addressed ASAP if they’re still failing. See everything below the line "ALL OF
> THE TESTS BELOW HERE HAVE ONLY FAILED IN THE LAST WEEK!” in this e-mail.
> I’ll raise a JIRA if we can’t get some traction quickly here.
> 
> Hey, stuff happens. there’s no problem with tests going totally weird for a
> while. If you can say “Oh, yeah, all those failures for class XYZ are probably
> fixed” that’s fine.
> 
> Gosh-a-rooni, I hope my logging changes aren’t the culprit (gulp)….
> 
> Hoss’ rolllup for the last 24 hours is not encouraging in terms of the
> problem already being fixed. There are lots of failures in some
> classes, notably:
> 
> CloudHttp2SolrClientTest
> CollectionsAPIDistributedZkTest
> DeleteReplicaTest
> TestDocCollectionWatcher
> 
> Unfortunately, the failure rate is not very high so reliably
> reproducing is hard.
> 
> I’ve reproduced the last week’s failure in this e-mail, full
> report attached.
> 
> Here’s Hoss’ rollup:
> http://fucit.org/solr-jenkins-reports/failure-report.html
> 
> Usual synopsis:
> 
> Raw fail count by week totals, most recent week first (corresponds to bits):
> Week: 0  had  343 failures
> Week: 1  had  86 failures
> Week: 2  had  78 failures
> Week: 3  had  117 failures
> 
> 
> Failures in Hoss' reports for the last 4 rollups.
> 
> There were 497 unannotated tests that failed in Hoss' rollups. Ordered by the
> date I downloaded the rollup file, newest->oldest. See above for the dates the
> files were collected
> These tests were NOT BadApple'd or AwaitsFix’d
> 
> Failures in the last 4 reports..
>Report   Pct runsfails   test
>  0123   0.7 1617 11
> ConnectionManagerTest.testReconnectWhenZkDisappeared
>  0123   1.5 1606 12  ExecutePlanActionTest.testTaskTimeout
>  0123   1.6 1320 19  MultiThreadedOCPTest.test
>  0123   1.0 1620 13  RollingRestartTest.test
>  0123   1.2 1617 12  SearchRateTriggerTest.testWaitForElapsed
>  0123   3.8  119  7  ShardSplitTest.testSplitWithChaosMonkey
>  0123   0.3 1519  7  TestInPlaceUpdatesDistrib.test
>  0123   0.7 1629 14  
> TestIndexWriterDelete.testDeleteAllNoDeadLock
>  0123   2.4 1548 18  TestPackages.testPluginLoading
>  0123   0.3 1587  4  UnloadDistributedZkTest.test
> 
> 
> FAILURES IN THE LAST WEEK (343!)
> Look particularly at the ones with only a zero in the “Report” column, those 
> are
> failures that were _not_ in the previous 3 week’s rollups.
> 
>Report   Pct runsfails   test
>  0120.5 1165  4  CustomHighlightComponentTest.test
>  0121.0 1168  6
> NodeMarkersRegistrationTest.testNodeMarkersRegistration
>  0121.0 1170  8  TestCryptoKeys.test
>  01 3   0.7 1233 11  LeaderFailoverAfterPartitionTest.test
>  01 3  63.2  102 39  StressHdfsTest.test
>  01 0.3  709  2
> ScheduledTriggerIntegrationTest.testScheduledTrigger
>  01 0.2  768  2  ShardRoutingTest.test
>  01 2.6  807 22  TestAllFilesHaveChecksumFooter.test
>  01 2.6  808 22  TestAllFilesHaveCodecHeader.test
>  01 0.2  769  2  TestCloudSchemaless.test
>  01 0.2  769  2  TestDynamicLoading.testDynamicLoading
>  01 0.3  707  2  
> TestDynamicLoadingUrl.testDynamicLoadingUrl
>  01 0.5  767  4  TestPointFields.testFloatPointStats
>  0127.1   83 19  TestSQLHandler.doTest
>  01 0.2  794 12  TestSameScoresWithThreads.test
>  01 2.6  806 22  TestShardSearching.testSimple
>  01 0.5  726  4  TestSimScenario.testSplitShard
>  01 1.1  726  7  TestSimScenario.testSuggestions
>  01 0.3  771  2  TestWithCollection.testAddRe

Re: PLEASE READ! BadApple report. Last week was horrible!

2020-05-04 Thread Erick Erickson
Mike:

I saw the push. Hoss’ rollups go for “the last 24 hours”, so it’ll be Tuesday 
evening before things have had a chance to work their way through, I’ll look 
tomorrow.

Meanwhile I’m beasting one of the failing test suites (without the change) and 
280 iterations so far and no failures. That said, the failure rate was < 1% so 
it’s not conclusive. Only another 720 runs to go before I pull the latest 
changes and try again… ;)



> On May 4, 2020, at 1:33 PM, Michael McCandless  
> wrote:
> 
> Hi Erick,
> 
> OK I pushed a fix!  See if it decreases the failure rate for those newly bad 
> apples?
> 
> Sorry and thanks :)
> 
> Mike McCandless
> 
> http://blog.mikemccandless.com
> 
> 
> On Mon, May 4, 2020 at 1:06 PM Erick Erickson  wrote:
> Mike:
> 
> I have no idea. Hoss’ rollups don’t link back to builds, they
> just aggregate the results.
> 
> Not a huge deal if it’s something like this of course. Let’s just
> say I’ve had my share or “moments” ;).
> 
> And unfortunately, the test failures are pretty rare on a 
> percentage basis, so it’s hard to tell.
> 
> I’m watching LUCENE-9191 and I’ll look back at Hoss’ rollups
> a day after you push it and see if the failures disappear.
> 
> It’ll take a while for the fixes to roll through all the reporting.
> 
> Tell you what. I’ll try beasting one of the classes that fails a lot and then
> try it again after you push LUCENE-9191 and we’ll go from there.
> 
> Thanks for getting into this so promptly!
> 
> Erick
> 
> > On May 4, 2020, at 9:10 AM, Michael McCandless  
> > wrote:
> > 
> > Hi Erick,
> > 
> > It's possible this was the root cause of many of the failures: 
> > https://issues.apache.org/jira/browse/LUCENE-9191
> > 
> > Do these transient failures look something like this?
> > 
> >[junit4]> Throwable #1: java.nio.charset.MalformedInputException: 
> > Input length = 1
> >[junit4]>at 
> > __randomizedtesting.SeedInfo.seed([172C6414BE5E2A2C:E5829DFC005A1F0]:0)
> >[junit4]>at 
> > java.base/java.nio.charset.CoderResult.throwException(CoderResult.java:274)
> >[junit4]>at 
> > java.base/sun.nio.cs.StreamDecoder.implRead(StreamDecoder.java:339)
> >[junit4]>at 
> > java.base/sun.nio.cs.StreamDecoder.read(StreamDecoder.java:178)
> >[junit4]>at 
> > java.base/java.io.InputStreamReader.read(InputStreamReader.java:185)
> >[junit4]>at 
> > java.base/java.io.BufferedReader.fill(BufferedReader.java:161)
> >[junit4]>at 
> > java.base/java.io.BufferedReader.readLine(BufferedReader.java:326)
> >[junit4]>at 
> > java.base/java.io.BufferedReader.readLine(BufferedReader.java:392)
> >[junit4]>at 
> > org.apache.lucene.util.LineFileDocs.open(LineFileDocs.java:175)
> >[junit4]>at 
> > org.apache.lucene.util.LineFileDocs.(LineFileDocs.java:65)
> >[junit4]>at 
> > org.apache.lucene.util.LineFileDocs.(LineFileDocs.java:69)
> > 
> > 
> > If so, then it is likely the root cause ... I'm working on a fix.  Sorry!
> > 
> > Mike McCandless
> > 
> > http://blog.mikemccandless.com
> > 
> > 
> > On Mon, May 4, 2020 at 7:54 AM Erick Erickson  
> > wrote:
> > I don’t know whether we had some temporary glitch that broke lots of tests 
> > and they’ve been fixed or we had a major regression, but this needs to be 
> > addressed ASAP if they’re still failing. See everything below the line "ALL 
> > OF THE TESTS BELOW HERE HAVE ONLY FAILED IN THE LAST WEEK!” in this e-mail. 
> > I’ll raise a JIRA if we can’t get some traction quickly here.
> > 
> > Hey, stuff happens. there’s no problem with tests going totally weird for a 
> > while. If you can say “Oh, yeah, all those failures for class XYZ are 
> > probably fixed” that’s fine.
> > 
> > Gosh-a-rooni, I hope my logging changes aren’t the culprit (gulp)….
> > 
> > Hoss’ rolllup for the last 24 hours is not encouraging in terms of the
> > problem already being fixed. There are lots of failures in some
> > classes, notably:
> > 
> > CloudHttp2SolrClientTest
> > CollectionsAPIDistributedZkTest
> > DeleteReplicaTest
> > TestDocCollectionWatcher
> > 
> > Unfortunately, the failure rate is not very high so reliably 
> > reproducing is hard.
> > 
> > I’ve reproduced the last week’s failure in this e-mail, full 
> > report attached. 
> > 
> > Here’s Hoss’ rollup:
> > http://fucit.org/solr-jenkins-reports/failure-report.html
> > 
> > Usual synopsis:
> > 
> > Raw fail count by week totals, most recent week first (corresponds to bits):
> > Week: 0  had  343 failures
> > Week: 1  had  86 failures
> > Week: 2  had  78 failures
> > Week: 3  had  117 failures
> > 
> > 
> > Failures in Hoss' reports for the last 4 rollups.
> > 
> > There were 497 unannotated tests that failed in Hoss' rollups. Ordered by 
> > the date I downloaded the rollup file, newest->oldest. See above for the 
> > dates the files were collected 
> > These tests were NOT BadApple'd or AwaitsFix’d
> 

Re: PLEASE READ! BadApple report. Last week was horrible!

2020-05-04 Thread Michael McCandless
Hi Erick,

OK I pushed a fix!  See if it decreases the failure rate for those newly
bad apples?

Sorry and thanks :)

Mike McCandless

http://blog.mikemccandless.com


On Mon, May 4, 2020 at 1:06 PM Erick Erickson 
wrote:

> Mike:
>
> I have no idea. Hoss’ rollups don’t link back to builds, they
> just aggregate the results.
>
> Not a huge deal if it’s something like this of course. Let’s just
> say I’ve had my share or “moments” ;).
>
> And unfortunately, the test failures are pretty rare on a
> percentage basis, so it’s hard to tell.
>
> I’m watching LUCENE-9191 and I’ll look back at Hoss’ rollups
> a day after you push it and see if the failures disappear.
>
> It’ll take a while for the fixes to roll through all the reporting.
>
> Tell you what. I’ll try beasting one of the classes that fails a lot and
> then
> try it again after you push LUCENE-9191 and we’ll go from there.
>
> Thanks for getting into this so promptly!
>
> Erick
>
> > On May 4, 2020, at 9:10 AM, Michael McCandless <
> luc...@mikemccandless.com> wrote:
> >
> > Hi Erick,
> >
> > It's possible this was the root cause of many of the failures:
> https://issues.apache.org/jira/browse/LUCENE-9191
> >
> > Do these transient failures look something like this?
> >
> >[junit4]> Throwable #1: java.nio.charset.MalformedInputException:
> Input length = 1
> >[junit4]>at
> __randomizedtesting.SeedInfo.seed([172C6414BE5E2A2C:E5829DFC005A1F0]:0)
> >[junit4]>at
> java.base/java.nio.charset.CoderResult.throwException(CoderResult.java:274)
> >[junit4]>at
> java.base/sun.nio.cs.StreamDecoder.implRead(StreamDecoder.java:339)
> >[junit4]>at
> java.base/sun.nio.cs.StreamDecoder.read(StreamDecoder.java:178)
> >[junit4]>at java.base/java.io
> .InputStreamReader.read(InputStreamReader.java:185)
> >[junit4]>at java.base/java.io
> .BufferedReader.fill(BufferedReader.java:161)
> >[junit4]>at java.base/java.io
> .BufferedReader.readLine(BufferedReader.java:326)
> >[junit4]>at java.base/java.io
> .BufferedReader.readLine(BufferedReader.java:392)
> >[junit4]>at
> org.apache.lucene.util.LineFileDocs.open(LineFileDocs.java:175)
> >[junit4]>at
> org.apache.lucene.util.LineFileDocs.(LineFileDocs.java:65)
> >[junit4]>at
> org.apache.lucene.util.LineFileDocs.(LineFileDocs.java:69)
> >
> >
> > If so, then it is likely the root cause ... I'm working on a fix.  Sorry!
> >
> > Mike McCandless
> >
> > http://blog.mikemccandless.com
> >
> >
> > On Mon, May 4, 2020 at 7:54 AM Erick Erickson 
> wrote:
> > I don’t know whether we had some temporary glitch that broke lots of
> tests and they’ve been fixed or we had a major regression, but this needs
> to be addressed ASAP if they’re still failing. See everything below the
> line "ALL OF THE TESTS BELOW HERE HAVE ONLY FAILED IN THE LAST WEEK!” in
> this e-mail. I’ll raise a JIRA if we can’t get some traction quickly here.
> >
> > Hey, stuff happens. there’s no problem with tests going totally weird
> for a while. If you can say “Oh, yeah, all those failures for class XYZ are
> probably fixed” that’s fine.
> >
> > Gosh-a-rooni, I hope my logging changes aren’t the culprit (gulp)….
> >
> > Hoss’ rolllup for the last 24 hours is not encouraging in terms of the
> > problem already being fixed. There are lots of failures in some
> > classes, notably:
> >
> > CloudHttp2SolrClientTest
> > CollectionsAPIDistributedZkTest
> > DeleteReplicaTest
> > TestDocCollectionWatcher
> >
> > Unfortunately, the failure rate is not very high so reliably
> > reproducing is hard.
> >
> > I’ve reproduced the last week’s failure in this e-mail, full
> > report attached.
> >
> > Here’s Hoss’ rollup:
> > http://fucit.org/solr-jenkins-reports/failure-report.html
> >
> > Usual synopsis:
> >
> > Raw fail count by week totals, most recent week first (corresponds to
> bits):
> > Week: 0  had  343 failures
> > Week: 1  had  86 failures
> > Week: 2  had  78 failures
> > Week: 3  had  117 failures
> >
> >
> > Failures in Hoss' reports for the last 4 rollups.
> >
> > There were 497 unannotated tests that failed in Hoss' rollups. Ordered
> by the date I downloaded the rollup file, newest->oldest. See above for the
> dates the files were collected
> > These tests were NOT BadApple'd or AwaitsFix’d
> >
> > Failures in the last 4 reports..
> >Report   Pct runsfails   test
> >  0123   0.7 1617 11
> ConnectionManagerTest.testReconnectWhenZkDisappeared
> >  0123   1.5 1606 12
> ExecutePlanActionTest.testTaskTimeout
> >  0123   1.6 1320 19  MultiThreadedOCPTest.test
> >  0123   1.0 1620 13  RollingRestartTest.test
> >  0123   1.2 1617 12
> SearchRateTriggerTest.testWaitForElapsed
> >  0123   3.8  119  7
> ShardSplitTest.testSplitWithChaosMonkey
> >  0123   0.3 1519  7  

Re: PLEASE READ! BadApple report. Last week was horrible!

2020-05-04 Thread Erick Erickson
Mike:

I have no idea. Hoss’ rollups don’t link back to builds, they
just aggregate the results.

Not a huge deal if it’s something like this of course. Let’s just
say I’ve had my share or “moments” ;).

And unfortunately, the test failures are pretty rare on a 
percentage basis, so it’s hard to tell.

I’m watching LUCENE-9191 and I’ll look back at Hoss’ rollups
a day after you push it and see if the failures disappear.

It’ll take a while for the fixes to roll through all the reporting.

Tell you what. I’ll try beasting one of the classes that fails a lot and then
try it again after you push LUCENE-9191 and we’ll go from there.

Thanks for getting into this so promptly!

Erick

> On May 4, 2020, at 9:10 AM, Michael McCandless  
> wrote:
> 
> Hi Erick,
> 
> It's possible this was the root cause of many of the failures: 
> https://issues.apache.org/jira/browse/LUCENE-9191
> 
> Do these transient failures look something like this?
> 
>[junit4]> Throwable #1: java.nio.charset.MalformedInputException: 
> Input length = 1
>[junit4]>at 
> __randomizedtesting.SeedInfo.seed([172C6414BE5E2A2C:E5829DFC005A1F0]:0)
>[junit4]>at 
> java.base/java.nio.charset.CoderResult.throwException(CoderResult.java:274)
>[junit4]>at 
> java.base/sun.nio.cs.StreamDecoder.implRead(StreamDecoder.java:339)
>[junit4]>at 
> java.base/sun.nio.cs.StreamDecoder.read(StreamDecoder.java:178)
>[junit4]>at 
> java.base/java.io.InputStreamReader.read(InputStreamReader.java:185)
>[junit4]>at 
> java.base/java.io.BufferedReader.fill(BufferedReader.java:161)
>[junit4]>at 
> java.base/java.io.BufferedReader.readLine(BufferedReader.java:326)
>[junit4]>at 
> java.base/java.io.BufferedReader.readLine(BufferedReader.java:392)
>[junit4]>at 
> org.apache.lucene.util.LineFileDocs.open(LineFileDocs.java:175)
>[junit4]>at 
> org.apache.lucene.util.LineFileDocs.(LineFileDocs.java:65)
>[junit4]>at 
> org.apache.lucene.util.LineFileDocs.(LineFileDocs.java:69)
> 
> 
> If so, then it is likely the root cause ... I'm working on a fix.  Sorry!
> 
> Mike McCandless
> 
> http://blog.mikemccandless.com
> 
> 
> On Mon, May 4, 2020 at 7:54 AM Erick Erickson  wrote:
> I don’t know whether we had some temporary glitch that broke lots of tests 
> and they’ve been fixed or we had a major regression, but this needs to be 
> addressed ASAP if they’re still failing. See everything below the line "ALL 
> OF THE TESTS BELOW HERE HAVE ONLY FAILED IN THE LAST WEEK!” in this e-mail. 
> I’ll raise a JIRA if we can’t get some traction quickly here.
> 
> Hey, stuff happens. there’s no problem with tests going totally weird for a 
> while. If you can say “Oh, yeah, all those failures for class XYZ are 
> probably fixed” that’s fine.
> 
> Gosh-a-rooni, I hope my logging changes aren’t the culprit (gulp)….
> 
> Hoss’ rolllup for the last 24 hours is not encouraging in terms of the
> problem already being fixed. There are lots of failures in some
> classes, notably:
> 
> CloudHttp2SolrClientTest
> CollectionsAPIDistributedZkTest
> DeleteReplicaTest
> TestDocCollectionWatcher
> 
> Unfortunately, the failure rate is not very high so reliably 
> reproducing is hard.
> 
> I’ve reproduced the last week’s failure in this e-mail, full 
> report attached. 
> 
> Here’s Hoss’ rollup:
> http://fucit.org/solr-jenkins-reports/failure-report.html
> 
> Usual synopsis:
> 
> Raw fail count by week totals, most recent week first (corresponds to bits):
> Week: 0  had  343 failures
> Week: 1  had  86 failures
> Week: 2  had  78 failures
> Week: 3  had  117 failures
> 
> 
> Failures in Hoss' reports for the last 4 rollups.
> 
> There were 497 unannotated tests that failed in Hoss' rollups. Ordered by the 
> date I downloaded the rollup file, newest->oldest. See above for the dates 
> the files were collected 
> These tests were NOT BadApple'd or AwaitsFix’d
> 
> Failures in the last 4 reports..
>Report   Pct runsfails   test
>  0123   0.7 1617 11  
> ConnectionManagerTest.testReconnectWhenZkDisappeared
>  0123   1.5 1606 12  ExecutePlanActionTest.testTaskTimeout
>  0123   1.6 1320 19  MultiThreadedOCPTest.test
>  0123   1.0 1620 13  RollingRestartTest.test
>  0123   1.2 1617 12  SearchRateTriggerTest.testWaitForElapsed
>  0123   3.8  119  7  ShardSplitTest.testSplitWithChaosMonkey
>  0123   0.3 1519  7  TestInPlaceUpdatesDistrib.test
>  0123   0.7 1629 14  
> TestIndexWriterDelete.testDeleteAllNoDeadLock
>  0123   2.4 1548 18  TestPackages.testPluginLoading
>  0123   0.3 1587  4  UnloadDistributedZkTest.test
> 
> 
> FAILURES IN THE LAST WEEK (343!)
> Look particularly at the ones with only a zero in 

Re: PLEASE READ! BadApple report. Last week was horrible!

2020-05-04 Thread Michael McCandless
Hi Erick,

It's possible this was the root cause of many of the failures:
https://issues.apache.org/jira/browse/LUCENE-9191

Do these transient failures look something like this?

   [junit4]> Throwable #1:
java.nio.charset.MalformedInputException: Input length = 1
   [junit4]>at
__randomizedtesting.SeedInfo.seed([172C6414BE5E2A2C:E5829DFC005A1F0]:0)
   [junit4]>at
java.base/java.nio.charset.CoderResult.throwException(CoderResult.java:274)
   [junit4]>at
java.base/sun.nio.cs.StreamDecoder.implRead(StreamDecoder.java:339)
   [junit4]>at
java.base/sun.nio.cs.StreamDecoder.read(StreamDecoder.java:178)
   [junit4]>at
java.base/java.io.InputStreamReader.read(InputStreamReader.java:185)
   [junit4]>at
java.base/java.io.BufferedReader.fill(BufferedReader.java:161)
   [junit4]>at
java.base/java.io.BufferedReader.readLine(BufferedReader.java:326)
   [junit4]>at
java.base/java.io.BufferedReader.readLine(BufferedReader.java:392)
   [junit4]>at
org.apache.lucene.util.LineFileDocs.open(LineFileDocs.java:175)
   [junit4]>at
org.apache.lucene.util.LineFileDocs.(LineFileDocs.java:65)
   [junit4]>at
org.apache.lucene.util.LineFileDocs.(LineFileDocs.java:69)


If so, then it is likely the root cause ... I'm working on a fix.  Sorry!

Mike McCandless

http://blog.mikemccandless.com


On Mon, May 4, 2020 at 7:54 AM Erick Erickson 
wrote:

> I don’t know whether we had some temporary glitch that broke lots of tests
> and they’ve been fixed or we had a major regression, but this needs to be
> addressed ASAP if they’re still failing. See everything below the line "ALL
> OF THE TESTS BELOW HERE HAVE ONLY FAILED IN THE LAST WEEK!” in this e-mail.
> I’ll raise a JIRA if we can’t get some traction quickly here.
>
> Hey, stuff happens. there’s no problem with tests going totally weird for
> a while. If you can say “Oh, yeah, all those failures for class XYZ are
> probably fixed” that’s fine.
>
> Gosh-a-rooni, I hope my logging changes aren’t the culprit (gulp)….
>
> Hoss’ rolllup for the last 24 hours is not encouraging in terms of the
> problem already being fixed. There are lots of failures in some
> classes, notably:
>
> CloudHttp2SolrClientTest
> CollectionsAPIDistributedZkTest
> DeleteReplicaTest
> TestDocCollectionWatcher
>
> Unfortunately, the failure rate is not very high so reliably
> reproducing is hard.
>
> I’ve reproduced the last week’s failure in this e-mail, full
> report attached.
>
> Here’s Hoss’ rollup:
> http://fucit.org/solr-jenkins-reports/failure-report.html
>
> Usual synopsis:
>
> Raw fail count by week totals, most recent week first (corresponds to
> bits):
> Week: 0  had  343 failures
> Week: 1  had  86 failures
> Week: 2  had  78 failures
> Week: 3  had  117 failures
>
>
> Failures in Hoss' reports for the last 4 rollups.
>
> There were 497 unannotated tests that failed in Hoss' rollups. Ordered by
> the date I downloaded the rollup file, newest->oldest. See above for the
> dates the files were collected
> These tests were NOT BadApple'd or AwaitsFix’d
>
> Failures in the last 4 reports..
>Report   Pct runsfails   test
>  0123   0.7 1617 11
> ConnectionManagerTest.testReconnectWhenZkDisappeared
>  0123   1.5 1606 12  ExecutePlanActionTest.testTaskTimeout
>  0123   1.6 1320 19  MultiThreadedOCPTest.test
>  0123   1.0 1620 13  RollingRestartTest.test
>  0123   1.2 1617 12
> SearchRateTriggerTest.testWaitForElapsed
>  0123   3.8  119  7
> ShardSplitTest.testSplitWithChaosMonkey
>  0123   0.3 1519  7  TestInPlaceUpdatesDistrib.test
>  0123   0.7 1629 14
> TestIndexWriterDelete.testDeleteAllNoDeadLock
>  0123   2.4 1548 18  TestPackages.testPluginLoading
>  0123   0.3 1587  4  UnloadDistributedZkTest.test
> 
>
> FAILURES IN THE LAST WEEK (343!)
> Look particularly at the ones with only a zero in the “Report” column,
> those are
> failures that were _not_ in the previous 3 week’s rollups.
>
>Report   Pct runsfails   test
>  0120.5 1165  4  CustomHighlightComponentTest.test
>  0121.0 1168  6
> NodeMarkersRegistrationTest.testNodeMarkersRegistration
>  0121.0 1170  8  TestCryptoKeys.test
>  01 3   0.7 1233 11  LeaderFailoverAfterPartitionTest.test
>  01 3  63.2  102 39  StressHdfsTest.test
>  01 0.3  709  2
> ScheduledTriggerIntegrationTest.testScheduledTrigger
>  01 0.2  768  2  ShardRoutingTest.test
>  01 2.6  807 22  TestAllFilesHaveChecksumFooter.test
>  01 2.6  808 22  TestAllFilesHaveCodecHeader.test
>  01 0.2  769  2  TestCloudSchemaless.test
>  01 0.2

PLEASE READ! BadApple report. Last week was horrible!

2020-05-04 Thread Erick Erickson
I don’t know whether we had some temporary glitch that broke lots of tests and 
they’ve been fixed or we had a major regression, but this needs to be addressed 
ASAP if they’re still failing. See everything below the line "ALL OF THE TESTS 
BELOW HERE HAVE ONLY FAILED IN THE LAST WEEK!” in this e-mail. I’ll raise a 
JIRA if we can’t get some traction quickly here.

Hey, stuff happens. there’s no problem with tests going totally weird for a 
while. If you can say “Oh, yeah, all those failures for class XYZ are probably 
fixed” that’s fine.

Gosh-a-rooni, I hope my logging changes aren’t the culprit (gulp)….

Hoss’ rolllup for the last 24 hours is not encouraging in terms of the
problem already being fixed. There are lots of failures in some
classes, notably:

CloudHttp2SolrClientTest
CollectionsAPIDistributedZkTest
DeleteReplicaTest
TestDocCollectionWatcher

Unfortunately, the failure rate is not very high so reliably 
reproducing is hard.

I’ve reproduced the last week’s failure in this e-mail, full 
report attached. 

Here’s Hoss’ rollup:
http://fucit.org/solr-jenkins-reports/failure-report.html

Usual synopsis:

Raw fail count by week totals, most recent week first (corresponds to bits):
Week: 0  had  343 failures
Week: 1  had  86 failures
Week: 2  had  78 failures
Week: 3  had  117 failures


Failures in Hoss' reports for the last 4 rollups.

There were 497 unannotated tests that failed in Hoss' rollups. Ordered by the 
date I downloaded the rollup file, newest->oldest. See above for the dates the 
files were collected 
These tests were NOT BadApple'd or AwaitsFix’d

Failures in the last 4 reports..
   Report   Pct runsfails   test
 0123   0.7 1617 11  
ConnectionManagerTest.testReconnectWhenZkDisappeared
 0123   1.5 1606 12  ExecutePlanActionTest.testTaskTimeout
 0123   1.6 1320 19  MultiThreadedOCPTest.test
 0123   1.0 1620 13  RollingRestartTest.test
 0123   1.2 1617 12  SearchRateTriggerTest.testWaitForElapsed
 0123   3.8  119  7  ShardSplitTest.testSplitWithChaosMonkey
 0123   0.3 1519  7  TestInPlaceUpdatesDistrib.test
 0123   0.7 1629 14  
TestIndexWriterDelete.testDeleteAllNoDeadLock
 0123   2.4 1548 18  TestPackages.testPluginLoading
 0123   0.3 1587  4  UnloadDistributedZkTest.test


FAILURES IN THE LAST WEEK (343!)
Look particularly at the ones with only a zero in the “Report” column, those are
failures that were _not_ in the previous 3 week’s rollups.

   Report   Pct runsfails   test
 0120.5 1165  4  CustomHighlightComponentTest.test
 0121.0 1168  6  
NodeMarkersRegistrationTest.testNodeMarkersRegistration
 0121.0 1170  8  TestCryptoKeys.test
 01 3   0.7 1233 11  LeaderFailoverAfterPartitionTest.test
 01 3  63.2  102 39  StressHdfsTest.test
 01 0.3  709  2  
ScheduledTriggerIntegrationTest.testScheduledTrigger
 01 0.2  768  2  ShardRoutingTest.test
 01 2.6  807 22  TestAllFilesHaveChecksumFooter.test
 01 2.6  808 22  TestAllFilesHaveCodecHeader.test
 01 0.2  769  2  TestCloudSchemaless.test
 01 0.2  769  2  TestDynamicLoading.testDynamicLoading
 01 0.3  707  2  TestDynamicLoadingUrl.testDynamicLoadingUrl
 01 0.5  767  4  TestPointFields.testFloatPointStats
 0127.1   83 19  TestSQLHandler.doTest
 01 0.2  794 12  TestSameScoresWithThreads.test
 01 2.6  806 22  TestShardSearching.testSimple
 01 0.5  726  4  TestSimScenario.testSplitShard
 01 1.1  726  7  TestSimScenario.testSuggestions
 01 0.3  771  2  TestWithCollection.testAddReplicaSimple
 0 23   0.3 1223  4  
CdcrVersionReplicationTest.testCdcrDocVersions
 0 23   0.8 1172  6  
CloudHttp2SolrClientTest.testRetryUpdatesWhenClusterStateIsStale
 0 23   1.4 1202  8  CollectionsAPISolrJTest.testColStatus
 0 23   1.0 1249 11  HttpPartitionTest.test
 0 23   1.1 1210  8  HttpPartitionWithTlogReplicasTest.test
 0 23   0.5 1258  4  ShardSplitTest.testSplitShardWithRuleLink
 0 23   0.2 1231  4  
TestQueryingOnDownCollection.testQueryToDownCollectionShouldFailFast
 0 23   0.2 1232  6  TestSolrConfigHandlerCloud.test
 0 20.3  767  2  
DocValuesNotIndexedTest.testGroupingDVOnlySortLast
 0 20.3  750  2  TestLBHttp2SolrClient.testTwoServers
 0 20.3  794  2  TestSolrCloudSnapshots.testSnapshots
 0 2   40.7   51 12  

No BadApple report this week

2019-09-02 Thread Erick Erickson
I’ll probably just continue to gather Hoss’ rollups each week, but until we get 
the jenkins stuff back running it’s probably not worth the effort.
-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



Re: Potential BadApple report this week

2018-04-12 Thread Erick Erickson
Thanks! I'll leave the timerouted stuff  on the permanent "don't
BadApple" list

On Thu, Apr 12, 2018 at 8:11 AM, David Smiley  wrote:
> I'm gong to work on TimeRoutedAliasUpdateProcessorTest.test "real soon now"
> (and for that matter any other stuff related to TRAs) so please don't
> BadApple it.
> Thanks.
>
> On Tue, Apr 10, 2018 at 10:30 AM Erick Erickson 
> wrote:
>>
>> Shalin:
>>
>> Got it, thanks
>>
>> On Tue, Apr 10, 2018 at 2:26 AM, Shalin Shekhar Mangar
>>  wrote:
>> > Hi Erick,
>> >
>> > You can go ahead and mark TestTriggerIntegration as BadApple. I don't
>> > think
>> > anyone is working on it. I was working on TriggerIntegrationTest which
>> > is a
>> > different test.
>> >
>> > On Mon, Apr 9, 2018 at 10:02 PM, Erick Erickson
>> > 
>> > wrote:
>> >>
>> >> OK, this is the first week I have Hoss' report from two weeks ago so
>> >> the list is rather lengthy.
>> >>
>> >>
>> >> I believe this test is being actively worked on, so no BadApple for it
>> >>
>> >>
>> >> org.apache.solr.cloud.autoscaling.sim.TestTriggerIntegration.testEventQueue
>> >>
>> >> ***Tests I'll BadApple on Thursday.
>> >>
>> >> These are tests that failed in the last week and _also_ are failures
>> >> in Hoss' report from two weeks ago, so nobody has addressed them in
>> >> that time-frame.
>> >>
>> >> PLEASE LET ME KNOW BEFORE THURSDAY WHICH OF THESE SHOULD NOT BE
>> >> BADAPPLEd
>> >>
>> >>org.apache.lucene.index.TestIndexSorting.testRandom3
>> >>org.apache.solr.TestDistributedSearch.test
>> >>
>> >>
>> >> org.apache.solr.client.solrj.io.stream.StreamExpressionTest.testDistributions
>> >>org.apache.solr.cloud.AddReplicaTest.test
>> >>org.apache.solr.cloud.AliasIntegrationTest.testModifyPropertiesV1
>> >>org.apache.solr.cloud.ChaosMonkeyNothingIsSafeTest.test
>> >>
>> >>
>> >> org.apache.solr.cloud.CreateRoutedAliasTest.testCollectionNamesMustBeAbsent
>> >>org.apache.solr.cloud.CreateRoutedAliasTest.testTimezoneAbsoluteDate
>> >>org.apache.solr.cloud.CreateRoutedAliasTest.testV1
>> >>org.apache.solr.cloud.CreateRoutedAliasTest.testV2
>> >>org.apache.solr.cloud.DeleteReplicaTest.deleteReplicaOnIndexing
>> >>
>> >>
>> >> org.apache.solr.cloud.TestCloudRecovery.leaderRecoverFromLogOnStartupTest
>> >>
>> >>
>> >> org.apache.solr.cloud.TestLeaderInitiatedRecoveryThread.testPublishDownState
>> >>org.apache.solr.cloud.TestStressInPlaceUpdates.stressTest
>> >>
>> >>
>> >> org.apache.solr.cloud.api.collections.TestCollectionsAPIViaSolrCloudCluster.testCollectionCreateWithoutCoresThenDelete
>> >>
>> >>
>> >> org.apache.solr.cloud.autoscaling.ComputePlanActionTest.testSelectedCollections
>> >>org.apache.solr.cloud.autoscaling.ScheduledTriggerTest.testTrigger
>> >>
>> >>
>> >> org.apache.solr.cloud.autoscaling.sim.TestTriggerIntegration.testNodeAddedTriggerRestoreState
>> >>
>> >>
>> >> org.apache.solr.common.cloud.TestCollectionStateWatchers.testDeletionsTriggerWatches
>> >>
>> >>
>> >> org.apache.solr.handler.TestReplicationHandler.doTestIndexAndConfigReplication
>> >>
>> >>
>> >> org.apache.solr.handler.TestReplicationHandler.doTestIndexFetchWithMasterUrl
>> >>
>> >>
>> >> org.apache.solr.handler.TestReplicationHandler.doTestReplicateAfterCoreReload
>> >>
>> >> org.apache.solr.handler.TestReplicationHandler.doTestStressReplication
>> >>
>> >> org.apache.solr.handler.admin.AutoscalingHistoryHandlerTest.testHistory
>> >>
>> >>
>> >> org.apache.solr.update.processor.TimeRoutedAliasUpdateProcessorTest.test
>> >>
>> >>
>> >> ***Tests currently BadApple-d
>> >>
>> >> *AwaitsFix Annotations:
>> >>
>> >>
>> >> Lucene AwaitsFix
>> >> TestControlledRealTimeReopenThread.java
>> >>testCRTReopen()
>> >>@AwaitsFix(bugUrl =
>> >> "https://issues.apache.org/jira/browse/LUCENE-5737;)
>> >>
>> >> TestICUNormalizer2CharFilter.java
>> >>testRandomStrings()
>> >>@AwaitsFix(bugUrl =
>> >> "https://issues.apache.org/jira/browse/LUCENE-5595;)
>> >>
>> >> TestMoreLikeThis.java
>> >>testMultiFieldShouldReturnPerFieldBooleanQuery()
>> >>@AwaitsFix(bugUrl =
>> >> "https://issues.apache.org/jira/browse/LUCENE-7161;)
>> >>
>> >> UIMABaseAnalyzerTest.java
>> >>testRandomStrings()
>> >>@Test @AwaitsFix(bugUrl =
>> >> "https://issues.apache.org/jira/browse/LUCENE-3869;)
>> >>
>> >> UIMABaseAnalyzerTest.java
>> >>testRandomStringsWithConfigurationParameters()
>> >>@Test @AwaitsFix(bugUrl =
>> >> "https://issues.apache.org/jira/browse/LUCENE-3869;)
>> >>
>> >> UIMATypeAwareAnalyzerTest.java
>> >>testRandomStrings()
>> >>@Test @AwaitsFix(bugUrl =
>> >> "https://issues.apache.org/jira/browse/LUCENE-3869;)
>> >>
>> >>
>> >> Solr AwaitsFix
>> >> ReplaceNodeNoTargetTest.java
>> >>ReplaceNodeNoTargetTest suite
>> >>@LuceneTestCase.AwaitsFix(bugUrl =
>> >> 

Re: Potential BadApple report this week

2018-04-12 Thread David Smiley
I'm gong to work on TimeRoutedAliasUpdateProcessorTest.test "real soon now"
(and for that matter any other stuff related to TRAs) so please don't
BadApple it.
Thanks.

On Tue, Apr 10, 2018 at 10:30 AM Erick Erickson 
wrote:

> Shalin:
>
> Got it, thanks
>
> On Tue, Apr 10, 2018 at 2:26 AM, Shalin Shekhar Mangar
>  wrote:
> > Hi Erick,
> >
> > You can go ahead and mark TestTriggerIntegration as BadApple. I don't
> think
> > anyone is working on it. I was working on TriggerIntegrationTest which
> is a
> > different test.
> >
> > On Mon, Apr 9, 2018 at 10:02 PM, Erick Erickson  >
> > wrote:
> >>
> >> OK, this is the first week I have Hoss' report from two weeks ago so
> >> the list is rather lengthy.
> >>
> >>
> >> I believe this test is being actively worked on, so no BadApple for it
> >>
> >>
> org.apache.solr.cloud.autoscaling.sim.TestTriggerIntegration.testEventQueue
> >>
> >> ***Tests I'll BadApple on Thursday.
> >>
> >> These are tests that failed in the last week and _also_ are failures
> >> in Hoss' report from two weeks ago, so nobody has addressed them in
> >> that time-frame.
> >>
> >> PLEASE LET ME KNOW BEFORE THURSDAY WHICH OF THESE SHOULD NOT BE
> BADAPPLEd
> >>
> >>org.apache.lucene.index.TestIndexSorting.testRandom3
> >>org.apache.solr.TestDistributedSearch.test
> >>
> >>
> org.apache.solr.client.solrj.io.stream.StreamExpressionTest.testDistributions
> >>org.apache.solr.cloud.AddReplicaTest.test
> >>org.apache.solr.cloud.AliasIntegrationTest.testModifyPropertiesV1
> >>org.apache.solr.cloud.ChaosMonkeyNothingIsSafeTest.test
> >>
> >>
> org.apache.solr.cloud.CreateRoutedAliasTest.testCollectionNamesMustBeAbsent
> >>org.apache.solr.cloud.CreateRoutedAliasTest.testTimezoneAbsoluteDate
> >>org.apache.solr.cloud.CreateRoutedAliasTest.testV1
> >>org.apache.solr.cloud.CreateRoutedAliasTest.testV2
> >>org.apache.solr.cloud.DeleteReplicaTest.deleteReplicaOnIndexing
> >>
> >>
> org.apache.solr.cloud.TestCloudRecovery.leaderRecoverFromLogOnStartupTest
> >>
> >>
> org.apache.solr.cloud.TestLeaderInitiatedRecoveryThread.testPublishDownState
> >>org.apache.solr.cloud.TestStressInPlaceUpdates.stressTest
> >>
> >>
> org.apache.solr.cloud.api.collections.TestCollectionsAPIViaSolrCloudCluster.testCollectionCreateWithoutCoresThenDelete
> >>
> >>
> org.apache.solr.cloud.autoscaling.ComputePlanActionTest.testSelectedCollections
> >>org.apache.solr.cloud.autoscaling.ScheduledTriggerTest.testTrigger
> >>
> >>
> org.apache.solr.cloud.autoscaling.sim.TestTriggerIntegration.testNodeAddedTriggerRestoreState
> >>
> >>
> org.apache.solr.common.cloud.TestCollectionStateWatchers.testDeletionsTriggerWatches
> >>
> >>
> org.apache.solr.handler.TestReplicationHandler.doTestIndexAndConfigReplication
> >>
> >>
> org.apache.solr.handler.TestReplicationHandler.doTestIndexFetchWithMasterUrl
> >>
> >>
> org.apache.solr.handler.TestReplicationHandler.doTestReplicateAfterCoreReload
> >>
> org.apache.solr.handler.TestReplicationHandler.doTestStressReplication
> >>
> org.apache.solr.handler.admin.AutoscalingHistoryHandlerTest.testHistory
> >>
> >> org.apache.solr.update.processor.TimeRoutedAliasUpdateProcessorTest.test
> >>
> >>
> >> ***Tests currently BadApple-d
> >>
> >> *AwaitsFix Annotations:
> >>
> >>
> >> Lucene AwaitsFix
> >> TestControlledRealTimeReopenThread.java
> >>testCRTReopen()
> >>@AwaitsFix(bugUrl =
> >> "https://issues.apache.org/jira/browse/LUCENE-5737;)
> >>
> >> TestICUNormalizer2CharFilter.java
> >>testRandomStrings()
> >>@AwaitsFix(bugUrl =
> >> "https://issues.apache.org/jira/browse/LUCENE-5595;)
> >>
> >> TestMoreLikeThis.java
> >>testMultiFieldShouldReturnPerFieldBooleanQuery()
> >>@AwaitsFix(bugUrl =
> >> "https://issues.apache.org/jira/browse/LUCENE-7161;)
> >>
> >> UIMABaseAnalyzerTest.java
> >>testRandomStrings()
> >>@Test @AwaitsFix(bugUrl =
> >> "https://issues.apache.org/jira/browse/LUCENE-3869;)
> >>
> >> UIMABaseAnalyzerTest.java
> >>testRandomStringsWithConfigurationParameters()
> >>@Test @AwaitsFix(bugUrl =
> >> "https://issues.apache.org/jira/browse/LUCENE-3869;)
> >>
> >> UIMATypeAwareAnalyzerTest.java
> >>testRandomStrings()
> >>@Test @AwaitsFix(bugUrl =
> >> "https://issues.apache.org/jira/browse/LUCENE-3869;)
> >>
> >>
> >> Solr AwaitsFix
> >> ReplaceNodeNoTargetTest.java
> >>ReplaceNodeNoTargetTest suite
> >>@LuceneTestCase.AwaitsFix(bugUrl =
> >> "https://issues.apache.org/jira/browse/SOLR-11067;)
> >>
> >> TestCollapseQParserPlugin.java
> >>testStringCollapse()
> >>@AwaitsFix(bugUrl="https://issues.apache.org/jira/browse/SOLR-11974
> ")
> >>
> >> TestImpersonationWithHadoopAuth.java
> >>testForwarding()
> >>@AwaitsFix(bugUrl="https://issues.apache.org/jira/browse/HADOOP-9893
> ")
> >>
> >> TestLTRReRankingPipeline.java
> >> 

Re: Potential BadApple report this week

2018-04-10 Thread Erick Erickson
Shalin:

Got it, thanks

On Tue, Apr 10, 2018 at 2:26 AM, Shalin Shekhar Mangar
 wrote:
> Hi Erick,
>
> You can go ahead and mark TestTriggerIntegration as BadApple. I don't think
> anyone is working on it. I was working on TriggerIntegrationTest which is a
> different test.
>
> On Mon, Apr 9, 2018 at 10:02 PM, Erick Erickson 
> wrote:
>>
>> OK, this is the first week I have Hoss' report from two weeks ago so
>> the list is rather lengthy.
>>
>>
>> I believe this test is being actively worked on, so no BadApple for it
>>
>> org.apache.solr.cloud.autoscaling.sim.TestTriggerIntegration.testEventQueue
>>
>> ***Tests I'll BadApple on Thursday.
>>
>> These are tests that failed in the last week and _also_ are failures
>> in Hoss' report from two weeks ago, so nobody has addressed them in
>> that time-frame.
>>
>> PLEASE LET ME KNOW BEFORE THURSDAY WHICH OF THESE SHOULD NOT BE BADAPPLEd
>>
>>org.apache.lucene.index.TestIndexSorting.testRandom3
>>org.apache.solr.TestDistributedSearch.test
>>
>> org.apache.solr.client.solrj.io.stream.StreamExpressionTest.testDistributions
>>org.apache.solr.cloud.AddReplicaTest.test
>>org.apache.solr.cloud.AliasIntegrationTest.testModifyPropertiesV1
>>org.apache.solr.cloud.ChaosMonkeyNothingIsSafeTest.test
>>
>> org.apache.solr.cloud.CreateRoutedAliasTest.testCollectionNamesMustBeAbsent
>>org.apache.solr.cloud.CreateRoutedAliasTest.testTimezoneAbsoluteDate
>>org.apache.solr.cloud.CreateRoutedAliasTest.testV1
>>org.apache.solr.cloud.CreateRoutedAliasTest.testV2
>>org.apache.solr.cloud.DeleteReplicaTest.deleteReplicaOnIndexing
>>
>> org.apache.solr.cloud.TestCloudRecovery.leaderRecoverFromLogOnStartupTest
>>
>> org.apache.solr.cloud.TestLeaderInitiatedRecoveryThread.testPublishDownState
>>org.apache.solr.cloud.TestStressInPlaceUpdates.stressTest
>>
>> org.apache.solr.cloud.api.collections.TestCollectionsAPIViaSolrCloudCluster.testCollectionCreateWithoutCoresThenDelete
>>
>> org.apache.solr.cloud.autoscaling.ComputePlanActionTest.testSelectedCollections
>>org.apache.solr.cloud.autoscaling.ScheduledTriggerTest.testTrigger
>>
>> org.apache.solr.cloud.autoscaling.sim.TestTriggerIntegration.testNodeAddedTriggerRestoreState
>>
>> org.apache.solr.common.cloud.TestCollectionStateWatchers.testDeletionsTriggerWatches
>>
>> org.apache.solr.handler.TestReplicationHandler.doTestIndexAndConfigReplication
>>
>> org.apache.solr.handler.TestReplicationHandler.doTestIndexFetchWithMasterUrl
>>
>> org.apache.solr.handler.TestReplicationHandler.doTestReplicateAfterCoreReload
>>org.apache.solr.handler.TestReplicationHandler.doTestStressReplication
>>org.apache.solr.handler.admin.AutoscalingHistoryHandlerTest.testHistory
>>
>> org.apache.solr.update.processor.TimeRoutedAliasUpdateProcessorTest.test
>>
>>
>> ***Tests currently BadApple-d
>>
>> *AwaitsFix Annotations:
>>
>>
>> Lucene AwaitsFix
>> TestControlledRealTimeReopenThread.java
>>testCRTReopen()
>>@AwaitsFix(bugUrl =
>> "https://issues.apache.org/jira/browse/LUCENE-5737;)
>>
>> TestICUNormalizer2CharFilter.java
>>testRandomStrings()
>>@AwaitsFix(bugUrl =
>> "https://issues.apache.org/jira/browse/LUCENE-5595;)
>>
>> TestMoreLikeThis.java
>>testMultiFieldShouldReturnPerFieldBooleanQuery()
>>@AwaitsFix(bugUrl =
>> "https://issues.apache.org/jira/browse/LUCENE-7161;)
>>
>> UIMABaseAnalyzerTest.java
>>testRandomStrings()
>>@Test @AwaitsFix(bugUrl =
>> "https://issues.apache.org/jira/browse/LUCENE-3869;)
>>
>> UIMABaseAnalyzerTest.java
>>testRandomStringsWithConfigurationParameters()
>>@Test @AwaitsFix(bugUrl =
>> "https://issues.apache.org/jira/browse/LUCENE-3869;)
>>
>> UIMATypeAwareAnalyzerTest.java
>>testRandomStrings()
>>@Test @AwaitsFix(bugUrl =
>> "https://issues.apache.org/jira/browse/LUCENE-3869;)
>>
>>
>> Solr AwaitsFix
>> ReplaceNodeNoTargetTest.java
>>ReplaceNodeNoTargetTest suite
>>@LuceneTestCase.AwaitsFix(bugUrl =
>> "https://issues.apache.org/jira/browse/SOLR-11067;)
>>
>> TestCollapseQParserPlugin.java
>>testStringCollapse()
>>@AwaitsFix(bugUrl="https://issues.apache.org/jira/browse/SOLR-11974;)
>>
>> TestImpersonationWithHadoopAuth.java
>>testForwarding()
>>@AwaitsFix(bugUrl="https://issues.apache.org/jira/browse/HADOOP-9893;)
>>
>> TestLTRReRankingPipeline.java
>>testDifferentTopN()
>>@AwaitsFix(bugUrl = "https://issues.apache.org/jira/browse/SOLR-11134;)
>>
>> TestMinMaxOnMultiValuedField.java
>>testDoubleFieldCache()
>>@AwaitsFix(bugUrl =
>> "https://issues.apache.org/jira/browse/LUCENE-6709;)
>>
>> TestMinMaxOnMultiValuedField.java
>>testFloatFieldCache()
>>@AwaitsFix(bugUrl =
>> "https://issues.apache.org/jira/browse/LUCENE-6709;)
>>
>> TestMinMaxOnMultiValuedField.java
>>testIntFieldCache()
>>@AwaitsFix(bugUrl =
>> 

Re: Potential BadApple report this week

2018-04-10 Thread Shalin Shekhar Mangar
Hi Erick,

You can go ahead and mark TestTriggerIntegration as BadApple. I don't think
anyone is working on it. I was working on TriggerIntegrationTest which is a
different test.

On Mon, Apr 9, 2018 at 10:02 PM, Erick Erickson 
wrote:

> OK, this is the first week I have Hoss' report from two weeks ago so
> the list is rather lengthy.
>
>
> I believe this test is being actively worked on, so no BadApple for it
>org.apache.solr.cloud.autoscaling.sim.TestTriggerIntegration.
> testEventQueue
>
> ***Tests I'll BadApple on Thursday.
>
> These are tests that failed in the last week and _also_ are failures
> in Hoss' report from two weeks ago, so nobody has addressed them in
> that time-frame.
>
> PLEASE LET ME KNOW BEFORE THURSDAY WHICH OF THESE SHOULD NOT BE BADAPPLEd
>
>org.apache.lucene.index.TestIndexSorting.testRandom3
>org.apache.solr.TestDistributedSearch.test
>org.apache.solr.client.solrj.io.stream.StreamExpressionTest.
> testDistributions
>org.apache.solr.cloud.AddReplicaTest.test
>org.apache.solr.cloud.AliasIntegrationTest.testModifyPropertiesV1
>org.apache.solr.cloud.ChaosMonkeyNothingIsSafeTest.test
>org.apache.solr.cloud.CreateRoutedAliasTest.
> testCollectionNamesMustBeAbsent
>org.apache.solr.cloud.CreateRoutedAliasTest.testTimezoneAbsoluteDate
>org.apache.solr.cloud.CreateRoutedAliasTest.testV1
>org.apache.solr.cloud.CreateRoutedAliasTest.testV2
>org.apache.solr.cloud.DeleteReplicaTest.deleteReplicaOnIndexing
>org.apache.solr.cloud.TestCloudRecovery.leaderRecoverFromLogOnStartupT
> est
>org.apache.solr.cloud.TestLeaderInitiatedRecoveryThr
> ead.testPublishDownState
>org.apache.solr.cloud.TestStressInPlaceUpdates.stressTest
>org.apache.solr.cloud.api.collections.TestCollectionsAPIViaSolrCloud
> Cluster.testCollectionCreateWithoutCoresThenDelete
>org.apache.solr.cloud.autoscaling.ComputePlanActionTest.
> testSelectedCollections
>org.apache.solr.cloud.autoscaling.ScheduledTriggerTest.testTrigger
>org.apache.solr.cloud.autoscaling.sim.TestTriggerIntegration.
> testNodeAddedTriggerRestoreState
>org.apache.solr.common.cloud.TestCollectionStateWatchers.
> testDeletionsTriggerWatches
>org.apache.solr.handler.TestReplicationHandler.
> doTestIndexAndConfigReplication
>org.apache.solr.handler.TestReplicationHandler.
> doTestIndexFetchWithMasterUrl
>org.apache.solr.handler.TestReplicationHandler.
> doTestReplicateAfterCoreReload
>org.apache.solr.handler.TestReplicationHandler.doTestStressReplication
>org.apache.solr.handler.admin.AutoscalingHistoryHandlerTest.testHistory
>org.apache.solr.update.processor.TimeRoutedAliasUpdateProcessor
> Test.test
>
>
> ***Tests currently BadApple-d
>
> *AwaitsFix Annotations:
>
>
> Lucene AwaitsFix
> TestControlledRealTimeReopenThread.java
>testCRTReopen()
>@AwaitsFix(bugUrl = "https://issues.apache.org/jira/browse/LUCENE-5737
> ")
>
> TestICUNormalizer2CharFilter.java
>testRandomStrings()
>@AwaitsFix(bugUrl = "https://issues.apache.org/jira/browse/LUCENE-5595
> ")
>
> TestMoreLikeThis.java
>testMultiFieldShouldReturnPerFieldBooleanQuery()
>@AwaitsFix(bugUrl = "https://issues.apache.org/jira/browse/LUCENE-7161
> ")
>
> UIMABaseAnalyzerTest.java
>testRandomStrings()
>@Test @AwaitsFix(bugUrl =
> "https://issues.apache.org/jira/browse/LUCENE-3869;)
>
> UIMABaseAnalyzerTest.java
>testRandomStringsWithConfigurationParameters()
>@Test @AwaitsFix(bugUrl =
> "https://issues.apache.org/jira/browse/LUCENE-3869;)
>
> UIMATypeAwareAnalyzerTest.java
>testRandomStrings()
>@Test @AwaitsFix(bugUrl =
> "https://issues.apache.org/jira/browse/LUCENE-3869;)
>
>
> Solr AwaitsFix
> ReplaceNodeNoTargetTest.java
>ReplaceNodeNoTargetTest suite
>@LuceneTestCase.AwaitsFix(bugUrl =
> "https://issues.apache.org/jira/browse/SOLR-11067;)
>
> TestCollapseQParserPlugin.java
>testStringCollapse()
>@AwaitsFix(bugUrl="https://issues.apache.org/jira/browse/SOLR-11974;)
>
> TestImpersonationWithHadoopAuth.java
>testForwarding()
>@AwaitsFix(bugUrl="https://issues.apache.org/jira/browse/HADOOP-9893;)
>
> TestLTRReRankingPipeline.java
>testDifferentTopN()
>@AwaitsFix(bugUrl = "https://issues.apache.org/jira/browse/SOLR-11134;)
>
> TestMinMaxOnMultiValuedField.java
>testDoubleFieldCache()
>@AwaitsFix(bugUrl = "https://issues.apache.org/jira/browse/LUCENE-6709
> ")
>
> TestMinMaxOnMultiValuedField.java
>testFloatFieldCache()
>@AwaitsFix(bugUrl = "https://issues.apache.org/jira/browse/LUCENE-6709
> ")
>
> TestMinMaxOnMultiValuedField.java
>testIntFieldCache()
>@AwaitsFix(bugUrl = "https://issues.apache.org/jira/browse/LUCENE-6709
> ")
>
> TestMinMaxOnMultiValuedField.java
>testLongFieldCache()
>@AwaitsFix(bugUrl = "https://issues.apache.org/jira/browse/LUCENE-6709
> ")
>
>
>
> 

Re: Potential BadApple report this week

2018-04-09 Thread Erick Erickson
Cool. I'll put them on my permanent "don't BadApple" list. If they're
still there in a month I'll ask again

And for everyone. I have no objection at all to leaving failing tests
in if people are working on them. I realize that sometimes the only
way to get them to fail is to leave them running

Erick



On Mon, Apr 9, 2018 at 9:38 AM, Steve Rowe  wrote:
> Hi Erick,
>
> Please don’t BadApple any TestReplicationHandler tests, I’m looking at them 
> this week:
>
>   
> org.apache.solr.handler.TestReplicationHandler.doTestIndexAndConfigReplication
>   org.apache.solr.handler.TestReplicationHandler.doTestIndexFetchWithMasterUrl
>   
> org.apache.solr.handler.TestReplicationHandler.doTestReplicateAfterCoreReload
>   org.apache.solr.handler.TestReplicationHandler.doTestStressReplication
>
> Thanks,
>
> --
> Steve
> www.lucidworks.com
>
>> On Apr 9, 2018, at 12:32 PM, Erick Erickson  wrote:
>>
>> OK, this is the first week I have Hoss' report from two weeks ago so
>> the list is rather lengthy.
>>
>>
>> I believe this test is being actively worked on, so no BadApple for it
>>   org.apache.solr.cloud.autoscaling.sim.TestTriggerIntegration.testEventQueue
>>
>> ***Tests I'll BadApple on Thursday.
>>
>> These are tests that failed in the last week and _also_ are failures
>> in Hoss' report from two weeks ago, so nobody has addressed them in
>> that time-frame.
>>
>> PLEASE LET ME KNOW BEFORE THURSDAY WHICH OF THESE SHOULD NOT BE BADAPPLEd
>>
>>   org.apache.lucene.index.TestIndexSorting.testRandom3
>>   org.apache.solr.TestDistributedSearch.test
>>   
>> org.apache.solr.client.solrj.io.stream.StreamExpressionTest.testDistributions
>>   org.apache.solr.cloud.AddReplicaTest.test
>>   org.apache.solr.cloud.AliasIntegrationTest.testModifyPropertiesV1
>>   org.apache.solr.cloud.ChaosMonkeyNothingIsSafeTest.test
>>   org.apache.solr.cloud.CreateRoutedAliasTest.testCollectionNamesMustBeAbsent
>>   org.apache.solr.cloud.CreateRoutedAliasTest.testTimezoneAbsoluteDate
>>   org.apache.solr.cloud.CreateRoutedAliasTest.testV1
>>   org.apache.solr.cloud.CreateRoutedAliasTest.testV2
>>   org.apache.solr.cloud.DeleteReplicaTest.deleteReplicaOnIndexing
>>   org.apache.solr.cloud.TestCloudRecovery.leaderRecoverFromLogOnStartupTest
>>   
>> org.apache.solr.cloud.TestLeaderInitiatedRecoveryThread.testPublishDownState
>>   org.apache.solr.cloud.TestStressInPlaceUpdates.stressTest
>>   
>> org.apache.solr.cloud.api.collections.TestCollectionsAPIViaSolrCloudCluster.testCollectionCreateWithoutCoresThenDelete
>>   
>> org.apache.solr.cloud.autoscaling.ComputePlanActionTest.testSelectedCollections
>>   org.apache.solr.cloud.autoscaling.ScheduledTriggerTest.testTrigger
>>   
>> org.apache.solr.cloud.autoscaling.sim.TestTriggerIntegration.testNodeAddedTriggerRestoreState
>>   
>> org.apache.solr.common.cloud.TestCollectionStateWatchers.testDeletionsTriggerWatches
>>   
>> org.apache.solr.handler.TestReplicationHandler.doTestIndexAndConfigReplication
>>   
>> org.apache.solr.handler.TestReplicationHandler.doTestIndexFetchWithMasterUrl
>>   
>> org.apache.solr.handler.TestReplicationHandler.doTestReplicateAfterCoreReload
>>   org.apache.solr.handler.TestReplicationHandler.doTestStressReplication
>>   org.apache.solr.handler.admin.AutoscalingHistoryHandlerTest.testHistory
>>   org.apache.solr.update.processor.TimeRoutedAliasUpdateProcessorTest.test
>>
>>
>> ***Tests currently BadApple-d
>>
>> *AwaitsFix Annotations:
>>
>>
>> Lucene AwaitsFix
>> TestControlledRealTimeReopenThread.java
>>   testCRTReopen()
>>   @AwaitsFix(bugUrl = "https://issues.apache.org/jira/browse/LUCENE-5737;)
>>
>> TestICUNormalizer2CharFilter.java
>>   testRandomStrings()
>>   @AwaitsFix(bugUrl = "https://issues.apache.org/jira/browse/LUCENE-5595;)
>>
>> TestMoreLikeThis.java
>>   testMultiFieldShouldReturnPerFieldBooleanQuery()
>>   @AwaitsFix(bugUrl = "https://issues.apache.org/jira/browse/LUCENE-7161;)
>>
>> UIMABaseAnalyzerTest.java
>>   testRandomStrings()
>>   @Test @AwaitsFix(bugUrl =
>> "https://issues.apache.org/jira/browse/LUCENE-3869;)
>>
>> UIMABaseAnalyzerTest.java
>>   testRandomStringsWithConfigurationParameters()
>>   @Test @AwaitsFix(bugUrl =
>> "https://issues.apache.org/jira/browse/LUCENE-3869;)
>>
>> UIMATypeAwareAnalyzerTest.java
>>   testRandomStrings()
>>   @Test @AwaitsFix(bugUrl =
>> "https://issues.apache.org/jira/browse/LUCENE-3869;)
>>
>>
>>Solr AwaitsFix
>> ReplaceNodeNoTargetTest.java
>>   ReplaceNodeNoTargetTest suite
>>   @LuceneTestCase.AwaitsFix(bugUrl =
>> "https://issues.apache.org/jira/browse/SOLR-11067;)
>>
>> TestCollapseQParserPlugin.java
>>   testStringCollapse()
>>   @AwaitsFix(bugUrl="https://issues.apache.org/jira/browse/SOLR-11974;)
>>
>> TestImpersonationWithHadoopAuth.java
>>   testForwarding()
>>   @AwaitsFix(bugUrl="https://issues.apache.org/jira/browse/HADOOP-9893;)
>>
>> 

Re: Potential BadApple report this week

2018-04-09 Thread Steve Rowe
Hi Erick,

Please don’t BadApple any TestReplicationHandler tests, I’m looking at them 
this week:

  org.apache.solr.handler.TestReplicationHandler.doTestIndexAndConfigReplication
  org.apache.solr.handler.TestReplicationHandler.doTestIndexFetchWithMasterUrl
  org.apache.solr.handler.TestReplicationHandler.doTestReplicateAfterCoreReload
  org.apache.solr.handler.TestReplicationHandler.doTestStressReplication

Thanks,

--
Steve
www.lucidworks.com

> On Apr 9, 2018, at 12:32 PM, Erick Erickson  wrote:
> 
> OK, this is the first week I have Hoss' report from two weeks ago so
> the list is rather lengthy.
> 
> 
> I believe this test is being actively worked on, so no BadApple for it
>   org.apache.solr.cloud.autoscaling.sim.TestTriggerIntegration.testEventQueue
> 
> ***Tests I'll BadApple on Thursday.
> 
> These are tests that failed in the last week and _also_ are failures
> in Hoss' report from two weeks ago, so nobody has addressed them in
> that time-frame.
> 
> PLEASE LET ME KNOW BEFORE THURSDAY WHICH OF THESE SHOULD NOT BE BADAPPLEd
> 
>   org.apache.lucene.index.TestIndexSorting.testRandom3
>   org.apache.solr.TestDistributedSearch.test
>   
> org.apache.solr.client.solrj.io.stream.StreamExpressionTest.testDistributions
>   org.apache.solr.cloud.AddReplicaTest.test
>   org.apache.solr.cloud.AliasIntegrationTest.testModifyPropertiesV1
>   org.apache.solr.cloud.ChaosMonkeyNothingIsSafeTest.test
>   org.apache.solr.cloud.CreateRoutedAliasTest.testCollectionNamesMustBeAbsent
>   org.apache.solr.cloud.CreateRoutedAliasTest.testTimezoneAbsoluteDate
>   org.apache.solr.cloud.CreateRoutedAliasTest.testV1
>   org.apache.solr.cloud.CreateRoutedAliasTest.testV2
>   org.apache.solr.cloud.DeleteReplicaTest.deleteReplicaOnIndexing
>   org.apache.solr.cloud.TestCloudRecovery.leaderRecoverFromLogOnStartupTest
>   org.apache.solr.cloud.TestLeaderInitiatedRecoveryThread.testPublishDownState
>   org.apache.solr.cloud.TestStressInPlaceUpdates.stressTest
>   
> org.apache.solr.cloud.api.collections.TestCollectionsAPIViaSolrCloudCluster.testCollectionCreateWithoutCoresThenDelete
>   
> org.apache.solr.cloud.autoscaling.ComputePlanActionTest.testSelectedCollections
>   org.apache.solr.cloud.autoscaling.ScheduledTriggerTest.testTrigger
>   
> org.apache.solr.cloud.autoscaling.sim.TestTriggerIntegration.testNodeAddedTriggerRestoreState
>   
> org.apache.solr.common.cloud.TestCollectionStateWatchers.testDeletionsTriggerWatches
>   
> org.apache.solr.handler.TestReplicationHandler.doTestIndexAndConfigReplication
>   org.apache.solr.handler.TestReplicationHandler.doTestIndexFetchWithMasterUrl
>   
> org.apache.solr.handler.TestReplicationHandler.doTestReplicateAfterCoreReload
>   org.apache.solr.handler.TestReplicationHandler.doTestStressReplication
>   org.apache.solr.handler.admin.AutoscalingHistoryHandlerTest.testHistory
>   org.apache.solr.update.processor.TimeRoutedAliasUpdateProcessorTest.test
> 
> 
> ***Tests currently BadApple-d
> 
> *AwaitsFix Annotations:
> 
> 
> Lucene AwaitsFix
> TestControlledRealTimeReopenThread.java
>   testCRTReopen()
>   @AwaitsFix(bugUrl = "https://issues.apache.org/jira/browse/LUCENE-5737;)
> 
> TestICUNormalizer2CharFilter.java
>   testRandomStrings()
>   @AwaitsFix(bugUrl = "https://issues.apache.org/jira/browse/LUCENE-5595;)
> 
> TestMoreLikeThis.java
>   testMultiFieldShouldReturnPerFieldBooleanQuery()
>   @AwaitsFix(bugUrl = "https://issues.apache.org/jira/browse/LUCENE-7161;)
> 
> UIMABaseAnalyzerTest.java
>   testRandomStrings()
>   @Test @AwaitsFix(bugUrl =
> "https://issues.apache.org/jira/browse/LUCENE-3869;)
> 
> UIMABaseAnalyzerTest.java
>   testRandomStringsWithConfigurationParameters()
>   @Test @AwaitsFix(bugUrl =
> "https://issues.apache.org/jira/browse/LUCENE-3869;)
> 
> UIMATypeAwareAnalyzerTest.java
>   testRandomStrings()
>   @Test @AwaitsFix(bugUrl =
> "https://issues.apache.org/jira/browse/LUCENE-3869;)
> 
> 
>Solr AwaitsFix
> ReplaceNodeNoTargetTest.java
>   ReplaceNodeNoTargetTest suite
>   @LuceneTestCase.AwaitsFix(bugUrl =
> "https://issues.apache.org/jira/browse/SOLR-11067;)
> 
> TestCollapseQParserPlugin.java
>   testStringCollapse()
>   @AwaitsFix(bugUrl="https://issues.apache.org/jira/browse/SOLR-11974;)
> 
> TestImpersonationWithHadoopAuth.java
>   testForwarding()
>   @AwaitsFix(bugUrl="https://issues.apache.org/jira/browse/HADOOP-9893;)
> 
> TestLTRReRankingPipeline.java
>   testDifferentTopN()
>   @AwaitsFix(bugUrl = "https://issues.apache.org/jira/browse/SOLR-11134;)
> 
> TestMinMaxOnMultiValuedField.java
>   testDoubleFieldCache()
>   @AwaitsFix(bugUrl = "https://issues.apache.org/jira/browse/LUCENE-6709;)
> 
> TestMinMaxOnMultiValuedField.java
>   testFloatFieldCache()
>   @AwaitsFix(bugUrl = "https://issues.apache.org/jira/browse/LUCENE-6709;)
> 
> TestMinMaxOnMultiValuedField.java
>   testIntFieldCache()
>   @AwaitsFix(bugUrl = 

Potential BadApple report this week

2018-04-09 Thread Erick Erickson
OK, this is the first week I have Hoss' report from two weeks ago so
the list is rather lengthy.


I believe this test is being actively worked on, so no BadApple for it
   org.apache.solr.cloud.autoscaling.sim.TestTriggerIntegration.testEventQueue

***Tests I'll BadApple on Thursday.

These are tests that failed in the last week and _also_ are failures
in Hoss' report from two weeks ago, so nobody has addressed them in
that time-frame.

PLEASE LET ME KNOW BEFORE THURSDAY WHICH OF THESE SHOULD NOT BE BADAPPLEd

   org.apache.lucene.index.TestIndexSorting.testRandom3
   org.apache.solr.TestDistributedSearch.test
   org.apache.solr.client.solrj.io.stream.StreamExpressionTest.testDistributions
   org.apache.solr.cloud.AddReplicaTest.test
   org.apache.solr.cloud.AliasIntegrationTest.testModifyPropertiesV1
   org.apache.solr.cloud.ChaosMonkeyNothingIsSafeTest.test
   org.apache.solr.cloud.CreateRoutedAliasTest.testCollectionNamesMustBeAbsent
   org.apache.solr.cloud.CreateRoutedAliasTest.testTimezoneAbsoluteDate
   org.apache.solr.cloud.CreateRoutedAliasTest.testV1
   org.apache.solr.cloud.CreateRoutedAliasTest.testV2
   org.apache.solr.cloud.DeleteReplicaTest.deleteReplicaOnIndexing
   org.apache.solr.cloud.TestCloudRecovery.leaderRecoverFromLogOnStartupTest
   org.apache.solr.cloud.TestLeaderInitiatedRecoveryThread.testPublishDownState
   org.apache.solr.cloud.TestStressInPlaceUpdates.stressTest
   
org.apache.solr.cloud.api.collections.TestCollectionsAPIViaSolrCloudCluster.testCollectionCreateWithoutCoresThenDelete
   
org.apache.solr.cloud.autoscaling.ComputePlanActionTest.testSelectedCollections
   org.apache.solr.cloud.autoscaling.ScheduledTriggerTest.testTrigger
   
org.apache.solr.cloud.autoscaling.sim.TestTriggerIntegration.testNodeAddedTriggerRestoreState
   
org.apache.solr.common.cloud.TestCollectionStateWatchers.testDeletionsTriggerWatches
   
org.apache.solr.handler.TestReplicationHandler.doTestIndexAndConfigReplication
   org.apache.solr.handler.TestReplicationHandler.doTestIndexFetchWithMasterUrl
   org.apache.solr.handler.TestReplicationHandler.doTestReplicateAfterCoreReload
   org.apache.solr.handler.TestReplicationHandler.doTestStressReplication
   org.apache.solr.handler.admin.AutoscalingHistoryHandlerTest.testHistory
   org.apache.solr.update.processor.TimeRoutedAliasUpdateProcessorTest.test


***Tests currently BadApple-d

*AwaitsFix Annotations:


Lucene AwaitsFix
TestControlledRealTimeReopenThread.java
   testCRTReopen()
   @AwaitsFix(bugUrl = "https://issues.apache.org/jira/browse/LUCENE-5737;)

TestICUNormalizer2CharFilter.java
   testRandomStrings()
   @AwaitsFix(bugUrl = "https://issues.apache.org/jira/browse/LUCENE-5595;)

TestMoreLikeThis.java
   testMultiFieldShouldReturnPerFieldBooleanQuery()
   @AwaitsFix(bugUrl = "https://issues.apache.org/jira/browse/LUCENE-7161;)

UIMABaseAnalyzerTest.java
   testRandomStrings()
   @Test @AwaitsFix(bugUrl =
"https://issues.apache.org/jira/browse/LUCENE-3869;)

UIMABaseAnalyzerTest.java
   testRandomStringsWithConfigurationParameters()
   @Test @AwaitsFix(bugUrl =
"https://issues.apache.org/jira/browse/LUCENE-3869;)

UIMATypeAwareAnalyzerTest.java
   testRandomStrings()
   @Test @AwaitsFix(bugUrl =
"https://issues.apache.org/jira/browse/LUCENE-3869;)


Solr AwaitsFix
ReplaceNodeNoTargetTest.java
   ReplaceNodeNoTargetTest suite
   @LuceneTestCase.AwaitsFix(bugUrl =
"https://issues.apache.org/jira/browse/SOLR-11067;)

TestCollapseQParserPlugin.java
   testStringCollapse()
   @AwaitsFix(bugUrl="https://issues.apache.org/jira/browse/SOLR-11974;)

TestImpersonationWithHadoopAuth.java
   testForwarding()
   @AwaitsFix(bugUrl="https://issues.apache.org/jira/browse/HADOOP-9893;)

TestLTRReRankingPipeline.java
   testDifferentTopN()
   @AwaitsFix(bugUrl = "https://issues.apache.org/jira/browse/SOLR-11134;)

TestMinMaxOnMultiValuedField.java
   testDoubleFieldCache()
   @AwaitsFix(bugUrl = "https://issues.apache.org/jira/browse/LUCENE-6709;)

TestMinMaxOnMultiValuedField.java
   testFloatFieldCache()
   @AwaitsFix(bugUrl = "https://issues.apache.org/jira/browse/LUCENE-6709;)

TestMinMaxOnMultiValuedField.java
   testIntFieldCache()
   @AwaitsFix(bugUrl = "https://issues.apache.org/jira/browse/LUCENE-6709;)

TestMinMaxOnMultiValuedField.java
   testLongFieldCache()
   @AwaitsFix(bugUrl = "https://issues.apache.org/jira/browse/LUCENE-6709;)



*BadApple Annotations:

Lucene BadApple
TestLRUQueryCache.java
   testDocValuesUpdatesDontBreakCache()
   @BadApple(bugUrl="https://issues.apache.org/jira/browse/SOLR-12028;)


Solr BadApple
AliasIntegrationTest.java
   testModifyPropertiesCAR()
   @BadApple(bugUrl="https://issues.apache.org/jira/browse/SOLR-12028;)

AliasIntegrationTest.java
   testProperties()
   @BadApple(bugUrl="https://issues.apache.org/jira/browse/SOLR-12028;)

AutoAddReplicasIntegrationTest.java
   testSimple()
   

Re: BadApple report this week

2018-04-02 Thread Erick Erickson
Cool, I'll reconcile when it happens, perhaps this evening

On Mon, Apr 2, 2018 at 7:57 AM, Mark Miller  wrote:
> 4th report is coming today in a couple hours. Still a manual snag involved
> because I haven’t had a chance to figure out what happened with command line
> bitballoon site publishing.
>
> Mark
> On Mon, Apr 2, 2018 at 10:53 AM Erick Erickson 
> wrote:
>>
>> I don't have any candidates this week, I need one more week of
>> gathering data. But here's the overall report:
>>
>> Still collectins Hoss' reports, the 9th will be the first Monday I
>> have enough history.
>>
>> *Remove BadApple notations:
>>
>>
>> * Candidates this week (based on Mark's test results
>> only) Will BadApple them Thursday:
>>
>>
>>
>> All test failures
>>
>>
>> junit.framework.TestSuite.org.apache.lucene.index.TestIndexWriterThreadsToSegments
>> (18K bytes printed, exceeds 8K. just annotate)
>>
>> junit.framework.TestSuite.org.apache.solr.client.solrj.io.graph.GraphExpressionTest
>>
>> junit.framework.TestSuite.org.apache.solr.client.solrj.io.stream.SelectWithEvaluatorsTest
>>
>> junit.framework.TestSuite.org.apache.solr.cloud.api.collections.TestCollectionAPI
>>
>> junit.framework.TestSuite.org.apache.solr.cloud.ChaosMonkeyNothingIsSafeTest
>>
>> junit.framework.TestSuite.org.apache.solr.cloud.ConcurrentCreateRoutedAliasTest
>> junit.framework.TestSuite.org.apache.solr.cloud.DeleteNodeTest
>> junit.framework.TestSuite.org.apache.solr.cloud.RestartWhileUpdatingTest
>> junit.framework.TestSuite.org.apache.solr.cloud.ZkControllerTest
>> junit.framework.TestSuite.org.apache.solr.handler.TestReplicationHandler
>>
>> junit.framework.TestSuite.org.apache.solr.metrics.reporters.SolrJmxReporterCloudTest
>>
>> junit.framework.TestSuite.org.apache.solr.prometheus.collector.SolrCollectorTest
>>
>> junit.framework.TestSuite.org.apache.solr.prometheus.exporter.SolrExporterTest
>>
>> junit.framework.TestSuite.org.apache.solr.spelling.suggest.SuggesterWFSTTest
>> org.apache.lucene.index.TestBinaryDocValuesUpdates.testTonsOfUpdates
>>
>> org.apache.lucene.index.TestIndexingSequenceNumbers.testStressConcurrentCommit
>>
>> org.apache.lucene.index.TestIndexWriterThreadsToSegments.testSegmentCountOnFlushRandom
>>
>> org.apache.lucene.index.TestIndexWriterWithThreads.testIOExceptionDuringAbortWithThreads
>> org.apache.lucene.search.TestInetAddressRangeQueries.testRandomBig
>>
>> org.apache.solr.client.solrj.impl.CloudSolrClientTest.checkCollectionParameters
>>
>> org.apache.solr.client.solrj.io.stream.StreamExpressionTest.testGammaDistribution
>>
>> org.apache.solr.client.solrj.io.stream.StreamExpressionTest.testInnerJoinStream
>>
>> org.apache.solr.cloud.AliasIntegrationTest.testDeleteOneOfTwoCollectionsAliased
>> org.apache.solr.cloud.AliasIntegrationTest.testModifyPropertiesV1
>>
>> org.apache.solr.cloud.api.collections.ShardSplitTest.testSplitStaticIndexReplication
>> org.apache.solr.cloud.api.collections.TestCollectionAPI.test
>> org.apache.solr.cloud.autoscaling.ScheduledTriggerTest.testTrigger
>> org.apache.solr.cloud.autoscaling.sim.TestComputePlanAction.testNodeAdded
>> org.apache.solr.cloud.autoscaling.sim.TestComputePlanAction.testNodeLost
>>
>> org.apache.solr.cloud.autoscaling.sim.TestPolicyCloud.testCreateCollectionAddShardWithReplicaTypeUsingPolicy
>> org.apache.solr.cloud.autoscaling.TriggerIntegrationTest.testMetricTrigger
>> org.apache.solr.cloud.ChaosMonkeyNothingIsSafeTest.test
>>
>> org.apache.solr.cloud.ConcurrentCreateRoutedAliasTest.testConcurrentCreateRoutedAliasMinimal
>>
>> org.apache.solr.cloud.CreateRoutedAliasTest.testCollectionNamesMustBeAbsent
>> org.apache.solr.cloud.CreateRoutedAliasTest.testRandomRouterNameFails
>> org.apache.solr.cloud.CreateRoutedAliasTest.testTimezoneAbsoluteDate
>> org.apache.solr.cloud.CreateRoutedAliasTest.testV1
>> org.apache.solr.cloud.CreateRoutedAliasTest.testV2
>> org.apache.solr.cloud.DeleteInactiveReplicaTest.deleteInactiveReplicaTest
>> org.apache.solr.cloud.DeleteInactiveReplicaTest.deleteInactiveReplicaTest
>> (failing 100% 30-Mar?
>> org.apache.solr.cloud.DeleteReplicaTest.deleteReplicaOnIndexing
>> org.apache.solr.cloud.DocValuesNotIndexedTest.testGroupingDVOnly
>> org.apache.solr.cloud.FullSolrCloudDistribCmdsTest.test
>> org.apache.solr.cloud.hdfs.HDFSCollectionsAPITest.testDataDirIsNotReused
>> org.apache.solr.cloud.hdfs.StressHdfsTest.test
>> org.apache.solr.cloud.LIRRollingUpdatesTest.testNewLeaderAndMixedReplicas
>> org.apache.solr.cloud.LIRRollingUpdatesTest.testNewLeaderOldReplica
>> org.apache.solr.cloud.LIRRollingUpdatesTest.testNewReplicaOldLeader
>> org.apache.solr.cloud.LIRRollingUpdatesTest.testOldLeaderAndMixedReplicas
>> org.apache.solr.cloud.MoveReplicaHDFSTest.testFailedMove
>> org.apache.solr.cloud.RestartWhileUpdatingTest.test
>> org.apache.solr.cloud.TestPullReplica.testKillPullReplica
>>
>> 

Re: BadApple report this week

2018-04-02 Thread Mark Miller
4th report is coming today in a couple hours. Still a manual snag involved
because I haven’t had a chance to figure out what happened with command
line bitballoon site publishing.

Mark
On Mon, Apr 2, 2018 at 10:53 AM Erick Erickson 
wrote:

> I don't have any candidates this week, I need one more week of
> gathering data. But here's the overall report:
>
> Still collectins Hoss' reports, the 9th will be the first Monday I
> have enough history.
>
> *Remove BadApple notations:
>
>
> * Candidates this week (based on Mark's test results
> only) Will BadApple them Thursday:
>
>
>
> All test failures
>
> junit.framework.TestSuite.org
> .apache.lucene.index.TestIndexWriterThreadsToSegments
> (18K bytes printed, exceeds 8K. just annotate)
> junit.framework.TestSuite.org.apache.solr.client.solrj.io
> .graph.GraphExpressionTest
> junit.framework.TestSuite.org.apache.solr.client.solrj.io
> .stream.SelectWithEvaluatorsTest
> junit.framework.TestSuite.org
> .apache.solr.cloud.api.collections.TestCollectionAPI
> junit.framework.TestSuite.org
> .apache.solr.cloud.ChaosMonkeyNothingIsSafeTest
> junit.framework.TestSuite.org
> .apache.solr.cloud.ConcurrentCreateRoutedAliasTest
> junit.framework.TestSuite.org.apache.solr.cloud.DeleteNodeTest
> junit.framework.TestSuite.org.apache.solr.cloud.RestartWhileUpdatingTest
> junit.framework.TestSuite.org.apache.solr.cloud.ZkControllerTest
> junit.framework.TestSuite.org.apache.solr.handler.TestReplicationHandler
> junit.framework.TestSuite.org
> .apache.solr.metrics.reporters.SolrJmxReporterCloudTest
> junit.framework.TestSuite.org
> .apache.solr.prometheus.collector.SolrCollectorTest
> junit.framework.TestSuite.org
> .apache.solr.prometheus.exporter.SolrExporterTest
> junit.framework.TestSuite.org
> .apache.solr.spelling.suggest.SuggesterWFSTTest
> org.apache.lucene.index.TestBinaryDocValuesUpdates.testTonsOfUpdates
>
> org.apache.lucene.index.TestIndexingSequenceNumbers.testStressConcurrentCommit
>
> org.apache.lucene.index.TestIndexWriterThreadsToSegments.testSegmentCountOnFlushRandom
>
> org.apache.lucene.index.TestIndexWriterWithThreads.testIOExceptionDuringAbortWithThreads
> org.apache.lucene.search.TestInetAddressRangeQueries.testRandomBig
>
> org.apache.solr.client.solrj.impl.CloudSolrClientTest.checkCollectionParameters
>
> org.apache.solr.client.solrj.io.stream.StreamExpressionTest.testGammaDistribution
>
> org.apache.solr.client.solrj.io.stream.StreamExpressionTest.testInnerJoinStream
>
> org.apache.solr.cloud.AliasIntegrationTest.testDeleteOneOfTwoCollectionsAliased
> org.apache.solr.cloud.AliasIntegrationTest.testModifyPropertiesV1
>
> org.apache.solr.cloud.api.collections.ShardSplitTest.testSplitStaticIndexReplication
> org.apache.solr.cloud.api.collections.TestCollectionAPI.test
> org.apache.solr.cloud.autoscaling.ScheduledTriggerTest.testTrigger
> org.apache.solr.cloud.autoscaling.sim.TestComputePlanAction.testNodeAdded
> org.apache.solr.cloud.autoscaling.sim.TestComputePlanAction.testNodeLost
>
> org.apache.solr.cloud.autoscaling.sim.TestPolicyCloud.testCreateCollectionAddShardWithReplicaTypeUsingPolicy
> org.apache.solr.cloud.autoscaling.TriggerIntegrationTest.testMetricTrigger
> org.apache.solr.cloud.ChaosMonkeyNothingIsSafeTest.test
>
> org.apache.solr.cloud.ConcurrentCreateRoutedAliasTest.testConcurrentCreateRoutedAliasMinimal
> org.apache.solr.cloud.CreateRoutedAliasTest.testCollectionNamesMustBeAbsent
> org.apache.solr.cloud.CreateRoutedAliasTest.testRandomRouterNameFails
> org.apache.solr.cloud.CreateRoutedAliasTest.testTimezoneAbsoluteDate
> org.apache.solr.cloud.CreateRoutedAliasTest.testV1
> org.apache.solr.cloud.CreateRoutedAliasTest.testV2
> org.apache.solr.cloud.DeleteInactiveReplicaTest.deleteInactiveReplicaTest
> org.apache.solr.cloud.DeleteInactiveReplicaTest.deleteInactiveReplicaTest
> (failing 100% 30-Mar?
> org.apache.solr.cloud.DeleteReplicaTest.deleteReplicaOnIndexing
> org.apache.solr.cloud.DocValuesNotIndexedTest.testGroupingDVOnly
> org.apache.solr.cloud.FullSolrCloudDistribCmdsTest.test
> org.apache.solr.cloud.hdfs.HDFSCollectionsAPITest.testDataDirIsNotReused
> org.apache.solr.cloud.hdfs.StressHdfsTest.test
> org.apache.solr.cloud.LIRRollingUpdatesTest.testNewLeaderAndMixedReplicas
> org.apache.solr.cloud.LIRRollingUpdatesTest.testNewLeaderOldReplica
> org.apache.solr.cloud.LIRRollingUpdatesTest.testNewReplicaOldLeader
> org.apache.solr.cloud.LIRRollingUpdatesTest.testOldLeaderAndMixedReplicas
> org.apache.solr.cloud.MoveReplicaHDFSTest.testFailedMove
> org.apache.solr.cloud.RestartWhileUpdatingTest.test
> org.apache.solr.cloud.TestPullReplica.testKillPullReplica
>
> org.apache.solr.cloud.TestPullReplicaErrorHandling.testCantConnectToPullReplica
> org.apache.solr.cloud.UnloadDistributedZkTest.test
> org.apache.solr.core.TestCustomStream.testDynamicLoadingCustomStream
>
> org.apache.solr.handler.admin.StatsReloadRaceTest.testParallelReloadAndStats
>
>