David Smiley created LUCENE-7865:
------------------------------------

             Summary: TestICUTokenizer.testRandomHugeStrings failure
                 Key: LUCENE-7865
                 URL: https://issues.apache.org/jira/browse/LUCENE-7865
             Project: Lucene - Core
          Issue Type: Bug
          Components: modules/analysis
            Reporter: David Smiley


This is reproducible:
{{ant test  -Dtestcase=TestICUTokenizer -Dtests.method=testRandomHugeStrings 
-Dtests.seed=E673DE09BC7FA047 -Dtests.slow=true -Dtests.locale=zh-SG 
-Dtests.timezone=Pacific/Johnston -Dtests.asserts=true 
-Dtests.file.encoding=ISO-8859-1}}

{noformat}
   [junit4] ERROR   0.92s | TestICUTokenizer.testRandomHugeStrings <<<
   [junit4]    > Throwable #1: java.lang.ArrayIndexOutOfBoundsException: 170
   [junit4]    >        at 
__randomizedtesting.SeedInfo.seed([E673DE09BC7FA047:7E50B9CAE2091C0F]:0)
   [junit4]    >        at 
org.apache.lucene.analysis.icu.segmentation.CompositeBreakIterator.getBreakIterator(CompositeBreakIterator.java:123)
   [junit4]    >        at 
org.apache.lucene.analysis.icu.segmentation.CompositeBreakIterator.next(CompositeBreakIterator.java:62)
   [junit4]    >        at 
org.apache.lucene.analysis.icu.segmentation.ICUTokenizer.incrementTokenBuffer(ICUTokenizer.java:210)
   [junit4]    >        at 
org.apache.lucene.analysis.icu.segmentation.ICUTokenizer.incrementToken(ICUTokenizer.java:104)
   [junit4]    >        at 
org.apache.lucene.analysis.icu.ICUNormalizer2Filter.incrementToken(ICUNormalizer2Filter.java:80)
   [junit4]    >        at 
org.apache.lucene.analysis.BaseTokenStreamTestCase.checkAnalysisConsistency(BaseTokenStreamTestCase.java:731)
   [junit4]    >        at 
org.apache.lucene.analysis.BaseTokenStreamTestCase.checkRandomData(BaseTokenStreamTestCase.java:642)
   [junit4]    >        at 
org.apache.lucene.analysis.BaseTokenStreamTestCase.checkRandomData(BaseTokenStreamTestCase.java:540)
   [junit4]    >        at 
org.apache.lucene.analysis.BaseTokenStreamTestCase.checkRandomData(BaseTokenStreamTestCase.java:453)
   [junit4]    >        at 
org.apache.lucene.analysis.icu.segmentation.TestICUTokenizer.testRandomHugeStrings(TestICUTokenizer.java:276)
   [junit4]    >        at java.lang.Thread.run(Thread.java:745)
   [junit4]   2> NOTE: test params are: codec=Asserting(Lucene70): 
{dummy=PostingsFormat(name=Memory)}, docValues:{}, maxPointsInLeafNode=582, 
maxMBSortInHeap=5.4626768750424, sim=RandomSimilarity(queryNorm=true): {}, 
locale=zh-SG, timezone=Pacific/Johnston
   [junit4]   2> NOTE: Mac OS X 10.12.5 x86_64/Oracle Corporation 1.8.0_121 
(64-bit)/cpus=8,threads=1,free=122270736,total=164102144
   [junit4]   2> NOTE: All tests run in this JVM: [TestICUTokenizer]
{noformat}

Searching my email shows this test has failed a couple times by CI servers in 
the past.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to