David Smiley created LUCENE-7865:
------------------------------------
Summary: TestICUTokenizer.testRandomHugeStrings failure
Key: LUCENE-7865
URL: https://issues.apache.org/jira/browse/LUCENE-7865
Project: Lucene - Core
Issue Type: Bug
Components: modules/analysis
Reporter: David Smiley
This is reproducible:
{{ant test -Dtestcase=TestICUTokenizer -Dtests.method=testRandomHugeStrings -Dtests.seed=E673DE09BC7FA047 -Dtests.slow=true -Dtests.locale=zh-SG -Dtests.timezone=Pacific/Johnston -Dtests.asserts=true -Dtests.file.encoding=ISO-8859-1}}
{noformat}
[junit4] ERROR 0.92s | TestICUTokenizer.testRandomHugeStrings <<<
[junit4] > Throwable #1: java.lang.ArrayIndexOutOfBoundsException: 170
[junit4] > at __randomizedtesting.SeedInfo.seed([E673DE09BC7FA047:7E50B9CAE2091C0F]:0)
[junit4] > at org.apache.lucene.analysis.icu.segmentation.CompositeBreakIterator.getBreakIterator(CompositeBreakIterator.java:123)
[junit4] > at org.apache.lucene.analysis.icu.segmentation.CompositeBreakIterator.next(CompositeBreakIterator.java:62)
[junit4] > at org.apache.lucene.analysis.icu.segmentation.ICUTokenizer.incrementTokenBuffer(ICUTokenizer.java:210)
[junit4] > at org.apache.lucene.analysis.icu.segmentation.ICUTokenizer.incrementToken(ICUTokenizer.java:104)
[junit4] > at org.apache.lucene.analysis.icu.ICUNormalizer2Filter.incrementToken(ICUNormalizer2Filter.java:80)
[junit4] > at org.apache.lucene.analysis.BaseTokenStreamTestCase.checkAnalysisConsistency(BaseTokenStreamTestCase.java:731)
[junit4] > at org.apache.lucene.analysis.BaseTokenStreamTestCase.checkRandomData(BaseTokenStreamTestCase.java:642)
[junit4] > at org.apache.lucene.analysis.BaseTokenStreamTestCase.checkRandomData(BaseTokenStreamTestCase.java:540)
[junit4] > at org.apache.lucene.analysis.BaseTokenStreamTestCase.checkRandomData(BaseTokenStreamTestCase.java:453)
[junit4] > at org.apache.lucene.analysis.icu.segmentation.TestICUTokenizer.testRandomHugeStrings(TestICUTokenizer.java:276)
[junit4] > at java.lang.Thread.run(Thread.java:745)
[junit4] 2> NOTE: test params are: codec=Asserting(Lucene70): {dummy=PostingsFormat(name=Memory)}, docValues:{}, maxPointsInLeafNode=582, maxMBSortInHeap=5.4626768750424, sim=RandomSimilarity(queryNorm=true): {}, locale=zh-SG, timezone=Pacific/Johnston
[junit4] 2> NOTE: Mac OS X 10.12.5 x86_64/Oracle Corporation 1.8.0_121 (64-bit)/cpus=8,threads=1,free=122270736,total=164102144
[junit4] 2> NOTE: All tests run in this JVM: [TestICUTokenizer]
{noformat}
Searching my email shows that this test has failed a couple of times on CI servers in
the past.