[
https://issues.apache.org/jira/browse/LUCENE-7442?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15481940#comment-15481940
]
Steve Rowe commented on LUCENE-7442:
------------------------------------
Another failure from my Jenkins caused by MinHashFilter not validating its args:
{noformat}
[junit4] Suite: org.apache.lucene.analysis.core.TestRandomChains
[junit4] 2> NOTE: reproduce with: ant test -Dtestcase=TestRandomChains
-Dtests.method=testRandomChainsWithLargeStrings -Dtests.seed=4269222C7AE3CDA1
-Dtests.slow=true -Dtests.locale=lt -Dtests.timezone=America/Fortaleza
-Dtests.asserts=true -Dtests.file.encoding=UTF-8
[junit4] ERROR 0.84s J8 |
TestRandomChains.testRandomChainsWithLargeStrings <<<
[junit4] > Throwable #1: java.lang.ArithmeticException: / by zero
[junit4] > at
__randomizedtesting.SeedInfo.seed([4269222C7AE3CDA1:28329D3D23ADED52]:0)
[junit4] > at
org.apache.lucene.analysis.minhash.MinHashFilter.<init>(MinHashFilter.java:121)
[junit4] > at
java.lang.reflect.Constructor.newInstance(Constructor.java:423)
[junit4] > at
org.apache.lucene.analysis.core.TestRandomChains$MockRandomAnalyzer.createComponent(TestRandomChains.java:713)
[junit4] > at
org.apache.lucene.analysis.core.TestRandomChains$MockRandomAnalyzer.newFilterChain(TestRandomChains.java:823)
[junit4] > at
org.apache.lucene.analysis.core.TestRandomChains$MockRandomAnalyzer.toString(TestRandomChains.java:702)
[junit4] > at java.lang.String.valueOf(String.java:2994)
[junit4] > at
java.lang.StringBuilder.append(StringBuilder.java:131)
[junit4] > at
org.apache.lucene.analysis.core.TestRandomChains.testRandomChainsWithLargeStrings(TestRandomChains.java:962)
[junit4] > at java.lang.Thread.run(Thread.java:745)
[junit4] 2> NOTE: test params are: codec=Asserting(Lucene62),
sim=ClassicSimilarity, locale=lt, timezone=America/Fortaleza
[junit4] 2> NOTE: Linux 4.1.0-custom2-amd64 amd64/Oracle Corporation
1.8.0_77 (64-bit)/cpus=16,threads=1,free=382482496,total=519569408
[junit4] 2> NOTE: All tests run in this JVM:
[TestReverseStringFilterFactory, TestLengthFilter, TestGermanStemFilter,
TestKStemmer, TestNeedAffix, TestCapitalizationFilter,
TestScandinavianNormalizationFilter, TestFlagNum, TestStopAnalyzer,
TestLatvianStemFilterFactory, TestGermanLightStemFilter,
TestPatternTokenizerFactory, TestDanishAnalyzer, TestHindiFilters,
TestUnicodeWhitespaceTokenizer, TestThaiTokenizerFactory, TestThaiAnalyzer,
TestRandomChains]
[junit4] Completed [114/272 (1!)] on J8 in 4.79s, 2 tests, 1 error <<<
FAILURES!
{noformat}
> MinHashFilter.FixedSizeTreeSet.add() calls TreeSet.last() without first
> testing for emptiness, under which condition NoSuchElementException is thrown
> -----------------------------------------------------------------------------------------------------------------------------------------------------
>
> Key: LUCENE-7442
> URL: https://issues.apache.org/jira/browse/LUCENE-7442
> Project: Lucene - Core
> Issue Type: Bug
> Components: modules/analysis
> Reporter: Steve Rowe
> Attachments: LUCENE-7442.patch
>
>
> My Jenkins found this reproducing branch_6x seed:
> {noformat}
> [junit4] Suite: org.apache.lucene.analysis.core.TestRandomChains
> [junit4] 2> Exception from random analyzer:
> [junit4] 2> charfilters=
> [junit4] 2> tokenizer=
> [junit4] 2> org.apache.lucene.analysis.standard.StandardTokenizer()
> [junit4] 2> filters=
> [junit4] 2>
> org.apache.lucene.analysis.minhash.MinHashFilter(ValidatingTokenFilter@6ae99167
>
> term=,bytes=[],startOffset=0,endOffset=0,positionIncrement=1,positionLength=1,type=word,
> 5, 5, -3, true)
> [junit4] 2>
> org.apache.lucene.analysis.bg.BulgarianStemFilter(ValidatingTokenFilter@40844352
>
> term=,bytes=[],startOffset=0,endOffset=0,positionIncrement=1,positionLength=1,type=word,keyword=false)
> [junit4] 2> offsetsAreCorrect=true
> [junit4] 2> NOTE: reproduce with: ant test -Dtestcase=TestRandomChains
> -Dtests.method=testRandomChainsWithLargeStrings -Dtests.seed=4733E677EBDC28FC
> -Dtests.slow=true -Dtests.locale=ar-OM
> -Dtests.timezone=Atlantic/South_Georgia -Dtests.asserts=true
> -Dtests.file.encoding=UTF-8
> [junit4] ERROR 3.18s J4 |
> TestRandomChains.testRandomChainsWithLargeStrings <<<
> [junit4] > Throwable #1: java.util.NoSuchElementException
> [junit4] > at
> __randomizedtesting.SeedInfo.seed([4733E677EBDC28FC:2D685966B292080F]:0)
> [junit4] > at java.util.TreeMap.key(TreeMap.java:1323)
> [junit4] > at java.util.TreeMap.lastKey(TreeMap.java:297)
> [junit4] > at java.util.TreeSet.last(TreeSet.java:401)
> [junit4] > at
> org.apache.lucene.analysis.minhash.MinHashFilter$FixedSizeTreeSet.add(MinHashFilter.java:325)
> [junit4] > at
> org.apache.lucene.analysis.minhash.MinHashFilter.incrementToken(MinHashFilter.java:159)
> [junit4] > at
> org.apache.lucene.analysis.ValidatingTokenFilter.incrementToken(ValidatingTokenFilter.java:67)
> [junit4] > at
> org.apache.lucene.analysis.bg.BulgarianStemFilter.incrementToken(BulgarianStemFilter.java:48)
> [junit4] > at
> org.apache.lucene.analysis.ValidatingTokenFilter.incrementToken(ValidatingTokenFilter.java:67)
> [junit4] > at
> org.apache.lucene.analysis.BaseTokenStreamTestCase.checkResetException(BaseTokenStreamTestCase.java:405)
> [junit4] > at
> org.apache.lucene.analysis.BaseTokenStreamTestCase.checkRandomData(BaseTokenStreamTestCase.java:510)
> [junit4] > at
> org.apache.lucene.analysis.core.TestRandomChains.testRandomChainsWithLargeStrings(TestRandomChains.java:959)
> [junit4] > at java.lang.Thread.run(Thread.java:745)
> [junit4] 2> NOTE: test params are: codec=Asserting(Lucene62):
> {dummy=Lucene50(blocksize=128)}, docValues:{}, maxPointsInLeafNode=252,
> maxMBSortInHeap=5.297834377897023, sim=ClassicSimilarity, locale=ar-OM,
> timezone=Atlantic/South_Georgia
> [junit4] 2> NOTE: Linux 4.1.0-custom2-amd64 amd64/Oracle Corporation
> 1.8.0_77 (64-bit)/cpus=16,threads=1,free=395080152,total=465567744
> [junit4] 2> NOTE: All tests run in this JVM:
> [TestDecimalDigitFilterFactory, TestMultiWordSynonyms,
> TestReversePathHierarchyTokenizer, TestDoubleEscape,
> TestHunspellStemFilterFactory, TestArabicNormalizationFilter,
> TestUAX29URLEmailAnalyzer, TestSwedishLightStemFilterFactory,
> TestBulgarianStemmer, TestASCIIFoldingFilter,
> TestDelimitedPayloadTokenFilterFactory, TestIndonesianStemmer, TestCircumfix,
> EdgeNGramTokenFilterTest, TestPatternTokenizer,
> TestScandinavianFoldingFilter, TestIgnore, TestRandomChains]
> [junit4] Completed [130/272 (1!)] on J4 in 9.85s, 2 tests, 1 error <<<
> FAILURES!
> {noformat}
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]