[jira] [Updated] (LUCENE-7476) Fix transient failure in JapaneseNumberFilter run from TestFactories

Michael McCandless (JIRA) Mon, 10 Oct 2016 07:43:36 -0700

     [ 
https://issues.apache.org/jira/browse/LUCENE-7476?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]


Michael McCandless updated LUCENE-7476:
---------------------------------------
    Attachment: LUCENE-7476.patch

OK thanks [~andyhind], I think this is in fact a real bug!

It happens in Eclipse but not ant because when {{TestFactories}} runs from 
Eclipse, the {{CLASSPATH}} apparently includes {{JapaneseNumberFilter}}, 
whereas under ant it does not, which is annoying.  (There is a TODO about this 
in {{TestFactories}}.)

I'm attaching a patch (based on 6.x) that copies {{TestFactories.java}} into 
the kuromoji module, and then I can reproduce this failure:

{noformat}
.TEST FAIL: useCharFilter=false text='alari/power.html 301'
ENOTE: reproduce with: ant test  -Dtestcase=TestFactories -Dtests.method=test 
-Dtests.seed=4139A5350B1C10BE -Dtests.slow=true 
-Dtests.linedocsfile=/lucenedata/hudson.enwiki.random.lines.txt.fixed 
-Dtests.locale=cs -Dtests.timezone=America/Paramaribo -Dtests.asserts=true 
-Dtests.file.encoding=UTF-8
NOTE: test params are: 
codec=FastCompressingStoredFields(storedFieldsFormat=CompressingStoredFieldsFormat(compressionMode=FAST,
 chunkSize=6, maxDocsPerChunk=1004, blockSize=452), 
termVectorsFormat=CompressingTermVectorsFormat(compressionMode=FAST, 
chunkSize=6, blockSize=452)), sim=ClassicSimilarity, locale=cs, 
timezone=America/Paramaribo
NOTE: Linux 4.4.0-38-generic amd64/Oracle Corporation 1.8.0_92 
(64-bit)/cpus=8,threads=1,free=415218864,total=514850816
NOTE: All tests run in this JVM: [TestFactories]

Time: 2.298
There was 1 failure:
1) test(org.apache.lucene.analysis.ja.TestFactories)
java.lang.IllegalStateException: incrementToken() called while in wrong state: 
INCREMENT_FALSE
        at 
__randomizedtesting.SeedInfo.seed([4139A5350B1C10BE:C96D9AEFA5E07D46]:0)
        at org.apache.lucene.analysis.MockTokenizer.fail(MockTokenizer.java:125)
        at 
org.apache.lucene.analysis.MockTokenizer.incrementToken(MockTokenizer.java:136)
        at 
org.apache.lucene.analysis.ja.JapaneseNumberFilter.incrementToken(JapaneseNumberFilter.java:152)
        at 
org.apache.lucene.analysis.BaseTokenStreamTestCase.checkAnalysisConsistency(BaseTokenStreamTestCase.java:716)
        at 
org.apache.lucene.analysis.BaseTokenStreamTestCase.checkRandomData(BaseTokenStreamTestCase.java:627)
        at 
org.apache.lucene.analysis.BaseTokenStreamTestCase.checkRandomData(BaseTokenStreamTestCase.java:525)
        at 
org.apache.lucene.analysis.ja.TestFactories.doTestTokenFilter(TestFactories.java:104)
        at 
org.apache.lucene.analysis.ja.TestFactories.test(TestFactories.java:57)
{noformat}
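
For anyone unfamiliar with the check that fires in the trace above: 
{{MockTokenizer}} throws if {{incrementToken()}} is called again after it has 
already returned false. Below is a minimal, library-free sketch of how a 
look-ahead filter could trip such a check when the text ends in a number. It 
is not the {{JapaneseNumberFilter}} code and not the attached patch. All names 
in it ({{Source}}, {{StrictSource}}, {{MergingFilter}}, {{rememberEnd}}) are 
hypothetical, and the merging rule (concatenate adjacent digit tokens) is only 
a stand-in.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Iterator;
import java.util.List;

public class TokenStreamContractSketch {

    /** Hypothetical stand-in for a token stream: next token, or null at end. */
    interface Source {
        String next();
    }

    /** Strict source: any call after it has reported the end is a consumer bug. */
    static final class StrictSource implements Source {
        private final Iterator<String> tokens;
        private boolean returnedEnd = false;

        StrictSource(List<String> tokens) {
            this.tokens = tokens.iterator();
        }

        @Override
        public String next() {
            if (returnedEnd) {
                throw new IllegalStateException("next() called after end of stream");
            }
            if (tokens.hasNext()) {
                return tokens.next();
            }
            returnedEnd = true;
            return null;
        }
    }

    /** Look-ahead filter that concatenates adjacent digit-only tokens. */
    static final class MergingFilter implements Source {
        private final Source input;
        private final boolean rememberEnd; // false reproduces the lost-state bug
        private String pending;            // non-digit token consumed while looking ahead
        private boolean exhausted = false; // set once the input has reported its end

        MergingFilter(Source input, boolean rememberEnd) {
            this.input = input;
            this.rememberEnd = rememberEnd;
        }

        @Override
        public String next() {
            if (pending != null) {
                String token = pending;
                pending = null;
                return token;
            }
            // The guard: never touch the input again once it has ended.
            if (rememberEnd && exhausted) {
                return null;
            }
            String token = input.next();
            if (token == null) {
                exhausted = true;
                return null;
            }
            if (!isDigits(token)) {
                return token;
            }
            StringBuilder number = new StringBuilder(token);
            while (true) {
                String ahead = input.next();
                if (ahead == null) {
                    exhausted = true; // end was reached during look-ahead
                    break;
                }
                if (!isDigits(ahead)) {
                    pending = ahead;  // hand it out on the next call
                    break;
                }
                number.append(ahead);
            }
            return number.toString();
        }

        private static boolean isDigits(String s) {
            return !s.isEmpty() && s.chars().allMatch(Character::isDigit);
        }
    }

    /** Consumes a source until it reports the end, as a test harness would. */
    static List<String> drain(Source source) {
        List<String> out = new ArrayList<>();
        for (String t = source.next(); t != null; t = source.next()) {
            out.add(t);
        }
        return out;
    }

    public static void main(String[] args) {
        List<String> input = Arrays.asList("power", "3", "01");
        System.out.println(drain(new MergingFilter(new StrictSource(input), true)));
        try {
            drain(new MergingFilter(new StrictSource(input), false));
        } catch (IllegalStateException e) {
            System.out.println("without the guard: " + e.getMessage());
        }
    }
}
```

In this sketch the failure only shows up when the last token is a number, 
because only then does the look-ahead consume the end of the input. Inputs 
that end in a non-number pass either way, which would be consistent with a 
failure that appears only for some random test strings.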

> Fix transient failure in JapaneseNumberFilter run from TestFactories
> --------------------------------------------------------------------
>
>                 Key: LUCENE-7476
>                 URL: https://issues.apache.org/jira/browse/LUCENE-7476
>             Project: Lucene - Core
>          Issue Type: Bug
>          Components: core/other
>    Affects Versions: 6.2.1
>            Reporter: Andy Hind
>            Priority: Trivial
>         Attachments: LUCENE-7476.patch, LUCENE-7476.patch
>
>
> Repeatedly running TestFactories shows this test to fail ~10% of the time.
> I believe the fix is trivial and related to losing the state of the 
> underlying input stream when testing some analyzer life cycle flows. 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

