ShingleFilter skips over trie-shingles if outputUnigram is set to false
-----------------------------------------------------------------------
Key: LUCENE-2199
URL: https://issues.apache.org/jira/browse/LUCENE-2199
Project: Lucene - Java
Issue Type: Bug
Components: contrib/analyzers
Affects Versions: 3.0, 2.9.1, 2.9, 2.4.1, 2.4
Reporter: Simon Willnauer
Fix For: 3.1
Spinoff from http://lucene.markmail.org/message/uq4xdjk26yduvnpa
{quote}
I noticed that if I set outputUnigrams to false it gives me the same output for
maxShingleSize=2 and maxShingleSize=3.
please divide divide this this sentence
when i set maxShingleSize to 4 output is:
please divide please divide this sentence divide this this sentence
I was expecting the output as follows with maxShingleSize=3 and
outputUnigrams=false :
please divide this divide this sentence
{quote}
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]