Hi Egorlex, Set the tokenSeparator to "" and ShingleFilter will concatenate all shingles without whitespace. Keep in mind, this will greatly increase the size of the index so it might not be a good idea to concatenate all pairs of words.
If you are looking for finding "similarissues" with "similar issues" (and vice versa) you might want to check out DictionaryCompoundWordTokenFilter and/or HyphenationCompoundWordTokenFilter. Although English hardly uses compound words, the token filters still do their job quite nicely. Regards, Markus -----Original message----- > From:egorlex <egor...@gmail.com> > Sent: Wednesday 20th June 2018 11:42 > To: java-user@lucene.apache.org > Subject: Re: Lucene same search result for worlds with and without spaces > > Thanks for replay! > > sorry, could you help a little, according to example > > "given the phrase “Shingles is a viral disease”, a shingle filter might > produce: > > Shingles is > is a > a viral > viral disease > " > > I do not quite understand how this ShingleFilter can turn "similarissues" > into "similar issues" > > Thanks! > > > > -- > Sent from: http://lucene.472066.n3.nabble.com/Lucene-Java-Users-f532864.html > > --------------------------------------------------------------------- > To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org > For additional commands, e-mail: java-user-h...@lucene.apache.org > > --------------------------------------------------------------------- To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org For additional commands, e-mail: java-user-h...@lucene.apache.org