[ https://issues.apache.org/jira/browse/LUCENE-5051?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Artem Lukanin updated LUCENE-5051: ---------------------------------- Attachment: LUCENE-5051.patch I've added a test, demonstrating the bug. > Incorrect abbreviation synonyms treating in WordDelimiterFilter > --------------------------------------------------------------- > > Key: LUCENE-5051 > URL: https://issues.apache.org/jira/browse/LUCENE-5051 > Project: Lucene - Core > Issue Type: Bug > Affects Versions: 4.3, 4.3.1 > Reporter: Artem Lukanin > Attachments: incorrect_synonym_treating_sample.patch, > LUCENE-5051.patch > > > If there are 2 abbreviation synonyms in the stream, they are not treated as > synonyms after splitting by dots in WordDelimiterFilter. > Correct treatment: > before and after WordDelimiterFilter: > {code} > tokens: wi fi wireles network > positions: 1 2 1 2 > {code} > Incorrect treatment: > before WordDelimiterFilter: > {code} > tokens: wi.fi. wireles.network. > positions: 1 1 > {code} > after WordDelimiterFilter: > {code} > tokens: wi fi wireles network > positions: 1 2 3 4 > {code} > but should be: > {code} > tokens: wi wireles fi network > positions: 1 1 2 2 > {code} > See a patch for Solr 4.3.1 configs, which demonstrates the bug if "wi.fi. > router" is analyzed in name_syn field. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira --------------------------------------------------------------------- To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org