synonyms are matched case-insensitive but inserted as case-sensitive
--------------------------------------------------------------------
Key: SOLR-809
URL: https://issues.apache.org/jira/browse/SOLR-809
Project: Solr
Issue Type: Bug
Reporter: Geoffrey Young
Priority: Minor
I had a synonyms file containing this line:
2, to, two, too, II
(the last entry in the list is "II", in capitals)
and an index analyzer configured with:
<filter class="solr.LowerCaseFilterFactory"/>
<filter class="solr.SynonymFilterFactory" synonyms="synonyms.txt"
ignoreCase="true" expand="true"/>
in that order. A query for 'This II That' failed to match the same text in the
index, because the synonym filter appears to have expanded the (already lowercased)
this ii that
to
this 2 to two too II that
and the (also lowercased) query then failed to match.
So it seems that synonym expansion matches tokens case-insensitively but
inserts the synonyms in their original case, when it probably shouldn't.
Alternatively, it could DWIM (do what I mean) and re-add the original token
instead of the matched entry, but that's a bit more roundabout.
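
To make the behaviour described above concrete, here is a toy Python model of it. It is only a sketch and not Solr's actual code: `expand_synonyms` and its `lowercase_output` flag are hypothetical names invented for this illustration, and the model handles single-token synonyms only.

```python
def expand_synonyms(tokens, groups, ignore_case=True, lowercase_output=False):
    """Toy model of expand="true" synonym handling (hypothetical, not Solr code)."""
    out = []
    for tok in tokens:
        key = tok.lower() if ignore_case else tok
        for group in groups:
            keys = [s.lower() for s in group] if ignore_case else group
            if key in keys:
                # Matched: replace the token with every entry in the group.
                replacement = [s.lower() for s in group] if lowercase_output else group
                out.extend(replacement)
                break
        else:
            # No group matched: keep the token unchanged.
            out.append(tok)
    return out


groups = [["2", "to", "two", "too", "II"]]
tokens = "This II That".lower().split()  # stands in for the lowercase filter

# Behaviour as reported: matched ignoring case, inserted as written in the file.
observed = expand_synonyms(tokens, groups)
print(observed)  # ['this', '2', 'to', 'two', 'too', 'II', 'that']

# One possible alternative (an assumption): lowercase the inserted synonyms too.
adjusted = expand_synonyms(tokens, groups, lowercase_output=True)
print(adjusted)  # ['this', '2', 'to', 'two', 'too', 'ii', 'that']
```

In the first result the lowercase token "ii" is gone and only "II" remains, which is consistent with a lowercased query term finding nothing to match.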
As a workaround, I just made sure my entire synonyms file was lowercase.
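
For anyone wanting to apply the same workaround to a larger file, a small script along these lines might do. It is a sketch under assumptions: `lowercase_synonyms` is a hypothetical helper, and it assumes lines starting with `#` are comments that should be left alone.

```python
def lowercase_synonyms(text):
    """Lowercase every rule line of a synonyms file, keeping '#' comments as-is."""
    out = []
    for line in text.splitlines():
        if line.lstrip().startswith("#"):
            out.append(line)          # comment line: leave untouched
        else:
            out.append(line.lower())  # rule line (or blank): lowercase it
    return "\n".join(out)


sample = "# Number words\n2, to, two, too, II"
print(lowercase_synonyms(sample))
```

To rewrite a real file, read it with `open("synonyms.txt").read()`, pass the text through the function, and write the result back; keep a backup first, since lowercasing is not reversible.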