synonyms are matched case-insensitive but inserted as case-sensitive
--------------------------------------------------------------------

                 Key: SOLR-809
                 URL: https://issues.apache.org/jira/browse/SOLR-809
             Project: Solr
          Issue Type: Bug
            Reporter: Geoffrey Young
            Priority: Minor


I had a synonyms file with this line

  2, to, two, too, II

(that's a capital ii last in the list)

and an index with

      <filter class="solr.LowerCaseFilterFactory"/>
      <filter class="solr.SynonymFilterFactory" synonyms="synonyms.txt" 
ignoreCase="true" expand="true"/>

in that order.  a query for 'This II That' failed to match the same text in the 
index because the synonym filter appears to have expanded the (now) lowercase

  this ii that

to

  this 2 to two too II that

and then the (also) lowercased query failed to match.

so it seems that synonym expansion is matching  tokens in a case-insensitive 
manner when it probably shouldn't.  or it could dwim but re-add in the original 
token instead of the matched one, but that's a bit more roundabout.

I just made sure my entire synonyms file was lowercase.


-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to