Ere Maijala created SOLR-11811:
----------------------------------

             Summary: Support for defining a Unicode set filter when using 
ICUFoldingFilter
                 Key: SOLR-11811
                 URL: https://issues.apache.org/jira/browse/SOLR-11811
             Project: Solr
          Issue Type: Improvement
      Security Level: Public (Default Security Level. Issues are Public)
          Components: Schema and Analysis
            Reporter: Ere Maijala
            Priority: Minor


ne a Unicode set filter, ICUFoldingFilterFactory does not support it. A filter 
allows one to e.g. exclude a set of characters from being folded. E.g. for 
Finnish and Swedish the filter could be defined like this:

      <filter class="solr.ICUFoldingFilterFactory" filter="[^åäöÅÄÖ]"/>

(Note: An additional MappingCharFilterFactory for lowercasing the characters 
excluded from folding would be needed for perfect results.)

I'll add a patch that does this similar to ICUNormalizer2FilterFactory. Applies 
at least to master and branch_7x.





--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

Reply via email to