Hello,

 

What is the best way to search in a field ignoring accents?

 

The field has the type:

    <fieldType name="text_general_edge_ngram" class="solr.TextField" positionIncrementGap="100">
        <analyzer type="index">
            <tokenizer class="solr.LowerCaseTokenizerFactory"/>
            <filter class="solr.EdgeNGramFilterFactory" minGramSize="2" maxGramSize="15"/>
        </analyzer>
        <analyzer type="query">
            <tokenizer class="solr.LowerCaseTokenizerFactory"/>
            <filter class="solr.EdgeNGramFilterFactory" minGramSize="2" maxGramSize="15"/>
        </analyzer>
    </fieldType>

 

I’ve tried adding the filter <filter class="solr.ASCIIFoldingFilterFactory"/>, but I got some strange results. For example:

Searching for “Mourao” returned:

Mourão -> OK
Monteiro -> not OK
Morais -> not OK
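
One possible reading of these results, offered as a sketch rather than a diagnosis: because the query analyzer above also applies the edge n-gram filter, the query “Mourao” is itself expanded into the grams mo, mou, mour, moura and mourao, and the gram mo is also indexed for Monteiro and Morais. The fragment below shows a variant in which ASCIIFoldingFilterFactory sits after the tokenizer in both analyzers and the n-gram filter is applied at index time only. The field type name text_general_edge_ngram_folded is hypothetical, and the placement of the folding filter is an assumption, since the exact position used in the attempt above is not stated.

```xml
<!-- Hypothetical field type name; a sketch, not a tested configuration. -->
<fieldType name="text_general_edge_ngram_folded" class="solr.TextField" positionIncrementGap="100">
    <analyzer type="index">
        <tokenizer class="solr.LowerCaseTokenizerFactory"/>
        <!-- Assumed placement: fold accents (ã -> a) before generating n-grams. -->
        <filter class="solr.ASCIIFoldingFilterFactory"/>
        <filter class="solr.EdgeNGramFilterFactory" minGramSize="2" maxGramSize="15"/>
    </analyzer>
    <analyzer type="query">
        <tokenizer class="solr.LowerCaseTokenizerFactory"/>
        <!-- Same folding at query time, so "Mourão" and "Mourao" analyze to the same token. -->
        <filter class="solr.ASCIIFoldingFilterFactory"/>
        <!-- No EdgeNGramFilterFactory here: the query term is kept whole. -->
    </analyzer>
</fieldType>
```

With this arrangement a query for “Mourao” would be analyzed to the single token mourao, which matches the gram mourao indexed for “Mourão” but none of the grams indexed for the other two names. Documents would need to be reindexed after changing the index analyzer.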

 

Thanks in advance,

 


Pedro Figueiredo
Senior Engineer



 

 

 
