aerox7 wrote:
Hi,
I have a MySQL database in UTF-8. I have a row with "SolÃ¨ne" (solène). I
want to transform this to solene, so I use Solr's
ISOLatin1AccentFilterFactory to perform this task, but it doesn't work?!

I guess that "SolÃ¨ne" is "solène" in UTF-8?! I also set Tomcat to UTF-8, so
normally ISOLatin1AccentFilterFactory should replace the accent...

Any ideas?

I use DataImportHandler.
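
The garbled spelling quoted in the question is what you typically get when UTF-8 bytes are decoded as ISO-8859-1 somewhere between the database and the index. The following is only a minimal Python sketch of that effect, not a diagnosis of this particular setup; the function names are hypothetical and exist purely for illustration. If this is what happened, the accent filter never sees an "è" at all, only the two characters the misread produced.

```python
def misread_utf8_as_latin1(text: str) -> str:
    """Encode text as UTF-8, then (wrongly) decode those bytes as ISO-8859-1."""
    return text.encode("utf-8").decode("latin-1")


def undo_misread(garbled: str) -> str:
    """Reverse the misread: recover the raw bytes, then decode them as UTF-8."""
    return garbled.encode("latin-1").decode("utf-8")


garbled = misread_utf8_as_latin1("Solène")
print(garbled)                # SolÃ¨ne
print(undo_misread(garbled))  # Solène
```

The round trip only succeeds when the text was misread exactly once, so it works as a quick check on a sample value but is not a general repair.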

If the mapping rule "è" to "e" always holds for your field, you can try MappingCharFilter instead of ISOLatin1AccentFilter. Add the following line to mapping-ISOLatin1Accent.txt:

"è" => "e"

and add the following fieldType:

<fieldType name="textCharNorm" class="solr.TextField" positionIncrementGap="100">
  <analyzer>
    <charFilter class="solr.MappingCharFilterFactory" mapping="mapping-ISOLatin1Accent.txt"/>
    <tokenizer class="solr.CharStreamAwareWhitespaceTokenizerFactory"/>
  </analyzer>
</fieldType>

MappingCharFilter and mapping-ISOLatin1Accent.txt are available in the nightly build.
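
To make the order of operations concrete, here is a rough Python sketch of what the analyzer above does conceptually: characters are rewritten first, and only then is the text split on whitespace. It is not Solr code. `MAPPING`, `apply_char_mapping` and `analyze` are hypothetical names, the sample text is made up, and the sketch handles single-character rules only, so it should not be read as a description of how MappingCharFilter matches its rules internally.

```python
# Hypothetical stand-in for the rules listed in mapping-ISOLatin1Accent.txt.
MAPPING = {"è": "e"}


def apply_char_mapping(text: str, mapping: dict) -> str:
    """Rewrite each character that has a rule; leave every other character alone."""
    return "".join(mapping.get(ch, ch) for ch in text)


def analyze(text: str, mapping: dict = MAPPING) -> list:
    """Apply the character mapping first, then split on whitespace."""
    return apply_char_mapping(text, mapping).split()


print(analyze("Solène Dupont"))  # ['Solene', 'Dupont']
```

The field type above contains no lowercasing step, so the sketch leaves case untouched as well.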

Koji

