aerox7 wrote:
Hi,
I have a mysql data base in UTF-8. I have a row with "Solène" (solène). I
want to transforme this to solene, so i use Solr
ISOLatin1AccentFilterFactory to perform this task but it dosn't work ?!!
i gess that "Solène" is "solène" in UTF-8 ?! i also set tomcat to utf-8 so
normaly ISOLatin1AccentFilterFactory have to replace the accent .......
any ideas ?
i use DataImportHandler.
If a mapping rule "è" to "e" is always true in your field, you can try
to use MappingCharFilter
instead of ISOLatin1AccentFilter. Add the following line to
mapping-ISOLatin1Accent.txt:
"è" => "e"
and add the following fieldType:
<fieldType name="textCharNorm" class="solr.TextField"
positionIncrementGap="100" >
<analyzer>
<charFilter class="solr.MappingCharFilterFactory"
mapping="mapping-ISOLatin1Accent.txt"/>
<tokenizer class="solr.CharStreamAwareWhitespaceTokenizerFactory"/>
</analyzer>
</fieldType>
MappingCharFilter and mapping-ISOLatin1Accent.txt are in nightly build.
Koji