I think you can use ASCIIFoldingFIlter
http://lucene.apache.org/core/6_2_0/analyzers-common/org/apache/lucene/analysis/miscellaneous/ASCIIFoldingFilter.html
by inserting its factory in your schema.
http://lucene.apache.org/core/6_2_0/analyzers-common/org/apache/lucene/analysis/miscellaneous/ASCIIFoldingFilterFactory.html
I would suggest making a separate field for this so that exact match can be
boosted.
On 10/29/17 10:56 AM, Robert Brown wrote:
Hi,
I have a text field in my index containing extended characters, which I'd like
to match against when searching without the extended characters.
e.g. field contains "Ensō" which I want to match when searching for just
"enso".
My current config for that field (type) is given below:
autoGeneratePhraseQueries="true">
synonyms="index_synonyms.txt" ignoreCase="true" expand="true" />
words="lang/stopwords_en.txt" />
words="lang/stopwords_en.txt" />
Kuro