https://bugzilla.wikimedia.org/show_bug.cgi?id=70561

--- Comment #9 from Nik Everett <[email protected]> ---
I've added a proposal to flatten all accented characters into non-accented ones
for prefix search and exact title matches.  It'll require rebuilding the index
but that is no big deal.

Note:  I found out where the other normalization comes from.  The French
stemmer we use for inexact matches performs the following mappings:
'à', 'á', 'â' -> 'a'
'ô' -> 'o'
'è', 'é', 'ê' -> 'e'
'ù', 'û' -> 'u'
'î' -> 'i'
'ç' -> 'c'

I could, if you believe it is more correct, only perform those mappings for the
prefix and exact title matching.

-- 
You are receiving this mail because:
You are the assignee for the bug.
You are on the CC list for the bug.
_______________________________________________
Wikibugs-l mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l

Reply via email to