https://bugzilla.wikimedia.org/show_bug.cgi?id=70561
--- Comment #9 from Nik Everett <[email protected]> --- I've added a proposal to flatten all accented characters into non-accented ones for prefix search and exact title matches. It'll require rebuilding the index but that is no big deal. Note: I found out where the other normalization comes from. The French stemmer we use for inexact matches performs the following mappings: 'à', 'á', 'â' -> 'a' 'ô' -> 'o' 'è', 'é', 'ê' -> 'e' 'ù', 'û' -> 'u' 'î' -> 'i' 'ç' -> 'c' I could, if you believe it is more correct, only perform those mappings for the prefix and exact title matching. -- You are receiving this mail because: You are the assignee for the bug. You are on the CC list for the bug. _______________________________________________ Wikibugs-l mailing list [email protected] https://lists.wikimedia.org/mailman/listinfo/wikibugs-l
