https://bugzilla.wikimedia.org/show_bug.cgi?id=50167
Web browser: ---
Bug ID: 50167
Summary: VisualEditor: Support unicode equivalence for client
side text searches
Product: VisualEditor
Version: unspecified
Hardware: All
OS: All
Status: NEW
Severity: normal
Priority: Unprioritized
Component: General
Assignee: [email protected]
Reporter: [email protected]
CC: [email protected], [email protected]
Classification: Unclassified
Mobile Platform: ---
Planned features, such as searching for an existing reference by content, will
require us to implement some http://en.wikipedia.org/wiki/Unicode_equivalence .
We will probably want to use NFKD ("Characters are decomposed by compatibility,
and multiple combining characters are arranged in a specific order.") to catch
cases such as 'ff' === 'ff', and we will probably want to strip combining
characters (i.e. all accents), so that 'Amelie' === 'Amélie'.
https://github.com/walling/unorm looks like a good library for the job. We may
want to fork it into UnicodeJS.
--
You are receiving this mail because:
You are on the CC list for the bug.
_______________________________________________
Wikibugs-l mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l