https://bugzilla.wikimedia.org/show_bug.cgi?id=50167

       Web browser: ---
            Bug ID: 50167
           Summary: VisualEditor: Support unicode equivalence for client
                    side text searches
           Product: VisualEditor
           Version: unspecified
          Hardware: All
                OS: All
            Status: NEW
          Severity: normal
          Priority: Unprioritized
         Component: General
          Assignee: [email protected]
          Reporter: [email protected]
                CC: [email protected], [email protected]
    Classification: Unclassified
   Mobile Platform: ---

Planned features, such as searching for an existing reference by content, will
require us to implement some form of Unicode equivalence
(http://en.wikipedia.org/wiki/Unicode_equivalence).

We will probably want to use NFKD ("Characters are decomposed by compatibility,
and multiple combining characters are arranged in a specific order.") to catch
cases such as 'ﬀ' (U+FB00, the "ff" ligature) === 'ff', and we will probably
want to strip combining characters (i.e. all accents) after decomposition, so
that 'Amelie' === 'Amélie'.
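
A minimal sketch of the folding described above, under stated assumptions: it
uses the built-in String.prototype.normalize (ES2015) rather than unorm, and
the helper names foldForSearch and matchesQuery are hypothetical, not part of
VisualEditor or UnicodeJS. In browsers without a native normalize(), a library
such as unorm would have to supply the NFKD step.

```javascript
// Hypothetical helper: fold text into a form suitable for loose comparison.
function foldForSearch(text) {
  return text
    // Compatibility decomposition: U+FB00 becomes "ff", and
    // U+00E9 ("e" with acute) becomes "e" followed by U+0301.
    .normalize('NFKD')
    // Strip combining marks (Unicode general category M) left by the
    // decomposition. Requires Unicode property escapes (ES2018).
    .replace(/\p{M}/gu, '');
}

// Hypothetical helper: substring search on the folded forms.
function matchesQuery(haystack, query) {
  return foldForSearch(haystack).includes(foldForSearch(query));
}

console.log(foldForSearch('\uFB00') === 'ff');                          // true
console.log(foldForSearch('Am\u00E9lie') === foldForSearch('Amelie'));  // true
console.log(matchesQuery('Le Fabuleux Destin d\u2019Am\u00E9lie', 'amelie')); // false
```

The last call is false only because this sketch does not fold case; whether
searches should also be case-insensitive is a separate decision that the text
above does not address.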

https://github.com/walling/unorm looks like a good library for the job. We may
want to fork it into UnicodeJS.
