https://bugzilla.wikimedia.org/show_bug.cgi?id=18764

           Summary: Search in yi: should ignore diacritics and identify
                    ligatures
           Product: Wikimedia
           Version: unspecified
          Platform: All
        OS/Version: All
            Status: NEW
          Severity: enhancement
          Priority: Normal
         Component: Language setup
        AssignedTo: [email protected]
        ReportedBy: [email protected]


This relates specifically to all projects using Yiddish (yi).

Yiddish has a number of ligatures. When a search term includes such a ligature
it should be able to identify the corresponding term spelt fully without using
the ligature.

Likewise searches should ignore the presence of diacritics which some writers
use.

Currently Wikimedia projects fail to make this identification. As a result it
is necessary to set up numerous synonyms for pages to catch alternative (but
essentially identical) spellings of the same word. This applies to almost every
word in the language.

By way of comparison, Google search makes the correct identifications. [English
Wikimedia projects successfully convert u/c letters in the middle of words.]

I can supply a list of Unicode codes to be identified.


-- 
Configure bugmail: https://bugzilla.wikimedia.org/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug.
You are on the CC list for the bug.

_______________________________________________
Wikibugs-l mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l

Reply via email to