https://bugzilla.wikimedia.org/show_bug.cgi?id=18764
Summary: Search in yi: should ignore diacritics and identify
ligatures
Product: Wikimedia
Version: unspecified
Platform: All
OS/Version: All
Status: NEW
Severity: enhancement
Priority: Normal
Component: Language setup
AssignedTo: [email protected]
ReportedBy: [email protected]
This relates specifically to all projects using Yiddish (yi).
Yiddish has a number of ligatures. When a search term includes such a ligature
it should be able to identify the corresponding term spelt fully without using
the ligature.
Likewise searches should ignore the presence of diacritics which some writers
use.
Currently Wikimedia projects fail to make this identification. As a result it
is necessary to set up numerous synonyms for pages to catch alternative (but
essentially identical) spellings of the same word. This applies to almost every
word in the language.
By way of comparison, Google search makes the correct identifications. [English
Wikimedia projects successfully convert u/c letters in the middle of words.]
I can supply a list of Unicode codes to be identified.
--
Configure bugmail: https://bugzilla.wikimedia.org/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug.
You are on the CC list for the bug.
_______________________________________________
Wikibugs-l mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l