https://bugzilla.wikimedia.org/show_bug.cgi?id=70899
Bug ID: 70899
Summary: Search box needs some normalization for Arabic Family
languages
Product: MediaWiki
Version: unspecified
Hardware: All
OS: All
Status: NEW
Severity: normal
Priority: Unprioritized
Component: Search
Assignee: [email protected]
Reporter: [email protected]
CC: [email protected]
Web browser: ---
Mobile Platform: ---
We have some langues such as Arabic, Persian, Urdu, Kurdish,... which uses
common characters and they have similar geliphs with different Unicode number
for example:
for ک (Kaf)
ك Arabic U+0643
ڪ Urdu U+06AA
ﻙ Pushtu U+FED9
ﻚ Uyghur U+FEDA
ک Persian U+06A9
for ی (ya)
ی Persian U+06CC
ي Arabic U+064A
ى Urdu U+0649
ۍ Pushtu U+06CD
ې Uyghur U+06D0
for ه (heh)
ہ Pushtu U+06C1
ە Kurdish U+06D5
ه Persian U+0647
we have these characters which have different Unicode number and different
keyboard.
Now many users does not access to Persian keyboard or urdu keyboard by default
in their OS (like windows xp, android (low versions), IOS ,...). so when they
search for an article they can not find it in wikipedia searach box but it is
existing in local characters.
For example if you search at fa.wikipedia for article ويليام شكسپير (characters
are in Arabic ي , ك) you can not find it and the article in Farsi is ویلیام
شکسپیر (characters are in Persian ی , ک).
for farsi please add a possibility for search tool to assume
U+064A or U+0649 or U+06CD or U+06D0 or U+06CC > U+06CC
U+0643 or U+06AA or U+FED9 or U+FEDA > U+06A9
U+06C1 or U+06D5 > U+0647
--
You are receiving this mail because:
You are the assignee for the bug.
You are on the CC list for the bug.
_______________________________________________
Wikibugs-l mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l