cscott added a comment.

FWIW, detecting sr-el and sr-ec may be easy, but distinguishing zh-hk from zh-cn is *not*. The CJK character block in particular has big overlap problems, dating back to when we were worried about having only 64k characters in unicode.

Could someone edit the phab summary to more clearly indicate what the task is here? @daniel's been working on figuring it out, but I'm still in the dark after reading the whole thread.


TASK DETAIL
https://phabricator.wikimedia.org/T97882

EMAIL PREFERENCES
https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: cscott
Cc: cscott, Nikola_Smolenski, Nikki, Liuxinyu970226, Filceolaire, Ricordisamoa, daniel, Aklapper, Amire80, GerardM, D3r1ck01, Izno, Wikidata-bugs, aude, Mbch331
_______________________________________________
Wikidata-bugs mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs

Reply via email to