| cscott added a comment. |
FWIW, detecting sr-el and sr-ec may be easy, but distinguishing zh-hk from zh-cn is *not*. The CJK character block in particular has big overlap problems, dating back to when we were worried about having only 64k characters in unicode.
Could someone edit the phab summary to more clearly indicate what the task is here? @daniel's been working on figuring it out, but I'm still in the dark after reading the whole thread.
TASK DETAIL
EMAIL PREFERENCES
To: cscott
Cc: cscott, Nikola_Smolenski, Nikki, Liuxinyu970226, Filceolaire, Ricordisamoa, daniel, Aklapper, Amire80, GerardM, D3r1ck01, Izno, Wikidata-bugs, aude, Mbch331
Cc: cscott, Nikola_Smolenski, Nikki, Liuxinyu970226, Filceolaire, Ricordisamoa, daniel, Aklapper, Amire80, GerardM, D3r1ck01, Izno, Wikidata-bugs, aude, Mbch331
_______________________________________________ Wikidata-bugs mailing list [email protected] https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
