https://bugs.freedesktop.org/show_bug.cgi?id=63460
Eike Rathke <[email protected]> changed: What |Removed |Added ---------------------------------------------------------------------------- Status|NEW |ASSIGNED Assignee|[email protected] |[email protected] |desktop.org | --- Comment #4 from Eike Rathke <[email protected]> --- Let's check the current status/assignments and how to transition to the assignments above. Because previously script tags weren't supported we defined a workaround that distinguishes them by different country assignments and we have LANGUAGE_USER_KURDISH_TURKEY 0x0626 /* sublang 0x01, Latin script */ [ku-TR] LANGUAGE_USER_KURDISH_SYRIA 0x0A26 /* sublang 0x02, Latin script */ [ku-SY] LANGUAGE_USER_KURDISH_IRAQ 0x0E26 /* sublang 0x03, Arabic script */ [ku-IQ] LANGUAGE_USER_KURDISH_IRAN 0x1226 /* sublang 0x04, Arabic script */ [ku-IR] First said, 'ku' is an ISO 639-1 macrolanguage code and should be avoided in distinct language assignments where more specific ISO 639-1 or 639-3 codes exist. That should be 'ckb', 'kmr' and 'sdh' instead, where 'ckb' and 'sdh' already imply Arabic script. So according to that PDF Microsoft assigned Central Kurdish 0x0092 /* sublang 0x00, Latin script? */ [ku] Central Kurdish 0x7C92 /* sublang 0x1f, Arabic script */ [ku-Arab] Central Kurdish Iraq 0x0492 /* sublang 0x01, Arabic script */ [ku-Arab-IQ] and no other Kurdish tags. Great :-/ http://msdn.microsoft.com/library/dd318693.aspx mentions _only_ 0x0492 and nothing else, you gotta love'em :-( At least it can be detected that since Windows 7 they seem to use sublanguage IDs in the range 0x1d to 0x1f for primary language with script type and without country. Which doesn't help much because 0x0492 can not be deduced from that and is rather 0x0092 with the default 0x01 sublanguage ID. When defining new language entries for LibreOffice we should always include a country to form a default locale of that language to make it available for locale fallback mechanisms. So here the results would be something like LANGUAGE_KURDISH_ARABIC_IRAQ 0x0492 [ckb-IQ] LANGUAGE_KURDISH_ARABIC_IRAQ 0x0492 [ku-Arab-IQ] LANGUAGE_KURDISH_ARABIC_IRAQ 0x0492 [ku-IQ] LANGUAGE_OBSOLETE_USER_KURDISH_IRAQ 0x0E26 [ckb-IQ] with mappings to the preferred ISO 639-3 code and the ISO 639-1 macrolanguage code and the old values and codes, with #define LANGUAGE_OBSOLETE_USER_KURDISH_IRAQ 0x0E26 #define LANGUAGE_USER_KURDISH_IRAQ LANGUAGE_KURDISH_ARABIC_IRAQ How to transition the other three, ku-TR, ku-SY and ku-IR? ku-TR would be kmr-Latn-TR, Northern Kurdish Latin script. ku-SY would be kmr-Latn-SY, Northern Kurdish Latin script. But ku-IR? Which Kurdish would it be? ckb-IR? The defaults for the new languages would be Kurdish, Northern, Latin script [kmr-Latn-TR] Kurdish, Central, Arabic script [ckb-IQ] Kurdish, Southern, Arabic script [sdh-IQ] References I consulted http://www.ethnologue.com/language/kmr http://www.ethnologue.com/language/ckb http://www.ethnologue.com/language/sdh -- You are receiving this mail because: You are the assignee for the bug.
_______________________________________________ Libreoffice-bugs mailing list [email protected] http://lists.freedesktop.org/mailman/listinfo/libreoffice-bugs
